Along the last decades, the genetic evaluation methodology has evolved, improving breeding value estimates. Many breeding programmes have historical phenotypic records and large number of generations, but to make use of them could result in more inconveniences than benefits. In this study, the prediction ability of genotyped young animals was assessed by simultaneously evaluating the removal of historical data, two pedigree deepness and two methodologies (traditional BLUP and single−step genomic BLUP or ssGBLUP), using milk yield records of 40 years of three Latxa dairy sheep populations. The linear regression method was used to compare predictions of young rams before and after progeny testing, with six cut-off points, by intervals of 4 years (from 1992 to 2012), and statistics of ratio of accuracies, bias, and dispersion were calculated. The prediction accuracy of selection candidates, when genomic information was included, was the highest in all Latxa populations (between 0.54 and 0.69 with full data set). Nevertheless, the deletion of historical phenotypic data resulted on moderate accuracy gain in the bigger data size populations (mean gain 2.5%), and the smaller population took advantage of a moderate data deletion (2.7% gain by removing data until 2004), reducing accuracy when more records were removed. The bias of validation individuals was lower when the breeding value was predicted based on genomic information (between 2.1 and 13.9), being lower when the biggest amount of data was deleted in the bigger data size populations (5.2% reduction), and the smaller population was benefited from data deletion between 1996 and 2008 (3.8% bias reduction). Meanwhile, the slope of estimated genetic trend was lower when less data were included, and an overestimation of the unknown parent group estimates was observed. The results indicated that ssGBLUP evaluations were outstanding, compared with traditional BLUP evaluations, while the depth of pedigree had a very small influence, and deletion of historical phenotypic data was beneficial. Thus, Latxa routine genetic evaluations would benefit from truncating phenotypic records between 2000 and 2004, the use of two pedigree generations and the implementation of ssGBLUP methodology.