Background
Croatia is a geographically small country with a remarkable diversity of cultivated and spontaneous grapevines. Local germplasm has been characterised by microsatellite markers, but a detailed analysis based on single nucleotide polymorphisms (SNPs) is still lacking. Here we characterise the genetic diversity of 149 accessions from three germplasm repositories and four natural sites using 516,101 SNPs to identify complete parent-offspring trios and their relations with spontaneous populations, offering a proof-of-concept for the use of reduced-representation genome sequencing in population genetics and genome-wide association studies (GWAS).

Results
Principal component analysis revealed a clear discontinuity between cultivated (V. vinifera subsp. sativa) and spontaneous grapevines, supporting the notion that the latter represent local populations of the wild progenitor (V. vinifera subsp. sylvestris). ADMIXTURE identified three ancestry components. One sativa component predominates in cultivars grown in northern Adriatic Croatia and Continental Croatia, the other in cultivars grown in Dalmatia (i.e. central and southern Adriatic Croatia). A sylvestris component, which is predominant in accessions from spontaneous populations, is a minor ancestry component in cultivated accessions. TREEMIX provided evidence of unidirectional migration from the vineyards to natural sites, suggesting that gene flow has gone preferentially from the introduced domesticated germplasm into local wild populations rather than vice versa. Identity-by-descent analysis revealed an extensive kinship network, including 14 complete parent-offspring trios involving only cultivated accessions and six full-sibling relationships, and invalidated a presumed pedigree of ‘Plavac Mali’, one of the most important varieties in Croatia.
Despite this strong population structure, significant associations were found between 143 SNPs and berry skin colour and between 2 SNPs and leaf hairiness, across two previously known genomic regions.

Conclusions
The clear genetic separation between Croatian cultivars and sylvestris rules out the hypothesis that those cultivars originated from local domestication events. On the other hand, the evidence of crop-to-wild gene flow signals the need for the urgent adoption of conservation strategies that preserve the residual genetic integrity of wild relatives. The use of this reduced-representation genome sequencing protocol in grapevine enables accurate pedigree reconstruction and can be recommended for GWAS experiments.