Abstract

In recent years, there has been strong interest in applying machine learning techniques to the path synthesis of linkage mechanisms. However, progress has been stymied by a scarcity of high-quality datasets. In this paper, we present a comprehensive dataset of nearly three million samples of four-, six-, and eight-bar linkage mechanisms with open and closed coupler curves. Current machine learning approaches to path synthesis also lack standardized metrics for evaluating outcomes. To address this gap, we propose six metrics for quantifying results, providing a foundational framework for researchers to compare new models with existing ones. To demonstrate the utility of our dataset, we also present a variational autoencoder (VAE) based model combined with a k-nearest neighbor search. Finally, we provide example mechanisms that generate various curves, along with a numerical evaluation of the proposed metrics.