Abstract Airborne laser scanning (ALS) data are routinely used to estimate and map structure-related forest inventory variables. The further development, refinement and evaluation of methods to derive forest inventory variables from ALS data require extensive datasets of forest stand information on an individual tree-level and corresponding ALS data. A cost-efficient method to obtain such datasets is the combination of virtual forest stands with a laser scanning simulator. We present an approach to simulate ALS data of forest stands by combining forest inventory information, a tree point cloud database and the laser scanning simulation framework HELIOS++. ALS data of six 1-ha plots were simulated and compared to real ALS data of these plots. The synthetic 3D representations of the forest stands were composed of real laser scanning point clouds of individual trees that were acquired by an uncrewed aerial vehicle (UAV), and, for comparison, simplified tree models with cylindrical stems and spheroidal crowns. The simulated ALS point clouds of the six plots were compared with the real point clouds based on canopy cover, height distribution of returns and several other point cloud metrics. In addition, the performance of biomass models trained using these synthetic data was evaluated. The comparison revealed that, in general, both the real tree models and the simplified tree models can be used to generate synthetic data. The results differed for the different study sites and depending on whether all returns or only first returns were considered. The measure of canopy cover was better represented by the data of the simplified tree models, whereas the height distribution of the returns was – for most of the study sites – better represented by the real tree model data. Training biomass models with metrics derived from the real tree model data led to an overestimation of biomass, while using metrics of the simplified tree model data resulted in an underestimation of biomass. Still, the accuracy of models trained with simulated data was only slightly lower compared to models trained with real ALS data. Our results suggest that the presented approach can be used to generate ALS data that are sufficiently realistic for many applications. The synthetic data may be used to develop new or refine existing ALS-based forest inventory methods, to systematically investigate the relationship between point cloud metrics and forest inventory variables and to analyse how this relationship is affected by laser scanning acquisition settings and field reference data characteristics.