- New
- Research Article
- 10.3897/bdj.14.e181774
- Jan 19, 2026
- Biodiversity Data Journal
- Fadil Millaku + 3 more
Targeted field surveys, during the 2025 vegetative period, across a range of natural habitats revealed four vascular plant taxa previously unconfirmed for the flora of Kosovo: Lycopodium annotinum subsp. annotinum, Najas marina , Cyperus fuscus and C. rotundus . Specimens were collected from upper-montane spruce forests, in lake and littoral zones and serpentine foothill streams and were identified using standard regional floras; all vouchers are deposited in the Herbarium of the University of Prishtina (UPH). The discovery of L. annotinum subsp. annotinum adds a new boreal-montane element to the national Lycopodiaceae, while N. marina represents the first documented member of the genus Najas in the country. Likewise, C. fuscus and C. rotundus are the first confirmed representatives of Cyperus , demonstrating that cyperaceous diversity in Kosovo has been significantly underestimated. These findings highlight the ecological and biogeographical complexity of the region, emphasise the importance of wetlands, aquatic systems, upper-montane coniferous forests and serpentine landscapes for harbouring overlooked taxa and underscore the need for continued systematic floristic surveys to refine the national inventory and support future conservation and biodiversity research. Four new species are being reported for the flora of Kosovo: Lycopodium annotinum subsp. annotinum, Najas marina , Cyperus fuscus and C. rotundus .
- New
- Research Article
- 10.3897/bdj.14.e182222
- Jan 19, 2026
- Biodiversity Data Journal
- Stephen Mason, Jr + 10 more
Natural history collections serve as invaluable records of biological complexity across time and space. However, only a small fraction of these collections has been digitised globally, leaving the majority of specimen data inaccessible for research and digital analysis. Hog Island Audubon Camp, located in Bremen, Maine, United States, is a nature and birding education centre operated by the National Audubon Society’s Seabird Institute. It houses a small, but historically valuable natural history collection, primarily used for science education and outreach programmes. By digitising these data, we have increased the research value and accessibility of the collection by enabling scientists to use these records in support of ecological, evolutionary and conservation goals. Between 2022 and 2024, we digitised all specimens in the collection with available occurrence data using Darwin Core (DwC) as our digital standard. In total, 764 records were captured, representing 417 unique scientific names across animal, plant, fungal and chromista specimens. Most specimens originate from the New England region and the tri-state areas of the United States. Notably, over 300 specimens were collected during the 1960s and 1970s, a period when the harmful effects of DDT (dichlorodiphenyltrichloroethane) on humans and wildlife were becoming increasingly recognised. The oldest specimens in the collection are five New World warblers collected between 1872 and 1897.
- New
- Research Article
- 10.3897/bdj.14.e177202
- Jan 19, 2026
- Biodiversity Data Journal
- Juan Wen
The digitization of herbarium specimens is crucial for advancing biodiversity research and data sharing. However, this process is often hindered by the inefficiency of manual transcription and the technical challenges posed by the massive volume of specimens, heterogeneous label layouts, and the prevalence of handwritten texts. To overcome these bottlenecks, this study proposed an automated pipeline that integrates the PadddleOCR engine with the DeepSeek large language model (LLM) for structured information extraction from specimen labels. The pipeline is designed to extract 16 key metadata fields from both printed and handwritten labels. Evaluated on a benchmark dataset, it achieved a high field-level accuracy of 95.4% for printed labels, demonstrating strong reliability. For handwritten labels, the system maintained functionality while correctly identifying its limitations through a confidence-based quality control mechanism. A key finding was the compensatory role of the LLM, which effectively corrected upstream OCR errors, as evidenced by a weak correlation ( r = 0.32) between OCR (Optical Character Recognition) confidence and final extraction accuracy. This hybrid architecture ensures data security through local image processing and cost-efficiency via text-only LLM parsing. This work provides a robust, scalable, and practical solution for accelerating the digitization of botanical collections. The method is directly applicable to real-world digitization workflows and promises to significantly enhance the efficiency of biodiversity data creation and sharing.
- New
- Research Article
- 10.3897/bdj.14.e176933
- Jan 16, 2026
- Biodiversity Data Journal
- Nao Omi
The genus Allostoma von Beneden, 1861 (Pseudostomidae Graff, 1904) consists of generally small, often conspicuously colored species. These organisms are typically found on hard bottoms, in gravel, or among algae in marine and brackish-water environments. The Pseudostomid taxon encompasses approximately 55 known species globally, spanning regions across Europe, Asia, and North and South America. In Japan, however, only three species of this taxon have been reported to date: Allostoma durum (Fuhrmann, 1896), Allostoma densissimabursa (Omi, 2020) comb. nov., and Cylindrostoma monotrochum (Graff, 1882). Samples of seaweed and sand were collected from the coast of Shimane Prefecture in western Japan. Turbellarians were subsequently isolated and examined both in their live state and in histological sections. This investigation led to the description of a new species, Allostoma matsueensis sp. nov., characterized by a pear-shaped granular vesicle, non-sclerotized retractile penis, a narrow rigidly sclerotized conical spermatic duct, a supraterminal female pore, and a thin membraned bursa. The species further exhibits a yellow-to-pale brown body. This discovery constitutes the first record of the genus Allostoma from the western marine coast of Japan.
- New
- Research Article
- 10.3897/bdj.14.e173032
- Jan 14, 2026
- Biodiversity Data Journal
- Sang-Yeon Lee + 4 more
The Grey-headed woodpecker ( Picus canus ) inhabiting South Korea is an insectivorous bird species that favours ants. However, during winter, most insects enter a dormant stage, reducing their availability as food sources. This may prompt P. canus to change its primary food sources. To verify this hypothesis, we fabricated and installed 91 nest boxes in forests in Seocheon-gun, Chungnam Province, considering the body size of P. canus . From these, we collected faeces samples totalling 94 conical tubes of 1.5 ml each, obtained from 17 nest boxes used by P. canus as roosting sites. The collected faeces samples were examined under a microscope and their relative frequencies were calculated at the family or order level. Formicidae exceeded 70%, confirming it as the primary food source for P. canus during the breeding season and also in winter. Anacardiaceae fruits accounted for approximately 16%, significantly lower than Formicidae, but still considered a major winter food source for P. canus .
- New
- Research Article
- 10.3897/bdj.14.e180901
- Jan 13, 2026
- Biodiversity Data Journal
- He-Xiong Shi + 3 more
The genus Drapetisca comprises 7 species distributed in Canada, Central Asia, China, Caucasus, Europe, Japan, Russia (Europe to the Far East), and the USA, out of which five species have been recorded in China (World Spider Catalog 2025). The males and females of Drapetisca can be distinguished from all other Linyphiidae genera by the palpal distal arm of the paracymbium anchor-shaped in the retrolateral view and the protruding epigyne without a scape. While examining Linyphiidae Blackwall, 1859 specimens from Gele Mountain in Chongqing, we discovered a new species of the genus Drapetisca , D. dentata sp. nov. Detailed descriptions, photographs of copulatory organs and somatic features, and a distribution map are provided.
- New
- Research Article
- 10.3897/bdj.14.e179239
- Jan 13, 2026
- Biodiversity Data Journal
- Alexander Balakirev + 3 more
During fieldwork conducted in the Bac Yen District of Son La Province, near the Ta Xua Nature Reserve, a representative of a previously never registered genus and species of rodent in Vietnam, the Yunnan field mouse ( A. ilex O. Thomas, 1922), was discovered. This finding significantly expands our knowledge of the distribution of Apodemus species in northeastern Indochina. Morphological and genetic analysis of the specimen ruled out accidental introduction and established that the Vietnamese population is natural and belongs to the eastern phylogenetic branch of this species, which also inhabits the southern Yunnan-Guizhou Plateau, as well as the upper reaches of the Mekong, Hong Ha, and Da Rivers. Thus, the list of rodent species and genera in Vietnam has been expanded with a new species. Data on the species' geographic distribution across its range are also provided.
- New
- Research Article
- 10.3897/bdj.14.e173221
- Jan 12, 2026
- Biodiversity Data Journal
- Guilherme Oyarzabal + 10 more
BackgroundSpecies functional traits provide critical insights into how organisms interact with and respond to their environment. Key characteristics, such as body size, dispersal ability and trophic specialisation influence species' survival, reproduction and adaptability. Island ecosystems, particularly oceanic archipelagos like the Azores, serve as ideal natural laboratories for studying these traits due to their unique biogeographic history and high endemism. Arthropods, as dominant colonisers and ecosystem engineers, exhibit rapid adaptation and trait diversification in these isolated settings. However, island arthropods face escalating threats from habitat loss, climate change and invasive species, which disrupt ecological functions and increase extinction risks. Under the scope of BALA (Biodiversity of Arthropods from the Laurisilva of Azores) project (1999-2021) and SLAM (Long Term Ecological Study of the Impacts of Climate Change in the natural forest of Azores) project (2012-2025), we obtained a comprehensive and standardised dataset of arthropods functional traits currently known to occur in the Azores Archipelago.New informationWe present a standardised functional trait database for Azorean arthropods, building on 25 years of research on 602 species and subspecies across seven classes and 27 orders. The dataset includes endemic, native non-endemic and exotic species, with traits selected for their relevance to disturbance responses (e.g. body size, dispersal, verticality) and ecological interactions (e.g. trophic level, feeding behaviour). By synthesising these data, we aim to support predictive modelling of biodiversity responses to environmental change and provide information for conservation strategies. This resource provides a foundation for global comparisons and advances in trait-based ecology in island systems.
- New
- Research Article
- 10.3897/bdj.14.e173579
- Jan 8, 2026
- Biodiversity Data Journal
- Huan Trong Phan + 7 more
The combined pressures of overharvesting for ornamental and medicinal markets, along with pervasive habitat degradation, have precipitated a critical threat to wild populations of the jewel orchid, Ludisiadiscolor, within the biodiversity hotspots of Vietnam. The development of robust molecular tools is therefore essential to inform effective conservation strategies. This study established a suite of polymorphic Expressed Sequence Tag-derived Simple Sequence Repeat (EST-SSR) markers for L.discolor and applied them to assess genetic diversity and population structure. Transcriptome sequencing on the Illumina HiSeq™ 4000 platform yielded 44,764,702 high-quality reads. De novo assembly generated 26,271 unigenes (N50 = 2,160 bp; Q30 = 96.3%), which were comprehensively annotated against public databases (Nr, Swiss-Prot) and classified into Gene Ontology, KOG, and KEGG pathways. From 9,495 identified EST-SSR loci, 15 highly polymorphic markers were validated. Analysis of Vietnamese populations revealed moderate genetic diversity (Na = 3.17; He = 0.54; PIC = 0.59). Analysis of molecular variance (AMOVA) indicated significant genetic differentiation, with 23% of variation partitioned among populations. Bayesian clustering, Principal Coordinates Analysis (PCoA), and Neighbor-Joining (NJ) tree reconstruction consistently identified two distinct genetic clusters correlated with geography, suggesting restricted gene flow. These transcriptome-derived EST-SSR markers have proven effective for population genetic studies in L.discolor. The genomic resources and findings presented here provide a critical foundation for genetic monitoring and support the integration of molecular data into conservation and restoration programs for this ecologically and culturally significant orchid.
- New
- Research Article
- 10.3897/bdj.14.e163267
- Jan 7, 2026
- Biodiversity Data Journal
- Veikko Yrjölä + 7 more
BackgroundAgricultural expansion, a leading driver of biodiversity loss, has widespread effects on ecosystem services, particularly in tropical regions. In West Africa, the impact of intensified agriculture on local biodiversity – especially predator and decomposer species like spiders and ants – is understudied. This study aims to provide a checklist of terrestrial spiders and ants associated with savannahs and mango orchards in northern Ghana thus creating a baseline for further ecological studies on the community composition of these groups.New informationIn this data paper, we publish the baseline checklist and morphological measurements of spiders (Araneae) and ants (Hymenoptera, Formicidae) associated with forest savannahs and mango orchards located in northern Ghana. In total, we collected 64 species (28 unidentified morphospecies) of spiders and 64 species (24 unidentified morphospecies) of ants. Of these, almost all spider species and nine ant species were new records for Ghana, while many of the morphospecies could potentially be species new to science. In addition, we publish standardised morphological measurements of each species for potential functional diversity studies in the future.