Calcareous nannofossils serve as crucial indicators for establishing the biostratigraphic age of chalk macrofossil specimens in natural science collections. Better age control of specimens collected several hundred years ago enables us to uncover the dark data hidden within these collections and incorporate these data into current research projects, examining ecosystem response to past climate change. However, the manual identification of these microscopic organisms is laborious and subjective, and so we are harnessing deep learning techniques for automatic nannofossil detection and identification. This approach required the construction of a robust dataset, currently comprising over 100,000 labelled images, complemented by the development of multiple specialised deep learning models. While some models focus on detecting target species, others are dedicated to species classification. Evaluation on an independent test set showcases the efficacy of our methodology, with the current detection model achieving a balanced accuracy of 93%. Similarly, the classification model demonstrates robust performance, attaining an average balanced accuracy of 96%. Furthermore, as well as assisting with our biostratigraphic studies, the dataset of accurately labelled images has enabled us to test other aspects of ecosystem response. For example, examining morphometric changes in nannofossils over geological time can provide valuable insights into the potential impact of current global warming on modern phytoplankton assemblages (Mancini et al. 2021). This is particularly important for coccolithophores (Young et al. 2005), which play a critical role as primary producers in the global carbon cycle. A decrease in their size could lead to bottom-up ecosystem impacts and reduced carbon sequestration (Poulton et al. 2007, Krumhardt et al. 2017). Using our dataset, we conducted a deep-learning-enhanced automatic morphometric analysis focusing on the nannofossil species, Tranolithus orionatus. Our analysis revealed that two key morphometric parameters, minor axis and area size, showed statistically significant differences between the Cenomanian stage (approximately 100.5 to 93.9 million years ago) and the post-Cenomanian stages of the Late Cretaceous (approximately 93.9 to 66.0 million years ago). Kolmogorov-Smirnov tests (Massey 1951) between the two samples yielded p-values of 0.039 for the minor axis and 0.031 for the area size. Understanding these morphometric changes is crucial due to the close parallels between current climate projections and the warming and greenhouse climate of the Late Cretaceous, particularly the Cenomanian-Turonian boundary event (Arthur et al. 1990). Insights into how organisms changed morphologically during past periods of environmental stress can help us more effectively predict future responses of organisms under similar conditions (Razmjooei et al. 2020). These findings underscore the effectiveness of our approach in automating the identification and recognition of chalk nannofossils, helping to unlock natural science collections and to address key questions related to marine response to past climate change.
Read full abstract