Our novel approach for fish age prediction uses quantitative analysis of Fourier transform near-infrared (FT-NIR) spectra of otoliths by means of multimodal convolutional neural networks (MMCNN). We integrate two key data modalities that are related to fish ages: the entire range of wavenumbers of FT-NIR spectra and corresponding biological and geospatial data for nearly 9000 walleye pollock ( Gadus chalcogrammus) specimens. The proposed model extracts informative spectral features automatically and elucidates hidden structural relationships associated with fish growth to improve age predictions. Absorbance associated with 7000 to 4000 cm−1 wavenumbers had the highest influence on model predictions followed by fish length, latitude, depth, and temperature. The optimal model resulted in good overall performance with an R2 of 0.93 and RMSE of 0.83 for training data set and R2 of 0.92 and RMSE of 0.91 for test data set. MMCNN's age predictions were comparable to microscope-based ages yielding as good or slightly better precision. Moreover, the model outperformed classical partial least squares analysis of otolith spectra and remedied prediction bias at older ages of fish.
Read full abstract