The theory of Chinese medicine (TCM) constitution contributes to the optimisation of individualised healthcare programmes. However, at present, TCM constitution identification mainly relies on inefficient questionnaires with subjective bias. Efficient and accurate TCM constitution identification can play an important role in individualised medicine and healthcare. Building an efficient model for identifying traditional Chinese medicine constitutions using objective tongue features and machine learning techniques. The DS01-A device was applied to collect tongue images and extract features. We trained and evaluated five machine learning models: Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), LightGBM (LGBM), and CatBoost (CB). Among these, we selected the model with the best performance as the base classifier for constructing our heterogeneous ensemble learning model. Using various performance metrics, including classification accuracy, precision, recall, F1 score, and area under curve (AUC), to comprehensively evaluate model performance. A total of 1149 tongue images were obtained and 45 features were extracted, forming dataset 1. RF, LGBM, and CB were selected as the base learners for the RLC-Stacking. On dataset 1, RLC-Stacking1 achieved an accuracy of 0.8122, outperforming individual classifiers. After feature selection, the classification accuracy of RLC-Stacking2 improved to 0.8287, an improvement of 0.00165 compared to RLC-Stacking1. RLC-Stacking2 achieved an accuracy exceeding 0.85 for identifying each TCM constitution type, indicating excellent identification performance. The study provides a reliable method for the accurate and rapid identification of TCM constitutions and can assist clinicians in tailoring individualized medical treatments based on personal constitution types and guide daily health care. The information extracted from tongue images serves as an effective marker for objective TCM constitution identification.
Read full abstract