Machine Learning-based Automated Essay Scoring System for Chinese Proficiency Test (HSK)

Rui Xiao,Xiaoyan Ma,Jiaqi Jiang,Yunchun Zhang,Wenbin Guo

doi:10.1145/3443279.3443299

Abstract

Automated essay scoring (AES) gains momentum recently in English-based environment. However, the development of Chinese AES system is slow and fruitless. Many foreign students participate in the Chinese Proficiency Test (HSK) so a HSK automated essay scoring system (HSK AES) is in high demand. To develop an effective and reliable HSK AES system, this paper proposes three machine learning and deep learning models that take HSK essays as input. We apply Word2vec and TF-IDF (term frequency-inverse document frequency) methods to extract important features from the original essays. Three machine learning models, including XGBoost, one deep neural network with flatten and dense layer and another deep neural network with LSTM (long short-term memory) and dense layer, are trained. The experimental results show that XGBoost with TF-IDF outperforms the other two models with the lowest MAE (mean absolute error) as 6.7%. We also prove that deep neural networks either with LSTM (long short-term memory) or with flatten perform unsatisfactory on HSK AES.

Full Text