Abstract

Recently, Wordle has become popular worldwide as a daily puzzle game launched by the New York Times. Players try to solve the puzzle by guessing a five-letter word in six tries or less. According to Wordle's statistical data, this paper first uses the K-means algorithm to cluster the difficulty of solution words to quantify the difficulty of English words and analyzes the accuracy and scientificity of the clustering results. Then, the paper uses the Random Forest model to classify the difficulty of words into three categories: ‘easy’, ‘normal’ and ‘hard’. The results show that the classification accuracy on the training set and the test set reaches 0.972 and 0.978 respectively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call