Purpose
A technology/function matrix is constructed to analyze the patents in a target domain, and extracting technology words is an important step in building it. The proposed algorithm addresses the low efficiency of traditional technology word extraction from Chinese process patents.

Design/methodology/approach
The authors propose a method for extracting technology words from Chinese process patents, based on an improved term frequency–inverse document frequency (TF-IDF) algorithm, to help technicians obtain the technology words in the target domain. According to the characteristics of technology words in Chinese process patents, the TF value of each candidate technology word is divided into four parts, and the corpus used to calculate the IDF value of candidate technology words is selected.

Findings
Tests on Chinese process patents in the domain of path planning show that the method is feasible and practical. It can help users quickly and accurately obtain the technology words of Chinese process patents in the target domain.

Practical implications
As the number of patents on network-based patent information platforms grows, the analysis of massive collections of Chinese process patents has become a research focus. The proposed method makes it easier for users to extract technology words from such collections for patent analysis.

Originality/value
This paper aims to improve the efficiency of technology word extraction from Chinese process patents. The authors hope that the proposed method can reduce the labor and time cost of this extraction.
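As a rough illustration of the idea of a TF value split into four parts, the sketch below scores candidate words with a weighted, section-wise TF multiplied by a smoothed IDF. It is not the authors' formulation: the abstract does not say what the four parts are, so the section names (`title`, `abstract`, `claims`, `description`), the weights, the smoothing and all function names are assumptions made for illustration. Patents are assumed to be already segmented into word lists, and English tokens stand in for Chinese words.

```python
import math
from collections import Counter

# Hypothetical weights: the abstract does not name the four TF parts or their weights.
SECTION_WEIGHTS = {"title": 0.4, "abstract": 0.3, "claims": 0.2, "description": 0.1}


def weighted_tf(term, patent):
    """TF as a weighted sum of the term's relative frequency in each section."""
    score = 0.0
    for section, weight in SECTION_WEIGHTS.items():
        tokens = patent.get(section, [])
        if tokens:
            score += weight * Counter(tokens)[term] / len(tokens)
    return score


def idf(term, corpus):
    """Smoothed IDF over the selected corpus (a list of patents)."""
    containing = sum(
        1 for doc in corpus if any(term in tokens for tokens in doc.values())
    )
    return math.log((1 + len(corpus)) / (1 + containing)) + 1.0


def rank_terms(patent, corpus, top_k=5):
    """Return the top_k candidate words of one patent, highest score first."""
    candidates = {t for tokens in patent.values() for t in tokens}
    scores = {t: weighted_tf(t, patent) * idf(t, corpus) for t in candidates}
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


# Toy data, invented for this example only.
patent = {
    "title": ["path", "planning", "method"],
    "abstract": ["path", "planning", "robot"],
    "claims": ["method", "robot", "path"],
    "description": ["robot", "moves"],
}
corpus = [
    patent,
    {"title": ["image", "method"], "abstract": ["image"],
     "claims": ["method"], "description": ["image"]},
    {"title": ["battery", "method"], "abstract": ["battery"],
     "claims": ["method"], "description": ["battery"]},
]

for word, score in rank_terms(patent, corpus):
    print(f"{word}\t{score:.3f}")
```

In this toy corpus a word such as "method", which occurs in every patent, receives the minimum IDF, while words concentrated in the heavily weighted sections of a single patent rank higher. Which corpus the IDF is computed over changes the ranking, which is why the choice of corpus is treated as part of the method.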