Solar cell research aims to improve power conversion efficiency (PCE). This field has an extensive body of literature on the Web of Science. For researchers, it is impossible to understand the development of the entire field comprehensively through traditional reading methods. Knowledge is recorded in the literature by text and numbers. Researchers acquire knowledge through literature surveying, text reading, and thinking. The conversion from text and numbers to knowledge can be automatically completed by machines, which can avoid path‐dependent perspectives. In this work, an intelligent machine learning method for literature structure delineation and information extraction is proposed. As an example, a knowledge base of organic solar cells (OSCs) is extracted including topic analysis of literature, numerical characteristics of performance, and material information. Seven major research directions of OSCs are identified. The correlations between key performance parameters, including PCE, short‐circuit current density (JSC), open‐circuit voltage (VOC), and fill factor (FF), are revealed from text mining. A donor–acceptor material map of PCE is constructed which provides a road map for OSCs, indicating the bottleneck of this field. Moreover, the method of machine intelligence developed here can be used in any other materials field, aiding a comprehensive understanding of the development quickly.
Read full abstract