This scientific paper presents a novel approach to explore and predict the potential of imidazole-based organic dyes for use in Dye-Sensitized Solar Cells (DSSCs) using a machine learning web application. The design of efficient and cost-effective organic dyes is critical to enhance the performance of DSSCs. Traditional experimental methods are time-consuming and resource-intensive, making it challenging to screen a large number of potential dyes. In this study, we propose a machine learning-based approach to accelerate the discovery process by predicting the photovoltaic performance of imidazole-based organic dyes. Machin learning predictions provide valuable insights into the expected PCE% and behaviors of the molecules toward DSSCs. Based on the RDKit library, several fingerprints such as Molecular ACCess System, Avalon, Daylight, Pharmacophore and Morgan with different radius (r2, r3, r4), were studied. In addition, more than 20 ML algorithms using different cross validation (3, 5, 7, 10) were also evaluated. Among of these, Deep Neural Network models of MLPRegressor algorithm based on the daylight fingerprint shows a significant coefficient of determination combined with the lowest errors. Utilize the trained ML models to screen of 50 million SMILE structure for identify promising imidazole and nitrogen-containing derivative as a doner group. By replacing the donor groups in the well-known MK2 dye structure with the top imidazole derivatives proposed by machine learning, significant improvements in PCE were observed, increasing from 7.70% to as high as 11.49%, representing nearly a 50% enhancement over the control. DFT calculations confirm the ML predictions and clarify the significantly higher oscillator strength and charge transfer properties of MK2-DM1, compared to MK2. This result provides a promising pathway for developing new dye materials that can push the efficiency limits of DSSCs, leading to more efficient solar energy conversion technologies in the future. In addition, a developed web application offers a user-friendly interface for researchers to input their molecular structures and obtain PCE% predictions toward DSSCs. This information can guide researchers in designing a new imidazole dye with high photovoltaic performance to validate and refine the predictions without time consuming.
Read full abstract