We propose a novel latent factor pricing model to extract latent pricing factors and corresponding factor loadings from multi-source heterogeneous information through a deep learning architecture. Notably, we pioneer the extraction of policy pricing factors from China’s national strategies (“Five-Year Plans”, “Government Work Reports”, and “Monetary Policy Reports”) using natural language processing and a dynamic topic model. The proposed mixed-frequency deep factor asset pricing (MIDAS-DF) model learns from mixed-frequency heterogeneous data and captures nonlinear joint patterns between inputs and outputs, providing more nuanced insights into asset pricing. The empirical analyses of the Chinese A-share market from January 1, 2003 to July 31, 2022 show that the MIDAS-DF model outperforms competing models in pricing individual stocks, various test portfolios, and investment portfolios. The results also demonstrate that low-frequency policy information anchors long-term pricing trends, while high-frequency market and sentiment information refine short-term pricing accuracy. They work together to enhance the pricing performance.
Read full abstract