Background/Objectives: Cell lines do not faithfully replicate the authentic transcriptomic condition of the disease under study. The HepG2 cell line is widely used for studying hepatocellular carcinoma (HCC), but not all biological processes and genes exhibit congruent expression patterns between cell lines and the actual disease. The objective of this study is to perform a comparative transcriptomic analysis of the HepG2 cell line, HCC, and primary hepatocytes (PH) in order to identify genes suitable for research in HepG2 as a model for PH or HCC research. Methods: We conducted a differential expression analysis between publicly available data from HCC patients, PH, and HepG2. We examined specific overlaps of differentially expressed genes (DEGs) in a pairwise manner between groups in order to obtain a valuable gene list for studying HCC or PH using different parameter filtering. We looked into the function and druggability of these genes. Conclusions: In total, we identified 397 genes for HepG2 as a valuable HCC model and 421 genes for HepG2 as a valuable PH model, and with more stringent criteria, we derived a smaller list of 40 and 21 genes, respectively. The majority of genes identified as a valuable set for the HCC model are involved in DNA repair and protein degradation mechanisms. This research aims to provide detailed guidance on gene selection for studying diseases like hepatocellular carcinoma, primary hepatocytes, or others using cell lines.
Read full abstract