The first Chinese Gaofen (GF) remote sensing satellite was launched in August 2013 and has been in orbit for more than 10 years, providing a rich variety of image product data for remote sensing applications in various industries, with other remote sensing satellites of the GF series. To ensure the reliability of the information generated via remote sensing applications, all remote sensing images must undergo systematic quality evaluation. The quality of GF satellites is evaluated almost comprehensively during in-orbit testing at the early stage after launch; however, the remaining evaluations of product quality including geometric or radiometric calibrations are performed only annually. Thus, long-term systematic image quality evaluations are lacking, and the quality level and long-term change trends of the image products are unknown. These deficits have introduced uncertainty in the application of these products. This study aimed to establish a radiometric quality evaluation index framework for GF satellites to comprehensively measure the radiometric quality of GF image products. Multiple types of sample data, such as uniform field, knife-edge target, and radiometric calibration field data with different reflectance features, were acquired globally. Four core indexes, namely, the relative radiometric correction accuracy (RRCA), absolute radiometric correction accuracy (ARCA), modulation transfer function (MTF), and signal-to-noise ratio (SNR), were used to analyze and evaluate the long-term radiometric quality of the images from four GF optical remote sensing satellites: GF1, GF2, GF6, and GF7. The four GF satellites had high SNR and RRCA but low dynamic MTF and variable ARCA for different reflectance features. For example, the average SNRs for the four satellites reached 42.66, 43.91, 44.33, and 47.46 dB; the average RRCA (@root-mean-square errors) values reached 1.6%, 1.0%, 1.6%, and 0.1%; the average MTF values (at Nyquist frequency) of the panchromatic bands were only 0.15, 0.12, 0.07 and 0.04; and the difference in ARCA for different reflectance targets was approximately 10%. In the long time series, the SNRs for each sensor of the four GF satellites were high and stable for high-reflectance features, whereas the RCA and SNR values for low-reflectance features and the MTF of the sensors fluctuated in a cyclic manner. Meanwhile, large dynamic fluctuations and episodic image quality problems occurred at certain local time points, which must be considered during practical applications. The findings presented here will lay a foundation for the in-depth application of GF images.