Abstract Our study shows that the intercomparison among sea surface temperature (SST) products is influenced by the choice of SST reference, and the interpolation of SST products. The influence of reference SST depends on whether the reference SSTs are averaged to a grid or in pointwise in situ locations, including buoy or Argo observations, and filtered by first-guess or climatology quality control (QC) algorithms. The influence of the interpolation depends on whether SST products are in their original grids or preprocessed into common coarse grids. The impacts of these factors are demonstrated in our assessments of eight widely used SST products (DOISST, MUR25, MGDSST, GAMSSA, OSTIA, GPB, CCI, CMC) relative to buoy observations: (i) when the reference SSTs are averaged onto 0.25° × 0.25° grid boxes, the magnitude of biases is lower in DOISST and MGDSST (<0.03°C), and magnitude of root-mean-square differences (RMSDs) is lower in DOISST (0.38°C) and OSTIA (0.43°C); (ii) when the same reference SSTs are evaluated at pointwise in situ locations, the standard deviations (SDs) are smaller in DOISST (0.38°C) and OSTIA (0.39°C) on 0.25° × 0.25° grids; but the SDs become smaller in OSTIA (0.34°C) and CMC (0.37°C) on products’ original grids, showing the advantage of those high-resolution analyses for resolving finer-scale SSTs; (iii) when a loose QC algorithm is applied to the reference buoy observations, SDs increase; and vice versa; however, the relative performance of products remains the same; and (iv) when the drifting-buoy or Argo observations are used as the reference, the magnitude of RMSDs and SDs become smaller, potentially due to changes in observing intervals. These results suggest that high-resolution SST analyses may take advantage in intercomparisons. Significance Statement Intercomparisons of gridded SST products be affected by how the products are compared with in situ observations: whether the products are in coarse (0.25°) or original (0.05°–0.10°) grids, whether the in situ SSTs are in their reported locations or gridded and how they are quality controlled, and whether the biases of satellite SSTs are corrected by localized matchups or large-scale patterns. By taking all these factors into account, our analyses indicate that the NOAA DOISST is among the best SST products for the long period (1981–present) and relatively coarse (0.25°) resolution that it was designed for.