The precise modeling and knowledge of non-gravitational forces acting on satellites is of big interest to many scientific tasks and missions. Since 2002, the twin GRACE satellites have measured these forces in a low Earth orbit with highly precise accelerometers, for about 15 years. Besides the significance for the GRACE mission, these measurement data allow the evaluation of modeling approaches and the improvement of force models. Unfortunately, before any scientific usage, the accelerometer measurements need to be calibrated, namely scale factor and bias have to be regularly estimated.In this study we demonstrate an accelerometer calibration approach, solely based on high precision non-gravitational force modeling without any use of empirically or stochastically estimated parameters, using our in-house developed satellite simulation tool XHPS. The aim of this work is twofold, first we use the accelerometer data and the residuals resulting from the calibration to quantitatively analyze and validate different non-gravitational force model approaches. In a second step, we compare the calibration results to three different calibration methods from different authors, based on gravity field recovery, GPS-based precise orbit determination, and based on modeled accelerations.We consider atmospheric drag forces and winds, as well as radiation forces due to solar radiation pressure, albedo, Earth infrared and thermal radiation (TRP) of the satellite itself. For TRP, we investigate different transient temperature calculation approaches for the satellite surfaces with absorbed power from the aforementioned radiation sources. A detailed finite element model of the satellite is utilized for every force, considering orientation, material properties and shadowing conditions for each element.For cross-track and radial direction, which are mainly affected by the radiative forces, our calibration residuals are quite small when drag is not super dominant (1–3 nm/s2 for total accelerations around ±50 nm/s2). For these directions the calibration seems to perform better than the other compared methods, where some bigger differences were found. For the drag dominated along-track direction it is vice versa, here our method is not sensitive enough because the difference between modeled and measured drag is bigger (e.g. residuals around 10 nm/s2 for total accelerations around ±70 nm/s2 for low solar activity). In along-track direction the orbit determination based methods are more sensitive and produce more reliable results. Results for the complete GRACE mission time span from 2003 to 2017 are shown, covering different seasonal environmental conditions.