Computer codes containing both hydrodynamics and radiation play a central role in simulating both astrophysical and inertial confinement fusion (ICF) phenomena. A crucial aspect of these codes is that they require an implicit solution of the radiation diffusion equations. We present in this paper the results of a comparison of five different linear solvers on a range of complex radiation and radiation–hydrodynamics problems. The linear solvers used are diagonally scaled conjugate gradient, GMRES with incomplete LU preconditioning, conjugate gradient with incomplete Cholesky preconditioning, multigrid, and multigrid-preconditioned conjugate gradient. These problems involve shock propagation, opacities varying over 5–6 orders of magnitude, tabular equations of state, and dynamic ALE (Arbitrary Lagrangian Eulerian) meshes. We perform a problem size scalability study by comparing linear solver performance over a wide range of problem sizes from 1000 to 100,000 zones. The fundamental question we address in this paper is: Is it more efficient to invert the matrix in many inexpensive steps (like diagonally scaled conjugate gradient) or in fewer expensive steps (like multigrid)? In addition, what is the answer to this question as a function of problem size and is the answer problem dependent? We find that the diagonally scaled conjugate gradient method performs poorly with the growth of problem size, increasing in both iteration count and overall CPU time with the size of the problem and also increasing for larger time steps. For all problems considered, the multigrid algorithms scale almost perfectly (i.e., the iteration count is approximately independent of problem size and problem time step). For pure radiation flow problems (i.e., no hydrodynamics), we see speedups in CPU time of factors of ≈15–30 for the largest problems, when comparing the multigrid solvers relative to diagonal scaled conjugate gradient. For the incomplete factorization preconditioners, we see a weak dependence of iteration count on problem size. The speedups observed for pure radiation flow are typically on the order of 10 relative to diagonal scaled conjugate gradient. For radiation–hydrodynamics problems, we again see multigrid scaling perfectly. However, for the problems considered, we see speedups relative to diagonal scaled conjugate gradient of no more than ≈10, with incomplete Cholesky in fact either equaling or outperforming multigrid. We trace these observations to the time step control and the feature of ALE to relax distorted zones.