Geometric stability via information theory, Discrete Analysis, 2016:10, 28pp. Let $A$ be a subset of $\mathbb R^3$. Then we can project $A$ onto the $xy$-plane, the $yz$-plane and the $xz$-plane. If we are given the areas of these projections, how large can the volume of $A$ be? A natural example to consider is when $A$ is a cuboid aligned with the coordinate axes. In that case the volume of $A$ is the square root of the product of the areas of the projections. That turns out to be the best we can do, by a famous result of Loomis and Whitney. The Loomis-Whitney inequality is the following more general statement. Let $g_1,\dots,g_d$ be non-negative functions defined on $\mathbb R^{d-1}$ and for $1\leq i\leq d$ let $\tilde{\pi}_i:\mathbb R^d\to\mathbb R^{d-1}$ be the projection that removes the $i$th coordinate from a vector. Then $$\int_{\mathbb R^d}\prod_{i=1}^dg_i(\tilde{\pi}_i(x))\,dx\leq\prod_{i=1}^d\|g_i\|_{d-1}.$$ To see how this relates to the statement about volumes, consider the case where each $g_i$ takes values 0 and 1. Then we can think of $g_i$ as the characteristic function of a set $A_i$, and $\prod (g_i \circ \tilde{\pi}_i)$ is then the characteristic function of the largest set $A$ such that $\tilde{\pi}_i(A)\subset A_i$ for each $i$. Also, $\|g_i\|_{d-1}=|A_i|^{1/(d-1)}$, so the right-hand side is equal to the product of the $(d-1)$th roots of the volumes of the $A_i$. Setting $d=3$ we obtain the result stated in the first paragraph. The Loomis-Whitney inequality has been further generalized. For example, the uniform-cover inequality of Chung, Frankl, Graham and Shearer (which is equivalent to the Box Theorem of Bollobas and Thomason) relates the volume of a set to the volumes of lower-dimensional projections (where these dimensions may differ), and the Brascamp-Lieb inequality can be used to relate the volume of a set to the volumes of projections that are not necessarily orthogonal to each other. Given any inequality, it is natural to try to characterize the equality cases, which for the Loomis-Whitney and uniform-cover inequalities are given by products of subsets of $\mathbb{R}$. It is also interesting, and often very useful, to understand the _near_-equality cases. The aim here tends to be to prove _stability_ results, which state that if equality holds, then the example for which it holds is close (in some appropriate sense) to one for which it holds exactly. This paper proves stability versions of the Loomis-Whitney and uniform-cover inequalities, and obtains as a consequence a stability version of the edge-isoperimetric inequality in the grid $\mathbb Z^d$. This last inequality states that if $A$ is a subset of $\mathbb{Z}^d$ of cardinality $m$, then the number of edges incident with points in $A$ is minimized when $A$ is a cuboid (in the obvious sense). Returning to the result mentioned at the beginning, it follows from the main result of this paper that if $A\subset\mathbb R^3$ has volume as large as the product of the square roots of the areas of its three projections, then there exist three subsets $K_1,K_2,K_3 \subset \mathbb{R}$ such that the measure of the symmetric difference $A\bigtriangleup (K_1 \times K_2 \times K_3)$ is small. To prove a stability version of an inequality, it is sometimes enough to examine carefully the proof that gives the equality cases and replace each step by its almost version. For example, it is an easy exercise to prove a stability version of the Cauchy-Schwarz inequality, which will say that if $u$ and $v$ are vectors in an inner-product space and the real part of the inner product $\langle u,v\rangle$ is at least $(1-\epsilon)\|u\|_2\|v\|_2$, then the unit vectors $u/\|u\|_2$ and $v/\|v\|_2$ are close. (More precisely, their distance is at most $\sqrt{2\epsilon}$, and this cannot be improved.) A naive argument of this kind can be used for the Loomis-Whitney inequality as well (based on the proof of the Loomis-Whitney inequality which uses Holder's inequality and induction on the dimension), but it does not give an optimal dependence on the closeness parameter $\epsilon$. In general for stability results, having an optimal dependence is highly desirable, because it means that in a certain sense one has a complete structural characterization of the near-extremal cases. That is, the fact that near-equality holds implies structural information, and that structural information implies that near-equality holds, and if one applies both implications, one ends up back where one started without any appreciable loss of information. The authors base their arguments on one of the well-known approaches to proving the Loomis-Whitney inequality: entropy methods. The main difficulty they face is that entropy methods yield probability distributions, whereas they need to end up with a set (in fact, a Cartesian product of 1-dimensional sets). Considerable care is needed to prove that in moving back from distributions to sets, they do not throw too much away. This requires them to introduce a new measure of distance, which they call _hole weight_.