3D imaging of Earth’s surface layers (such as canopy, sub-surface, or ice) requires not just the penetration of radar signal into the medium, but also the ability to discriminate multiple scatterers within a slant-range and azimuth resolution cell. The latter requires having multiple radar channels distributed in across-track direction. Here, we describe the theory of multi-static radar tomography with emphasis on resolution, SNR, sidelobes, and nearest ambiguity location vs. platform distribution, observation geometry, and different multi-static modes. Signal-based 1D and 2D simulations are developed and results for various observation geometries, target distributions, acquisition modes, and radar parameters are shown and compared with the theory. Pros and cons of multi-static modes are compared and discussed. Results for various platform formations are shown, revealing that unequal spacing is useful to suppress ambiguities at the cost of increased multiplicative noise. In particular, we demonstrate that the multiple-input multiple-output (MIMO) mode, in combination with nonlinear spacing, outperforms the other modes in terms of ambiguity, sidelobe levels, and noise suppression. These findings are key to guiding the design of tomographic SAR formations for accurate surface topography and vegetation mapping.