With representation of the global carbon cycle becoming increasingly complex in climate models, it is important to develop ways to quantitatively evaluate model performance against in situ and remote sensing observations. Here we present a systematic framework, the Carbon‐LAnd Model Intercomparison Project (C‐LAMP), for assessing terrestrial biogeochemistry models coupled to climate models using observations that span a wide range of temporal and spatial scales. As an example of the value of such comparisons, we used this framework to evaluate two biogeochemistry models that are integrated within the Community Climate System Model (CCSM) – Carnegie‐Ames‐Stanford Approach′ (CASA′) and carbon–nitrogen (CN). Both models underestimated the magnitude of net carbon uptake during the growing season in temperate and boreal forest ecosystems, based on comparison with atmospheric CO2 measurements and eddy covariance measurements of net ecosystem exchange. Comparison with MODerate Resolution Imaging Spectroradiometer (MODIS) measurements show that this low bias in model fluxes was caused, at least in part, by 1–3 month delays in the timing of maximum leaf area. In the tropics, the models overestimated carbon storage in woody biomass based on comparison with datasets from the Amazon. Reducing this model bias will probably weaken the sensitivity of terrestrial carbon fluxes to both atmospheric CO2 and climate. Global carbon sinks during the 1990s differed by a factor of two (2.4 Pg C yr−1 for CASA′ vs. 1.2 Pg C yr−1 for CN), with fluxes from both models compatible with the atmospheric budget given uncertainties in other terms. The models captured some of the timing of interannual global terrestrial carbon exchange during 1988–2004 based on comparison with atmospheric inversion results from TRANSCOM (r=0.66 for CASA′ and r=0.73 for CN). Adding (CASA′) or improving (CN) the representation of deforestation fires may further increase agreement with the atmospheric record. Information from C‐LAMP has enhanced model performance within CCSM and serves as a benchmark for future development. We propose that an open source, community‐wide platform for model‐data intercomparison is needed to speed model development and to strengthen ties between modeling and measurement communities. Important next steps include the design and analysis of land use change simulations (in both uncoupled and coupled modes), and the entrainment of additional ecological and earth system observations. Model results from C‐LAMP are publicly available on the Earth System Grid.