Radiocarbon measurement is a well-established, routinely used, yet complex series of interlinked procedures. The degree of sample pre-treatment varies considerably depending on the material, the methods of processing pre-treated material vary across laboratories and the detection of 14 C at low levels remains challenging. As in any complex measurement process, the questions of quality assurance and quality control become paramount, both internally, i.e. within a laboratory and externally, across laboratories. The issue of comparability of measurements (and thus bias, accuracy and precision of measurement) from the diverse laboratories is one that has been the focus of considerable attention for some time, both within the 14 C community and the wider user communities. In the early years of the technique when there was only a small number of laboratories in existence, inter-comparisons would function on an ad hoc basis, usually involving small numbers of laboratories (e.g.Otlet et al, 1980). However, as more laboratories were setup and the detection methods were further developed (e.g. new AMS facilities), the need for more systematic work was recognised. The international efforts to create a global calibration curve also requires the use of data generated by different laboratories at different times, so that evidence of laboratory offsets is needed to inform curve formation. As a result of these factors, but also as part of general good laboratory practice, including laboratory benchmarking and quality assurance, the 14 C community has undertaken a wide-scale, farreaching and evolving programme of global inter-comparisons, to the benefit of laboratories and users alike. This paper looks at some of that history and considers what has been achieved in the past 30 years.