Historical, i.e. pre-1957, upper-air data are a valuable source of information on the state of the atmosphere, in some parts of the World back to the early 20th century. However, to date reanalyses have only partially made use of these data, and only of observations made after 1948. Even for the period between 1948 (the start-5 ing year of the NCEP/NCAR reanalysis) and the International Geophysical Year in 1957 (the starting year of the ERA-40 reanalysis), when the global upper-air coverage reached more or less its current status, many observations have not been digitised until now. The Comprehensive Historical Upper-Air Network (CHUAN) already compiled a large collection of pre-1957 upper-air data. In the framework of the European project 10 ERA-CLIM, significant amounts of additional upper-air data have been catalogued (> 1.3 mio station days), imaged (> 200 000 images) and digitised (> 700 000 station days) in order to prepare a new input dataset for upcoming reanalyses. The records cover large parts of the globe, focussing on so far less well covered regions such as the Tropics, the polar regions and the Oceans, and on very early upper-air data from Eu-15 rope and the US. The total number of digitised/inventoried records is 61/101 for moving upper-air data, i.e. data from ships etc., and 735/1783 for fixed upper-air stations. Here, we give a detailed description of the resulting dataset including the metadata and the quality checking procedures applied.