A method for DNA histogram analysis is described that depends only on the simple assumption that the data are normally distributed and a requirement that a G1 peak is present. A probability density function was derived from the assumption that extracted the S-phase component from the whole histogram. The model was tested with simulated data, and good agreement between predicted and known proportions in G1, S, and G1+M was found. Good agreement was also found between duplicates of experimentally derived data. Some systematic errors are present in the analysis of certain types of histograms. However, these result in small errors when compared with biological and experimental variation and are less than the average of algorithms in current use.The program required only two queued requests, those of the start and the end channels over which the analysis is to be performed. The algorithms perform rapidly on a microcomputer with only 28K addressable memory. Only two failures occurred in over 350 analyses and the method can be used for drug-and radiation-perturbed populations as well as with unperturbed.
Key terms: Flow cytometry, DNA histogram analysisA number of models have been proposed for the analysis of flow cytometric DNA histogram data (1, 2, 4-9, 11-18, 25, 28-29) and a comparative review of the various methods has been published (3). .As expected, some models performed better than others, not only with simulated but also with experimentally derived data; however, none was ideal. The models that performed best tended to be those requiring a large mainframe computer, as they generally contained a large number of variables for which a solution had to be found. These include the position of the mean of the G1 and G2+M peaks, the standard deviations of these peaks, the age distribution of the population (22), the rate of DNA synthesis, and the relative durations of the G1, S, and G2 +M phase times with their standard deviations. Thus far, 12 variables have been defined, all of which may have to be considered simultaneously, which is not a trivial task, even with a large computer. Herein lies one of the central problems. The experimental DNA histogram contains, at best, two peaks and a trough corresponding to G1, G2 +M, and S-phase, respectively. This is a very simple data set with which to compute 12 variables, and this must be totally inadequate compared with the complexity of the biology.The work presented in this paper was undertaken to simplify the analysis of DNA histograms and to produce a robust method that gives results rapidly (within 30 s) on a microcomputer with only 28K addressable memory.
THEORY AssumptionsIn our attempt to produce a "minimum assumption and computing" model (MAC) we have only assumed that the data are normally distributed.
The Computer ProgramThe method of analysis will be illustrated by a description of the program flow diagram and the components of its various steps.Step 1. Eliminate high-frequency noise, if this exists, by "spreading" the data with a constant standard deviation o...