An important issue in the automation of two-dimensional gel electrophoresis image analysis is the detection and quantification of protein spots. A spot segmentation algorithm must detect, define the extent of, and measure the integrated density of spots under a wide variety of actual gel image conditions. Besides these functions, the algorithm must be memory efficient to be able to process very large gel images and do this in a reasonable amount of computation time on low-cost computers, such as workstations and personal computers. We have developed a fast spot segmentation algorithm, extending the GELLAB-II segmenter, which extracts spots in a single raster scanning pass through the gel image. The performance analysis of the algorithm will be given in the paper as well as a discussion of the algorithm.
Fast access of two-dimensional (2-D) gel quantitative databases is important for rapid searching for protein differences between sets of 2-D gels from an experiment. The GELLAB-II system organizes corresponding spots from the gels in the database into reference or "Rspot" sets. These Rspot numeric names index fixed regions in the paged composite gel database file. This is adequate for an existing database, but has several problems. (i) Building the initial database requires guessing how much disk space to pre-allocate for each corresponding spot (i.e. spots from different gels). If it ever runs out of pre-allocated space during this process, it must expand the size of each corresponding set of spots copying the old database data into the new in-place on the disk. (ii) When adding new gels or editing the database, if a new spot is created, the system may also go into this expansion mode. The time spent and wasted disk space can be appreciable--depending on the size of the database (order of 100 gel database). (iii) Because each set of corresponding spots is the same size, we waste space in most spot sets since they do not require the additional space a few spot sets require which contain additional fragmented spots. We present a new low-level disk object-based structure and algorithm, paged indexed buckets (PIB), which optimizes disk space usage while having similar retrieval speed to the original method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.