Abstract: The accurate determination of the sequence of nucleotide bases in a genomic region involves many steps, the last of which is DNA sequencing. Current sequencing methods rely on electrophoretic separation of their reaction products, the resolution of which is considerably less than the typical size of the clone being sequenced. The process of reconstructing the sequence of a clone from the sequences of its subclones is described in this chapter in the context of large-scale sequencing of the human genome. The chapter also discusses the key informatics problems with reference to the software that has been developed to address them. Several Unix-based packages and components have been developed for gel image processing, mostly by the large genome centers with the aim of streamlining operations and improving the quality of the data. The first step is to identify and remove all vector, because it is a product of the subcloning. Several approaches to automated vector clipping have been developed, such as the program VECTOR_CLIP. PREGAP, part of the Staden package, can take a batch of data from a variety of machines, gather the information required for processing a reading, identify good data, and mark cloning vector and Alu repeats. Cloning vector will not be present in regions that are already identified as sequencing vector and, if present at all, will comprise one end or the other of the reading. Both VECTOR_CLIP and CROSS_MATCH perform this function well.
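
The idea of vector clipping at the ends of a reading can be illustrated with a minimal Python sketch. It is not the algorithm used by VECTOR_CLIP, CROSS_MATCH, or PREGAP; it assumes exact k-mer matches against a known vector sequence, whereas a practical tool would need to tolerate sequencing errors. The function name `clip_vector`, the parameter `k`, and the example sequences are hypothetical and chosen only for illustration.

```python
from typing import Tuple


def clip_vector(read: str, vector: str, k: int = 8) -> Tuple[int, int]:
    """Return (start, end) so that read[start:end] excludes vector at either end.

    Assumption: vector is detected by exact k-mer matches only.
    """
    read, vector = read.upper(), vector.upper()
    # All k-mers that occur in the vector sequence.
    kmers = {vector[i:i + k] for i in range(len(vector) - k + 1)}

    # Mark every read position that lies inside a k-mer shared with the vector.
    covered = [False] * len(read)
    for i in range(len(read) - k + 1):
        if read[i:i + k] in kmers:
            for j in range(i, i + k):
                covered[j] = True

    # Trim the vector-covered run at the start of the read.
    start = 0
    while start < len(read) and covered[start]:
        start += 1
    # Trim the vector-covered run at the end of the read.
    end = len(read)
    while end > start and covered[end - 1]:
        end -= 1
    return start, end


# Hypothetical example: an insert flanked by fragments of a made-up vector.
vector = "GGATCCTCTAGAGTCGACCTGCAGGCATGC"
insert = "ATTTACGGCATTAGCCATAGGCTTAACG"
read = vector[-12:] + insert + vector[:10]
start, end = clip_vector(read, vector)
print(read[start:end] == insert)
```

Only runs of vector that touch an end of the reading are trimmed, which mirrors the observation that vector, if present at all, comprises one end or the other of the reading. Matches in the interior are left alone in this sketch.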