NCBI, producer and host of the GenBank database.
The GenBank sequence database is an open
access, annotated collection of all publicly available nucleotide sequences and their
protein translations. This database is produced at National Center for Biotechnology Information (NCBI) as part of the
International Nucleotide Sequence Database
Collaboration, or INSDC. GenBank and its
collaborators receive sequences produced in laboratories throughout the world from more than 100,000 distinct organisms. GenBank
continues to grow at an exponential rate, doubling every 10 months. Release 155, produced in August
2006, contained over 65 billion nucleotide bases in more than 61 million sequences. GenBank is built by direct submissions
from individual laboratories, as well as from bulk submissions from large-scale sequencing centers.
Direct submissions are made to GenBank using BankIt, which is a Web-based form, or the
stand-alone submission program, Sequin. Upon receipt of a sequence submission, the GenBank staff assigns an Accession number to the sequence and performs quality assurance checks. The
submissions are then released to the public database, where the entries are retrievable by Entrez
or downloadable by FTP. Bulk submissions of Expressed Sequence Tag (EST), Sequence Tagged Site (STS),
Genome Survey Sequence (GSS), and High-Throughput Genome Sequence
(HTGS) data are most often submitted by large-scale sequencing centers. The GenBank direct submissions group also processes
complete microbial genome sequences.
History
Walter Goad of the Theoretical Biology and Biophysics Group at Los Alamos National
Laboratory and others established the Los Alamos Sequence Database in 1979, which culminated in 1982 with the creation of
the public GenBank funded by the National Institutes of Health, the
National Science Foundation, the Department of Energy and the Department of Defense. LANL collaborated on GenBank with the firm
Bolt, Beranek, and Newman, and by the end of 1983 more than 2,000 sequences were stored in it.
In the mid 1980s, the Intelligenetics bioinformatics company at Stanford
University managed the GenBank project in collaboration with LANL. As one of the earliest bioinformatics community projects on the Internet, the GenBank project started BIOSCI/Bionet news groups for promoting open access communications among
bioscientists. During 1989 to 1992, the GenBank project transitioned to the newly created National Center for Biotechnology Information.
Growth of GenBank

See also
References
External links
Sources
This entry is from Wikipedia, the leading user-contributed encyclopedia. It may not have been reviewed by professional editors (see full disclaimer)