The GenBanksequence database is an annotated collection of all publicly available nucleotide sequences and their protein translations. This database is produced at National Center for Biotechnology Information (NCBI) as part of an international collaboration with the European Molecular Biology Laboratory (EMBL) Data Library from the European Bioinformatics Institute (EBI) and the DNA Data Bank of Japan (DDBJ). GenBank and its collaborators receive sequences produced in laboratories throughout the world from more than 100,000 distinct organisms. GenBank continues to grow at an exponential rate, doubling every 10 months. Release 134, produced in February 2003, contained over 29.3 billion nucleotide bases in more than 23.0 million sequences. GenBank is built by direct submissions from individual laboratories, as well as from bulk submissions from large-scale sequencing centers. In the field of bioinformatics, a sequence database is a large collection of DNA, protein, or other sequences stored on a computer. ... A nucleotide is a chemical compound that consists of a heterocyclic base, a sugar, and one or more phosphate groups. ... A representation of the 3D structure of myoglobin, showing coloured alpha helices. ... The National Center for Biotechnology Information (NCBI) is part of the US National Library of Medicine (NLM), which is a branch of the US National Institutes of Health. ... The European Molecular Biology Laboratory (EMBL) is a molecular biology research institution supported by 18 European countries. ... The European Bioinformatics Institute (EBI) part of EMBL is a centre for research and services in bioinformatics. ... 2003 : January - February - March - April - May - June - July - August - September - October - November - December A timeline of events in the news for February, 2003. ...
Direct submissions are made to GenBank using BankIt, which is a Web-based form, or the stand-alone submission program, Sequin. Upon receipt of a sequence submission, the GenBank staff assigns an Accession number to the sequence and performs quality assurance checks. The submissions are then released to the public database, where the entries are retrievable by Entrez or downloadable by FTP. Bulk submissions of Expressed Sequence Tag (EST), Sequence Tagged Site (STS), Genome Survey Sequence (GSS), and High-Throughput Genome Sequence (HTGS) data are most often submitted by large-scale sequencing centers. The GenBank direct submissions group also processes complete microbial genome sequences. An Accession number is a unique identifier given to a sequence when it is submitted to one of the DNA repositories (GenBank, EMBL, DDBJ). ... The Entrez Global Query Cross-Database Search System allows access to databases at the National Center for Biotechnology Information (NCBI) website. ... FTP may refer to: File Transfer Protocol Foiled Twisted Pair This is a disambiguation page â a list of articles associated with the same title. ... An expressed sequence tag or EST is a short sub-sequence of a protein-coding DNA sequence. ...
There are approximately 85,759,586,764 bases in 82,853,685 sequence records in the traditional GenBank divisions and 108,635,736,141 bases in 27,439,206 sequence records in the WGS division as of February 2008.
GenBank is part of the International Nucleotide Sequence Database Collaboration, which comprises the DNA DataBank of Japan (DDBJ), the European Molecular Biology Laboratory (EMBL), and GenBank at NCBI.
Revisions or updates to GenBank entries can be made by the submitters at any time and can be accepted through the Update option on the BankIt page, in the text of an e-mail message, or as a Sequin file.