FACTOID # 55: NationMaster.com is now 40 times the size of the CIA World Factbook!
 
 Home   Encyclopedia   Statistics   Countries A-Z   Flags   Maps   Education   Forum   FAQ   About 
 
WHAT'S NEW
RECENT ARTICLES
More Recent Articles »
 

SEARCH ALL

FACTS & STATISTICS    Advanced view

Search encyclopedia, statistics and forums:

 

 

(* = Graphable)

 

 


Encyclopedia > Consensus sequence

In molecular biology and bioinformatics, a consensus sequence is a way of representing the results of a multiple sequence alignment, where related seqeunces are compared to each other, and similar functional sequence motifs are found. The concensus sequence shows which residues are conserved (are always the same), and which residues are variable.


Developing software for pattern recognition is a major topic in genetics, molecular biology, and bioinformatics. Specific sequence motifs can function as regulatory sequences controling biosynthesis, or as signal sequences that direct a molecule to a specific site within the cell or regulate its maturation. Since the regulatory function of these sequences is important, they are thought to be conserved across long periods of evolution. In some cases, evolutionary relatedness can be estimated by the amount of conservation of these sites.


The conserved sequence motifs are called consensus sequences and they show which residues are conserved and which residues are variable. Consider the following example DNA sequence:


A[CT]N{A}


In this notation, A means that always an A is found in that position. [CT] stands for either C or T, N stands for any base, and {A} means any base except A.


In this example, the notation [CT] does not give any indication of the relative frequency of C or T occurring at that position. An alternative method of representing a consensus sequence uses a sequence logo. This is a graphical representation of the consensus sequence, in which the size of a symbol is related to the frequency that a given nucleotide (or amino acid) occurs at a certain position. In sequence logo's the more conserved the residue, the larger the symbol for that residue is drawn, the less frequent, the smaller the symbol. Sequence logo's can be generated using the Gestalt Workbench (http://www.t1dbase.org/gestalt/), a publically available visualization tool written by Gustavo Glusman at the Institute for Systems Biology (http://www.systemsbiology.org).


See also

  • weight matrix

  Results from FactBites:
 
Consensus sequence (290 words)
Consensus sequence is a genetic sequence found with minor variations and similar functions in widely divergent organisms or genetic locations.
In molecular biology and bioinformatics, one major goal is to parse genetic sequences and employ methods of statistical classification for the identification of consensus sequences.
A consensus sequence can vary in length and can be have multiple instances in a single genome (the full set of chromosomes of an individual, or the total number of genes in such a set).
Consensus sequence - Wikipedia, the free encyclopedia (464 words)
In molecular biology and bioinformatics, a consensus sequence is a way of representing the results of a multiple sequence alignment, where related sequences are compared to each other, and similar functional sequence motifs are found.
This is a graphical representation of the consensus sequence, in which the size of a symbol is related to the frequency that a given nucleotide (or amino acid) occurs at a certain position.
A consensus sequence may be a short sequence of nucleotides which is found several times in the genome and is thought to play the same role in its different locations.
  More results at FactBites »


 

COMMENTARY     


Share your thoughts, questions and commentary here
Your name
Your comments
Please enter the 5-letter protection code

Want to know more?
Search encyclopedia, statistics and forums:

 


Lesson Plans | Student Area | Student FAQ | Reviews | Press Releases |  Feeds | Contact
The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.
Usage implies agreement with terms.