DNA sequences

This MedLibrary.org supplementary page on DNA sequences is provided directly from the open source Wikipedia as a service to our readers. Please see the note below on authorship of this content, as well as the Wikipedia usage guidelines. To search for other content from our encyclopedia supplement, please use the form below:

Electropherogram printout from automated sequencer for determining part of a DNA sequence
Electropherogram printout from automated sequencer for determining part of a DNA sequence

A DNA sequence or genetic sequence is a succession of letters representing the primary structure of a real or hypothetical DNA molecule or strand, with the capacity to carry information as described by the central dogma of molecular biology.

The possible letters are A, C, G, and T, representing the four nucleotide bases of a DNA strand — adenine, cytosine, guanine, thyminecovalently linked to a phosphodiester backbone. In the typical case, the sequences are printed abutting one another without gaps, as in the sequence AAAGTCTGAC, read left to right in the 5' to 3' direction. Short sequences of nucleotides are referred to as oligonucleotides and are used in a range of laboratory applications in molecular biology. With regard to biological function, a DNA sequence may be considered sense or antisense, and either coding or noncoding. DNA sequences can also contain "junk DNA."

Sequences can be derived from the biological raw material through a process called DNA sequencing.

In some special cases, letters besides A, T, C, and G are present in a sequence. These letters represent ambiguity. Of all the molecules sampled, there is more than one kind of nucleotide at that position. The rules of the International Union of Pure and Applied Chemistry (IUPAC) are as follows:

       A = adenine           
       C = cytosine            
       G = guanine             
       T = thymine           
       R = G A (purine)        
       Y = T C (pyrimidine)    
       K = G T (keto)    
       M = A C (amino)
       S = G C (strong bonds)
       W = A T (weak bonds)
       B = G T C (all but A)
       D = G A T (all but C)
       H = A C T (all but G)
       V = G C A (all but T)
       N = A G C T (any)     

See also

External links

Wikipedia content modification information:

  • This page was last modified on 10 September 2008, at 07:35.

Wikipedia Authorship and Review

Wikipedia content provided here is not reviewed directly by MedLibrary.org. Wikipedia content is authored by an open community of volunteers and is not produced by or in any way affiliated with MedLibrary.org.

Wikipedia Usage Guidelines

This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article on "DNA sequences".

The URL for this specific entry is:

All Wikipedia text is available under the terms of the GNU Free Documentation License. (See Copyrights for details). Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc.