Variome Data Standards(V1.0 beta)
- 3. Data analysis standards
- 4. Nomenclature standards
4.6 Nucleotides and Amino Acid nomenclature [7]
The nucleotide and amino acid symbols using in this database follow the Nomenclature for Incompletely Specified Bases in Nucleic Acid Sequences (see IUBMB (NC-IUB)) as the DNA and amino acid nomenclature of the HGVS (Human Genome Variation Society) recommends [7].DNA [8-12]:
| Symbol | Meaning | Description |
|---|---|---|
| A | A | Adenine |
| C | C | Cytosine |
| G | G | Guanine |
| T | T | Thymine |
| B | C, G or T | not-A (B follows A in alphabet) |
| D | A, G or T | not-C (D follows C in alphabet) |
| H | A, C or T | not-G (H follows G in alphabet) |
| K | G or T | Keto |
| M | A or C | aMino |
| N | A, C, G or T | aNy |
| R | A or G | puRine |
| S | G or C | Strong interaction (3 H-bonds) |
| V | A, C or G | not-T / not-U ( V follows U ) |
| W | A or T | Weak interaction (2 H-bonds) |
| Y | C or T | pYrimidine |
| Amino acid | Single-letter code | Triplet (5'-3') | Possible Genotic Codons |
|---|---|---|---|
| Terminator | . | TRR (TAR and TGA) | TAA, TAG, TGA (translation termination) |
| Alanine | A | GCN | GCA, GCC, GCG, GCT |
| Aspartic acid or asparagine | B | RAY | AAC, AAT, GAC, GAT |
| Cysteine | C | TGY | TGC, TGT |
| Aspartic acid | D | GAY | GAC, GAT |
| Glutamic acid | E | GAR | GAA, GAG |
| Phenylalanine | F | TTY | TTC, TTT |
| Glycine | G | GGN | GGA, GGC, GGG, GGT |
| Histidine | H | CAY | CAC, CAT |
| Isoleucine | I | ATH | ATA, ATC, ATT |
| Lysine | K | AAR | AAA, AAG |
| Leucine | L | YTN (CTN and TTR) | CTA, CTC, CTG, CTT, TTA, TTG |
| Methionine | M | ATG | ATG (translation initiation) |
| Asparagine | N | AAY | AAC, AAT |
| Proline | P | CCN | CCA, CCC, CCG, CCT |
| Glutamine | Q | CAR | CAA, CAG |
| Arginine | R | MGN (CGN and AGR) | AGA, AGG, CGA, CGC, CGG, CGT |
| Serine | S | WSN (TCN and AGY) | AGC, AGT, TCA, TCC, TCG, TCT |
| Threonine | T | ACN | ACA, ACC, ACG, ACT |
| Selenocysteine | U | Sec | TGA |
| Valine | V | GTN | GTA, GTC, GTG, GTT |
| Tryptophan | W | TGG | TGG |
| Unknown | X | NNN | NNN |
| Tyrosine | Y | TAY | TAC, TAT |
| Glutamic acid or glutamine | Z | SAR |
