Locations
1) Chr.Header of each sequence, starting with the Feature field
2) Start and end.Start and end coordinates of each feature in the sequence
Coordinate should be a value of integer type
If the end is greater than the starting coordinate, the feature is considered to be located in the reverse strand of the sequence
3) CompletenessWhether the feature is complete. For example, if it is a coding gene, all CDS features of the gene should be divided by 3.
If incomplete, mark the incomplete direction, 5 ', 3', or both ends.
Attributes
1) FeatureName of feature within the coordinate range.
All available features are listed in the drop-down box.
For the feature name within the coordinate range, we suggest that each coding locus should have [gene] features as the basic unit, even if the feature can be omitted in some cases.
If the feature has more than one qualifier, the feature column should be empty when these qualifiers are listed under the feature, which indicates that these qualifiers belong to the same feature block.
2) QualifierAnnotations for multiple dimensions of the feature
All available qualifiers of this feature are listed in the drop-down box after selecting the corresponding feature.
We hope you can describe the characteristics of this locus in as much detail as possible, even though this position can sometimes be omitted.
When there is more than one qualifier, all qualifiers should be listed below the line declaring the feature, and the feature column on the left should be empty.
3) Qualifier valueAfter declaring the corresponding qualifier, the user should fill in the specific contents of the qualifier
The user can be prompted in qualifier hints for the filling specification and definition of this value.
Qualifier hints
1) Type and exampleFor each qualifier, qualifier hints give written suggestions including examples, formats and definitions according to the INSDC standard.
[Qualifier hints] are only used for type hints. No matter what its content is, it will not affect the format verification and conversion of xlsx
Description of each feature
Definition | Feature | Qualifiers | Type | Comment |
1) region at the 3' end of a mature transcript (following the stop codon) that is not translated into a protein; 2) region at the 3' end of an RNA virus (following the last stop codon) that is not translated into a protein; | 3'UTR | allele | text | NA |
citation | [number] | |||
db_xref | ||||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
1) region at the 5' end of a mature transcript (preceding the initiation codon) that is not translated into a protein; 2) region at the 5' end of an RNA virus genome (preceding the first initiation codon) that is not translated into a protein; | 5'UTR | allele | text | NA |
citation | [number] | |||
db_xref | ||||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
coding sequence; sequence of nucleotides that corresponds with the sequence of amino acids in a protein (location includes stop codon); feature includes amino acid conceptual translation. | CDS | artificial_location | [artificial_location_value] | /codon_start has valid value of 1 or 2 or 3, indicating the offset at which the first complete codon of a coding feature can be found, relative to the first base of that feature; /transl_table defines the genetic code table used if other than the universal genetic code table; genetic code exceptions outside the range of the specified tables is reported in /transl_except qualifier; /protein_id consists of a stable ID portion (from the end of 2018 new accessions may be extended to a 3+7 accession format with 3 position letters and 7 numbers; existing data before the end of 2018 uses a 3+5 format) plus a version number after the decimal point; when the protein sequence encoded by the CDS changes, only the version number of the /protein_id value is incremented; the stable part of the /protein_id remains unchanged and as a result will permanently be associated with a given protein; |
circular_RNA | boolean | |||
citation | [number] | |||
codon_start | <1 or 2 or 3> | |||
db_xref | <database>:<identifier> | |||
EC_number | text | |||
exception | [exception_value] | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
number | unquoted text (single token) | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
protein_id | ||||
pseudo | boolean | |||
pseudogene | TYPE | |||
ribosomal_slippage | boolean | |||
standard_name | text | |||
translation | text | |||
transl_except | (pos:<location>,aa:<amino_acid>) | |||
transl_table | <integer> | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
constant region of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains; includes one or more exons depending on the particular chain | C_region | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
displacement loop; a region within mitochondrial DNA in which a short stretch of RNA is paired with one strand of DNA, displacing the original partner DNA strand in this region; also used to describe the displacement of a region of one strand of duplex DNA by a single stranded invader in the reaction catalyzed by RecA protein | D-loop | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) |
Definition | Feature | Qualifiers | Type | Comment |
Diversity segment of immunoglobulin heavy chain, and T-cell receptor beta chain; | D_segment | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
joining segment of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains; | J_segment | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
extra nucleotides inserted between rearranged immunoglobulinsegments. | N_region | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
sequence tagged site; short, single-copy DNA sequence that characterizes a mapping landmark on the genome and can be detected by PCR; a region of the genome can be mapped by determining the order of a series of STSs; | STS | allele | text | STS location to include primer(s) in primer_bind key or primers. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
switch region of immunoglobulin heavy chains; involved in the rearrangement of heavy chain DNA leading to the expression of a different immunoglobulin class from the same B-cell; | S_region | allele | text | STS location to include primer(s) in primer_bind key or primers. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
variable region of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains; codes for the variable amino terminal portion; can be composed of V_segments, D_segments, N_regions, and J_segments; | V_region | allele | text | STS location to include primer(s) in primer_bind key or primers. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
variable segment of immunoglobulin light and heavy chains, and T-cell receptor alpha, beta, and gamma chains; codes for most of the variable region (V_region) and the last few amino acids of the leader peptide; | V_segment | allele | text | STS location to include primer(s) in primer_bind key or primers. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
gap between two components of a genome or transcriptome assembly; | assembly_gap | (Mandatory) estimated_length | unknown or <integer> | the location span of the assembly_gap feature for an unknown gap has to be specified by the submitter; the specified gap length has to be reasonable (less or = 1000) and will be indicated as "n"'s in sequence. However, the value for the estimated_length of assembly_gap features within a single (non-CON) transcriptome record must be an integer and can not be "unknown"; |
(Mandatory) gap_type | is "within scaffold", "repeat within scaffold" or "contamination". If there are multiple types of linkage_evidence they will appear as multiple | |||
(Mandatory) linkage_evidence | TYPE (Note: Mandatory only if the qualifier is invalid.) |
Definition | Feature | Qualifiers | Type | Comment |
region of biological interest identified as a centromere and which has been experimentally characterized; | centromere | citation | [number] | the centromere feature describes the interval of DNA thatcorresponds to a region where chromatids are held and a kinetochore is formed |
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
note | text | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
region of genome that codes for portion of spliced mRNA, rRNA and tRNA; may contain 5'UTR, all CDSs and 3' UTR; | exon | allele | text | STS location to include primer(s) in primer_bind key or primers. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
EC_number | text | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
number | unquoted text (single token) | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
gap in the sequence | gap | experiment | [CATEGORY:]text | the location span of the gap feature for an unknown gap is 100 bp, with the 100 bp indicated as 100 "n"'s in the sequence. Where estimated length is indicated by an integer, this is indicated by the same number of "n"'s in the sequence. No upper or lower limit is set on the size of the gap. |
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
map | text | |||
note | text |
Definition | Feature | Qualifiers | Type | Comment |
region of biological interest identified as a gene and for which a name hasbeen assigned; | gene | allele | text | the gene feature describes the interval of DNA that corresponds to a genetic trait or phenotype; the feature is, by definition, not strictly bound to it's positions at the ends; it is meant to represent a region where the gene is located. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
phenotype | text | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
intervening DNA; DNA which is eliminated through any of several kinds of recombination; | iDNA | allele | text | e.g., in the somatic processing of immunoglobulin genes. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
number | unquoted text (single token) | |||
old_locus_tag | text (single token) | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
a segment of DNA that is transcribed, but removed from within the transcript by splicing together the sequences | intron | allele | text | e.g., in the somatic processing of immunoglobulin genes. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
number | unquoted text (single token) | |||
old_locus_tag | text (single token) | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
messenger RNA; includes 5'untranslated region (5'UTR), coding sequences(CDS, exon) and 3'untranslated region (3'UTR); | mRNA | allele | text | NA |
artificial_location | [artificial_location_value] | |||
circular_RNA | boolean | |||
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
mature peptide or protein coding sequence; coding sequence for the mature or final peptide or protein product following post-translational modification; the location does not include the stop codon (unlike the corresponding CDS); | mat_peptide | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
EC_number | text | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
any transcript or RNA product that cannot be defined by other RNA keys (prim_transcript, precursor_RNA, mRNA, 5'UTR, 3'UTR, exon, CDS, sig_peptide, transit_peptide, mat_peptide, intron, polyA_site, ncRNA, rRNA and tRNA); | misc_RNA | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
EC_number | text | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
site in nucleic acid which covalently or non-covalently binds another moiety that cannot be described by any other binding key (primer_bind or protein_bind); | misc_binding | allele | text | note that feature key regulatory with /regulatory_class="ribosome_binding_site" should be used for ribosome binding sites. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) |
Definition | Feature | Qualifiers | Type | Comment |
feature sequence is different from that presented in the entry and cannot be described by any other difference key (old_sequence, variation, or modified_base); | misc_difference | allele | text | the misc_difference feature key should be used to describe variability that arises as a result of genetic manipulation (e.g. site directed mutagenesis); use /replace="" to annotate deletion, e.g. misc_difference 412..433 /replace="" |
citation | [number] | |||
clone | text | |||
compare | [accession-number.sequence-version] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
phenotype | text | |||
replace | text | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
region of biological interest which cannot be described bykey; a new or rare feature; | misc_feature | allele | text | this key should not be used when the need is merely to mark a region in order to comment on it or to use it in another feature's location |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
number | unquoted text (single token) | |||
old_locus_tag | text (single token) | |||
phenotype | text | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
site of any generalized, site-specific or replicative recombination event where there is a breakage and reunion of duplex DNA that cannot be described by other recombination keys or qualifiers of source key (/proviral); | misc_recomb | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
recombination_class | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
any secondary or tertiary nucleotide structure or conformation that cannot be described by other Structure keys (stem_loop and D-loop); | misc_structure | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
region of genome containing mobile elements; | mobile_element | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
rpt_family | text | |||
rpt_type | text | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
the indicated nucleotide is a and should be substituted for by the indicated molecule (given in the mod_base qualifier value) | modified_base | allele | text | value is limited to the restricted vocabulary for modified base abbreviations; |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
frequency | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) |
Definition | Feature | Qualifiers | Type | Comment |
a non-protein-coding gene, other than ribosomal RNA and transfer molecule of which is the RNA transcript; | ncRNA | allele | text | the ncRNA feature is not used for ribosomal and transfer RNA annotation, for which the rRNA and tRNA feature keys should be used, respectively; |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
pseudo | text | |||
pseudogene | TYPE | |||
standard_name | text | |||
trans_splicing | Example | |||
ncRNA_class | TYPE |
Definition | Feature | Qualifiers | Type | Comment |
the presented sequence revises a previous version of the sequence at this location; | old_sequence | allele | text | /replace="" is used to annotate deletion, e.g. old_sequence 12..15 /replace="" NOTE: This feature key is not valid in entries/records created from 15-Oct-2007. |
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
replace | text |
Definition | Feature | Qualifiers | Type | Comment |
region containing transcript including a cluster of genes that are under the control of the same regulatory sequences/promoter and in the same biological pathway | operon | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
map | text | |||
note | text | |||
phenotype | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
origin of transfer; region of a DNA molecule where transfer is initiated during the process of conjugation or mobilization | oriT | allele | text | rep_origin should be used for origins of replication; /direction has legal values RIGHT, LEFT and BOTH, however only RIGHT and LEFT are valid when used in conjunction with the oriT feature; origins of transfer can be present in the chromosome; plasmids can contain multiple origins of transfer |
bound_moiety | text | |||
citation | [number] | |||
db_xref | <database>:<identifier> | |||
direction | value | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
rpt_family | text | |||
rpt_type | <repeat_type> | |||
rpt_unit_range | <base_range> | |||
rpt_unit_seq | TYPE | |||
standard_name | text Molecule Scope DNA |
Definition | Feature | Qualifiers | Type | Comment |
site on an RNA transcript to which will be added post-transcriptional polyadenylation; | polyA_site | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) |
Definition | Feature | Qualifiers | Type | Comment |
any RNA species that is not yet the mature RNA product; may include ncRNA, rRNA, tRNA, 5' untranslated region (5'UTR), coding sequences (CDS, exon), intervening sequences (intron) and 3' untranslated region (3'UTR); | precursor_RNA | allele | text | used for RNA which may be the result of post-transcriptional processing; if the RNA in question is known not to have been processed, use the prim_transcript key. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
standard_name | text | |||
trans_splicing | boolean |
Definition | Feature | Qualifiers | Type | Comment |
primary (initial, unprocessed) transcript; mayinclude ncRNA, rRNA, tRNA, 5' untranslated region (5'UTR), coding sequences (CDS, exon), intervening sequences (intron) and 3' untranslated region (3'UTR); | prim_transcript | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
non-covalent primer binding site for initiation of transcription, or reverse transcription; includes site(s) for synthetic e.g., PCR primer elements; | primer_bind | allele | text | used to annotate the site on a given sequence to which a primer molecule binds - not intended to represent the sequence of the primer molecule itself; PCR components and reaction times may be stored under the "/PCR_conditions" qualifier; since PCR reactions most often involve pairs of primers, a single primer_bind key may use the order() operator with two locations, or a pair of primer_bind keys may be used. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
standard_name | text | |||
PCR_conditions | text |
Definition | Feature | Qualifiers | Type | Comment |
propeptide coding sequence; coding sequence for the domain of a proprotein that is cleaved to form the mature protein product. | propeptide | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
non-covalent protein binding site on nucleic acid; | protein_bind | allele | text | note that feature key regulatory with /regulatory_class="ribosome_binding_site" should be used for ribosome binding sites. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
mature ribosomal RNA; RNA component of the ribonucleoprotein particle (ribosome) which assembles amino acids into proteins. | rRNA | allele | text | rRNA sizes should be annotated with the /product qualifier. |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
any region of sequence that functions in the regulation of transcription, translation, replication or chromatin structure; | regulatory | allele | text | This feature has replaced the following Feature Keys on 15- DEC-2014: enhancer, promoter, CAAT_signal, TATA_signal, - 35_signal, -10_signal, RBS, GC_signal, polyA_signal, attenuator, terminator, misc_signal. |
bound_moiety | text | |||
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
phenotype | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
region of genome containing repeating units; | repeat_region | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
rpt_family | text | |||
rpt_type | <repeat_type> | |||
rpt_unit_range | <base_range> | |||
rpt_unit_seq | TYPE | |||
satellite | <satellite_type>[:<class>][ <identifier>] | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
signal peptide coding sequence; coding sequence for an N-terminal domain of a secreted protein; this domain is involved in attaching nascent polypeptide to the membrane leader sequence; | sig_peptide | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
identifies the biological source of the specified span of the sequence; this key is mandatory; more than one source key per sequence is allowed; every entry/record will have, as a minimum, either a single source key spanning the entire sequence or multiple source keys, which together, span the entire sequence. | source | altitude | text | transgenic sequences must have at least two source feature keys; in a transgenic sequence the source feature key describing the organism that is the recipient of the DNA must span the entire sequence; see Appendix III /organelle for a list of <organelle_value> |
bio_material | [<institution-code>:[<collection-code>:]]<material_id> | |||
cell_line | text | |||
cell_type | text | |||
chromosome | text | |||
citation | [number] | |||
clone | text | |||
clone_lib | text | |||
collected_by | text | |||
collection_date | text | |||
country | <country_value>[:<region>][, <locality>] | |||
cultivar | text | |||
culture_collection | <institution-code>:[<collection-code>:]<culture_id> | |||
db_xref | <database>:<identifier> | |||
dev_stage | text | |||
ecotype | text | |||
environmental_sample | boolean | |||
focus | boolean | |||
germline | boolean | |||
haplogroup | text | |||
haplotype | text | |||
host | text | |||
identified_by | text | |||
isolate | text | |||
isolation_source | text | |||
lab_host | text | |||
lat_lon | text | |||
macronuclear | boolean | |||
map | text | |||
mating_type | text | |||
metagenome_source | text | |||
note | text | |||
organelle | <organelle_value> | |||
PCR_primers | [fwd_name: XXX, ]fwd_seq: xxxxx, [rev_name: YYY, ]rev_seq: yyyyy | |||
plasmid | text | |||
pop_variant | text | |||
proviral | boolean | |||
rearranged | boolean | |||
segment | text | |||
serotype | text | |||
serovar | text | |||
sex | text | |||
specimen_voucher | [<institution-code>:[<collection-code>:]]<specimen_id> | |||
strain | text | |||
sub_clone | text | |||
submitter_seqid | text | |||
sub_species | text | |||
sub_strain | text | |||
tissue_lib | text | |||
tissue_type | text | |||
transgenic | boolean | |||
type_material | <type-of-type> of <organism name> | |||
variety | text |
Definition | Feature | Qualifiers | Type | Comment |
hairpin; a double-helical region formed by base-pairing between adjacent (inverted) complementary sequences in a single strand of RNA or DNA. | stem_loop | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
mature transfer RNA, a small RNA molecule (75-85 bases long) that mediates the translation of a nucleic acid sequence into an amino acid sequence; | tRNA | allele | text | NA |
anticodon | (pos:<location>,aa:<amino_acid>,seq:<text>) | |||
circular_RNA | boolean | |||
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
operon | text | |||
product | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text | |||
trans_splicing | text |
Definition | Feature | Qualifiers | Type | Comment |
region of biological interest identified as a telomere and which has been experimentally characterized; | telomere | citation | [number] | the telomere feature describes the interval of DNA that corresponds to a specific structure at the end of the linear eukaryotic chromosome which is required for the integrity and maintenance of the end; this region is unique compared to the rest of the chromosome and represent the physical end of the chromosome; |
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
note | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
rpt_type | <repeat_type> | |||
rpt_unit_range | <base_range> | |||
rpt_unit_seq | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
transfer messenger RNA; tmRNA acts as a tRNA first, and then as an mRNA that encodes a peptide tag; the ribosome translates this mRNA region of tmRNA and attaches the encoded peptide tag to the C-terminus of the unfinished protein; this attached tag targets the protein for destruction or proteolysis; | tmRNA | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text | |||
tag_peptide | <base_range> |
Definition | Feature | Qualifiers | Type | Comment |
transit peptide coding sequence; coding sequence for an N-terminal domain of a nuclear-encoded organellar protein; this domain is involved in post-translational import of the protein into the organelle; | transit_peptide | allele | text | NA |
citation | [number] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
function | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
product | text | |||
pseudo | boolean | |||
pseudogene | TYPE | |||
standard_name | text |
Definition | Feature | Qualifiers | Type | Comment |
a small region of sequenced bases, generally 10 or fewer in its length, which could not be confidently identified. Such a region might contain called bases (A, T, G, or C), or a mixture of called-bases and uncalled-bases ('N'). The unsure feature should not be used when annotating gaps in genome assemblies. Please refer to assembly_gap feature for gaps within the sequence of an assembled genome. For annotation of gaps in other sequences than assembled genomes use the gap feature. | unsure | allele | text | use /replace="" to annotate deletion, e.g. unsure 11..15 /replace="" |
citation | [number] | |||
compare | [accession-number.sequence-version] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
replace | text |
Definition | Feature | Qualifiers | Type | Comment |
a related strain contains stable mutations from the same gene (e.g., RFLPs, polymorphisms, etc.) which differ from the presented sequence at this location (and possibly others); | variation | allele | text | used to describe alleles, RFLP's,and other naturally occurring mutations and polymorphisms; variability arising as a result of genetic manipulation (e.g. site directed mutagenesis) should be described with the misc_difference feature; use /replace="" to annotate deletion, e.g. variation 4..5 /replace="" |
citation | [number] | |||
compare | [accession-number.sequence-version] | |||
db_xref | <database>:<identifier> | |||
experiment | [CATEGORY:]text | |||
frequency | text | |||
gene | text | |||
gene_synonym | text | |||
inference | [CATEGORY:]TYPE[ (same species)][:EVIDENCE_BASIS] | |||
locus_tag | text (single token) | |||
map | text | |||
note | text | |||
old_locus_tag | text (single token) | |||
phenotype | text | |||
product | text | |||
replace | boolean | |||
standard_name | text |
Download