Difference between revisions of "NONHSAT122918"

[[Category:Intergenic]][[Category:NONHSAG048616]]

Revision as of 07:59, 13 October 2014

Please input one-sentence summary here.

Annotated Information

Name

ST7OT

Characteristics

Four noncoding RNAs arise from the RAY1/ST7 gene locus, a complex locus with a large number of coding and noncoding isoforms. ST7OT1 and ST7OT2 are antisense to ST7; ST7OT3 and ST7OT4 are on the sense strand.

ST7OT1 - Unspliced, polyadenylated antisense transcript of ~2 kb that initiates in intron 1 of ST7, covers the first exon and terminates in the ST7 promoter region. Note that it also overlaps exon 1 of ST7OT4.

ST7OT2 - Spliced, polyadenylated antisense transcripts. Multiple isoforms share either some 5' or some 3' exons; isoforms are 900-1300 bp long with 4-6 exons.

ST7OT3 - Spliced transcript of ~1 kb. It begins in intron 10 of ST7, shares exons 11-13 (and sometimes 14 and 15) with ST7, and has a number of unique exons. Some exons may also act as an alternative 3' end for ST7.

ST7OT4 - Multiple spliced transcripts within the long first intron of ST7. Transcription begins ~200 bp downstream of the end of the first ST7 exon. The RefSeq ST7OT4 transcript, however, may be protein coding.

Function

Please input function information here.

Expression

ST7OT1 - abundantly expressed in testis; also expressed in cerebellum, skeletal muscle, uterus, spinal cord, fetal brain, and prostate. ST7OT2 - expressed in testis, fetal brain, cerebellum, mammary gland, and fibroblasts. ST7OT3 - expression detected only in cerebellum. ST7OT4 - no expression detected.

Conservation

ST7OT1 - some sequence is conserved with mouse, but most of it lies where ST7OT1 overlaps the first exon of ST7. ST7OT2 - the 3' end of some isoforms shows some conservation with mouse. ST7OT3 - some sequence is conserved, but most of it lies in exons shared with ST7. ST7OT4 - sequence not conserved with mouse.

Misc

ST7OT1 - NCBI Gene ID: 93653. ST7OT2 - NCBI Gene ID: 93654. ST7OT3 - NCBI Gene ID: 93655. ST7OT4 - NCBI Gene ID: 338069

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Regulation

Please input regulation information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT122918

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

416 nt

Genomic location

chr7+:116594674..116608628

Exon number

3

Exons

116594674..116594733,116595028..116595207,116608453..116608628
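
The exon coordinates can be cross-checked against the reported transcript length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something the record states); the names `EXONS` and `parse_exons` are illustrative only.

```python
# Illustrative check of the exon table.
# Assumption: coordinates are 1-based and inclusive.
EXONS = "116594674..116594733,116595028..116595207,116608453..116608628"


def parse_exons(text):
    """Return a list of (start, end) integer pairs from 'a..b,c..d' notation."""
    pairs = []
    for part in text.split(","):
        start, end = part.split("..")
        pairs.append((int(start), int(end)))
    return pairs


exons = parse_exons(EXONS)

# Length of each exon, counting both end points.
exon_lengths = [end - start + 1 for start, end in exons]

# Gap between the end of one exon and the start of the next.
intron_lengths = [nxt[0] - cur[1] - 1 for cur, nxt in zip(exons, exons[1:])]

# Spliced transcript length and the genomic span it covers.
transcript_length = sum(exon_lengths)
genomic_span = exons[-1][1] - exons[0][0] + 1

print(exon_lengths)
print(intron_lengths)
print(transcript_length)
print(genomic_span)
```

Under that assumption the three exons are 60, 180 and 176 nt long, which sums to the 416 nt given under Length, and the first start and last end agree with the Genomic location entry.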

Genome context

Sequence
000001 CGAAACACAA AGTAAATTTC GAGCTAAGGG TTACAGTCCT TTAAATCTGG AAATTTCAAG TATCCCTCGG GATCGTGTCC 000080
000081 CTGGGAGGCG GATAGGATTC AGCACTCTAG AATTGTCTCA CCATTTGAAT AGGTGCTGCT CAGCCCTTTT GAGTCTGTCA 000160
000161 TCGGAAAGAC CATGGCTTTC TCGCTTTTCA TCTTTTTGGA TGGCTGCTTG TCGCAATATT GTCTTCTTCG CTTTAGACAA 000240
000241 tgtcttacta tgttgaccgt gctggactcg aattcctggg ctcaagcaat ccttctgcct caaccttctg attatttcga 000320
000321 ctacgggtgt gcgccaccat gcctggcTTA GAAAAGGACT TTATGTTGAA CCTCTTTTCT CTTACAATCA AATAATAAAT 000400
000401 TCATTGAGTC ATACTA

Predicted Small Protein

Name NONHSAT122918_smProtein_239:310
Length 24
Molecular weight 2499.7052
Aromaticity 0.173913043478
Instability index 45.0826086957
Isoelectric point 4.36907958984
Runs 6
Runs residual 0.0331262939959
Runs probability 0.0293823235
Amino acid sequence MSYYVDRAGLEFLGSSNPSASTF
Secondary structure LEEEEELHHHEEELLLLLLLLLL
PRMN -
PiMo -
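
The suffix 239:310 in the small protein name appears to give the position of the open reading frame within the transcript. The sketch below is one way to check that reading. It assumes the two numbers are 0-based, inclusive coordinates (the interpretation under which the frame starts on an ATG), translates with the standard genetic code, and computes aromaticity as the fraction of F, W and Y residues. All function and variable names are hypothetical and not taken from the database.

```python
# Sketch only. Assumptions: "239:310" is 0-based and inclusive; the standard
# genetic code applies; aromaticity = fraction of F, W and Y residues.
NUMBERED_SEQUENCE = """
000001 CGAAACACAA AGTAAATTTC GAGCTAAGGG TTACAGTCCT TTAAATCTGG AAATTTCAAG TATCCCTCGG GATCGTGTCC 000080
000081 CTGGGAGGCG GATAGGATTC AGCACTCTAG AATTGTCTCA CCATTTGAAT AGGTGCTGCT CAGCCCTTTT GAGTCTGTCA 000160
000161 TCGGAAAGAC CATGGCTTTC TCGCTTTTCA TCTTTTTGGA TGGCTGCTTG TCGCAATATT GTCTTCTTCG CTTTAGACAA 000240
000241 tgtcttacta tgttgaccgt gctggactcg aattcctggg ctcaagcaat ccttctgcct caaccttctg attatttcga 000320
000321 ctacgggtgt gcgccaccat gcctggcTTA GAAAAGGACT TTATGTTGAA CCTCTTTTCT CTTACAATCA AATAATAAAT 000400
000401 TCATTGAGTC ATACTA
"""


def parse_numbered_sequence(block):
    """Join the sequence chunks, dropping the position labels on each line."""
    chunks = [
        token
        for line in block.splitlines()
        for token in line.split()
        if token.isalpha()  # position labels are numeric, so they are skipped
    ]
    return "".join(chunks).upper()


# Standard genetic code, enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(orf):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(orf) - 2, 3):
        amino_acid = CODON_TABLE[orf[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


sequence = parse_numbered_sequence(NUMBERED_SEQUENCE)
orf = sequence[239:311]  # 0-based inclusive 239..310 is the slice [239:311]
protein = translate(orf)
aromaticity = sum(protein.count(aa) for aa in "FWY") / len(protein)

print(len(sequence), len(orf) // 3)
print(protein)
print(aromaticity)
```

Under these assumptions the frame spans 24 codons including the stop codon, which would account for the reported length of 24 even though the listed amino acid sequence has 23 residues.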