NONHSAT081014

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT081014

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3406 nt

Genomic location

chr20+:62667490..62671314

Exon number

2

Exons

62667490..62669555,62669975..62671314

Genome context

Sequence
000001 CTTGGAGGAG CCAACTTCAA AAGCCTCCGC CTGGCCTGCC CCTCTGCAGT CCTCCAGGCA GCCTGTCCAG GGAAGCGGTT 000080
000081 GGCTGGTGTT GGACCTGGTG CGGCAGTGCA CACCCTGCCC AAGGACCCCA GGATCATGTG CGTGGCCTGC TGGCGGGAGT 000160
000161 GAGGGAGGGC AAGACCCTGA AAGGCCCATG GTGAGCAGCT GTGGGCCTCC TCTAGGGAGG GGGCTTACTT GGGTGGAGGG 000240
000241 GCTGGGGGTA CACACAGCTC AGGAGACACT GCTTGTTCCC CTGAGCCCCA CCAGGAAATC TCCGGTTGGC TCTGGTCACT 000320
000321 GCGTCCCTGT GGGTGGCACT GGGTGCATGT GCCCTGGCTT CTGGCAGTTG GGGGAGGGGG GCTCCTGGGT GCCGGGCAAC 000400
000401 CCTGTTCTAA GCTGGGCTGA GGCTTGAGCC TGGAAGTGGT ACTAACAGGC CAGTTGGGAC TCCCAGGGCA GCCAGTCTGC 000480
000481 CCCACCTGGG TGGGACACAG CAGGTGCTGG ACATCCTTGC AGCTCCAGCT GGCCTGCTCA GGAGTCAGGC TCTGGTTCTC 000560
000561 GCTATGACCT CTGACCCAGA GCAGGCAGAG GGAATGACGT GGCAATTCTG AGCCCACCAA GTTGGCTGTG GCGTGTGTGG 000640
000641 TTGGGGGGGT CAGGTTTCAG GAAGGGGCCA CAGCAAGCCA TCTGCTCTTC CAGCCATTGC TGCTTTGTGC TCTGAGCAGG 000720
000721 AGCTGCCACC GCCCACCAGG CCCCCAGCTC TATGCAAGGG TTGGGGTGCT GCTTGTTCAG GCTCCCCTGA CAGGGCTAGG 000800
000801 AAGCCGAGTC GAGTCCTCTT ATCTTGGTTG GGTTCGCATG ATTAATTTCC AGCCCTAGAT CTGCTGTCTG ACTCCATCCT 000880
000881 GGGGGCCCAG GGGAAGGGTT GGGGCTGGCC CCTTTTGGGT GGCTGGGAAC CTGGGGGCCG TCAGCTGTCT CCTTGATGTT 000960
000961 TGGGCTTTAT CCCTGCGGGC GCAGGCCCTC CATTTTGGGT CGGGGGAGGC CTGTGGCACA GGCTGCCCGG AGGGATGGCG 001040
001041 TGGGGCGGCT GGTACAGGCA GAGGGGCCTT TTGCGACTGT TGGTTGGGGC TGGATGTGGT GGCTCCTTCC TCCTCTATAG 001120
001121 CTTCTGGGTG TCCCCCTAGC TTTCCCCTTC CAATAGGGTG TGGCTTGGAG CTCTGAGCCC CTGCCCCAGT TGGGTTGGGG 001200
001201 GTACAAGGAG TCCCCAGTTC CCACGTCTCT GTGAGCTAAC CCAGAGGGCG CCAGTATGGG AGGTGTGGAG TTGAATGGGC 001280
001281 ATGGCCATGA GTGAGACAGG CTGTACTGGG ACAACTTCCT ACTGTACCCC CAAGAGCCTG CAGACCAGCC CCACTAGAGC 001360
001361 TGTACGGGGC TGGTGGGGGG CCCTGCTCTG TGCGGTCCTG AGGGTGCTCC ACCCCTTGCT GGCCCCACTC CCACGTGGAC 001440
001441 CACAGAGCCA GGCTCTCCTG ACTCTGGGCC TGCCCCCTCT ATCTCCTGAT GGGTTCTTCA GGACTCAGTG ACATCGCCTC 001520
001521 CCAGCCGACA TCACTCACCC CTTTCACATC TTTGCAGGGA GCATAGAACT GTCTGGTTTG CAGACCTATC TCTTGTGTCT 001600
001601 CCTGATTTAT CCCGCCAGGC CTGTGCCTAC AGCCTTTCTT GATTGGTGAC ACGTCCATTT CATTCGCTCA GAGATCCCGT 001680
001681 GGGGTGGGGC AGGGGGTGGG GCTTCTACGG AAACAAAGTG CAGGTGGGCT GGCTGCTCCC TGTGCTGGGA ACACCCCCCA 001760
001761 GGCCTAACCC CAACCTGCTG TCTCCTCCAG GTACCCCCTA AGCCTGCACT CTGGGCGCTC CTGCTGGCGC TGCTGGGGAC 001840
001841 CGCGCCAAGC CGCGCCTATT CCCCGGCCTG CAGCGTCCCC GACGTGCTCC GCCACTATCG CGCCATCATC TTCGAGGATC 001920
001921 TGCAGGCCGC CGTGAAGTGG GGCGGGGCGG GGGCCGAAAA GACCAGGCCA GGCTCCAGAC ACTTTCATTT CATACAGAAA 002000
002001 AACCTGACTA GACCCGGGAG CTCGGGACGG CGGGGACGGC CTCGGGCCTC CTGTGGCGCC CAGAAGCGGA GCCGGCGGCC 002080
002081 CAAGATGCGC CCTGCCCGGC GTCGCGGCGG CCGAAGGCAG CTCCTGCTGC GCGCCCTGGA CGCCGTCGCC ACCTGCTGGG 002160
002161 AGAAGCTCTT CGCGCTGCGC GCCCCGGCCT CCAGGGACTC CTAGCGCGGC CCGTCCTGGC CCTGCGCGGG GAGGAGAACC 002240
002241 AGCGGGGCCG CGGCAGAGCC TGGAGACGCG CCTCGTTCTG TAGACTTGTT GGTGACCTCG GCCCCTCGCT CGACGCAGCC 002320
002321 CGCGCTCCCC GGAGGGCCCA GGACTTGGAG AAGGGAGCGC GCCTGGCCGC CGCTGGGTCA CGGAGGAGGC CCGCCCTCCA 002400
002401 CGCGCCGAAG GCCTCAATAA ACGGAGCTGG CGCTGCGGGT CCGGCACTCC CTTCGCCTGC CTCTCTGTGG GCCTAGGACC 002480
002481 GCCCCGGGAT CTGCGCCTCG GTGGCGGGGG GCGGTGGAGG GGGAATGCGG AGACCCGCAC TTCCTGTGGC ACCTGGACGG 002560
002561 GGCGGAGCCG CCGCTCCCAG CCTCGGGCAG CAGAGGGCGG ACGCGGGACG CTGGGCGCGC TCCGGCTCTG TCAGCGGCTC 002640
002641 CCGGCTGGGC GGAGGGCAGA GCGGGGGACT CCTGGGGCCC CTAAAACGTC ACGTATCCGT CCTGCTCCAC TTTGGACCGG 002720
002721 GGCCGGGGGT GGAAATTGAG GGGAGGGGAA GGCTGGACTT GGTGAATTCG GAGCAAAACA ACCGGATAAA GGAAACGGGC 002800
002801 CCGAAGGGAA GCGGGAGCTG CGGGCCTTGG CTGTGAGCTT GGCCAGGCTA TTTCCTGCTG CCTCCGGAGG GGTGACCACG 002880
002881 GCCTGGGCTC AGAGACTGCC CAGCCCCCTC TGCAGGACTG GCCAGGCTGG GGCCCGCTTT CTGGCCCCTG GAACCCTCCA 002960
002961 TTTGCTGGGG GACTGTCCCC TGGTTCCCCC ATCTGATGCG CAGGTCTGGG AGTCTGTCAC CCCCCTCCCC CAATCTCGGC 003040
003041 CCATGGAGGA GCAGGGGAGG TAAAGTACCT GACACCTGTT ACCCTCCCCT TCCCGGAAAG GCCGCGAGCA GGCTGTGCCT 003120
003121 CTGGCATTCT GTGGGATCAG GGGCAGCAGC AGAAGAAAGA AGTAACCTCC CAAGCTGACC CTGACCCAGC CCCACACTCT 003200
003201 GTGTTAATGT ATTTAGTCTG TGCAACGGGA GCTACTGTCT CTGTTTTATA GATGGGGAAA CTGAAGCACG AGGAGGCTGA 003280
003281 GACTTGCCCC AAATCACAGC CAGCAATGTA AGAGCTGGCC TCAGACCCAG GCCCAGGGCA TGGGATCTGC CACATGTCCC 003360
003361 CTGCCTGGAG GCTGGGGGCC AAGCTTCCTG GACTTCAGAG CCTTGG
[back to top]

Predicted Small Protein

Name NONHSAT081014_smProtein_1274:1510
Length 79
Molecular weight 8311.6862
Aromaticity 0.0641025641026
Instability index 56.7320512821
Isoelectric point 9.60748291016
Runs 9
Runs residual 0.0182926829268
Runs probability 0.0402755696873
Amino acid sequence MGMAMSETGCTGTTSYCTPKSLQTSPTRAVRGWWGALLCAVLRVLHPLLAPLPRGPQSQA
LLTLGLPPLSPDGFFRTQ
Secondary structure LLLEELLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHLLLLLLLLHHHH
HHHLLLLLLLLLLLEELL
PRMN -
PiMo -