NONHSAT082198

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.


Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT082198

Source

NONCODE4.0

Same with

-

Classification

intergenic

Length

3980 nt

Genomic location

chr21+:40897510..40901782

Exon number

2

Exons

40897510..40897992,40898286..40901782
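
The Length field can be cross-checked against the Exons field. The following is a minimal sketch, assuming the exon coordinates are 1-based with inclusive ends (the record does not state the convention); `exon_lengths` is a hypothetical helper name, not part of any LncRNAWiki or NONCODE tool.

```python
# Exon coordinates copied from the "Exons" field of this record.
# Assumption: 1-based, inclusive start..end pairs separated by commas.
exons = "40897510..40897992,40898286..40901782"


def exon_lengths(spec):
    """Return the length in nt of each start..end exon in the spec string."""
    lengths = []
    for part in spec.split(","):
        start, end = (int(x) for x in part.split(".."))
        lengths.append(end - start + 1)  # inclusive ends, hence the +1
    return lengths


lengths = exon_lengths(exons)
total = sum(lengths)
print(lengths, total)
```

Under that assumption the two exon lengths sum to the 3980 nt given in the Length field, which is shorter than the genomic span because the intron between the exons is excluded.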

Genome context

Sequence
000001 ATGTACAGTG CCGAAGACCT CCTGATCACT CATGGGTACA AACTGTTGAG AGACCTCCCA GCACCACGCA GGGATAACCC 000080
000081 TGAGGGATGC CAGCCAGCAA AGACAAGGAT GCGAGTGGTC TGTGGCCTGC TGAATAGGCA TGAGGATGGC CCTGCGGCCT 000160
000161 TTGCACATCA TAAGACACCT GCTGGGAAAG GGCGTGGGAG TGACTCTGAA AGCTGCCGCG CCACACTGAG AGGCCACGGG 000240
000241 GATCCCCAGA GCACTTCTGC TTCTAGAACC TCTGAGGCAG GGCTTCATAA TCAACCAACC CCAGACTGTT AATGACCAAG 000320
000321 CCAACTGGAG AAGAGGAGCG CGGGAAGTCA GCAGCCTGCT GGGTCCGAGG GACCCAGAAG ACCTGCAGGA ATGACCCAAG 000400
000401 CCCACAGTCC GCCCGTCCAC ATGAGGCAGC GTCCATAGGA AGTCAGAGGA AGGGCAGAGG ATGTGACGAA TAAGACAACT 000480
000481 TGGAAGAAGG CAGCTTGGGA AGAAGAATTG AGAATGTTGG GTCCTGCCAA GTGGCTGAAC GTCTGCAACC AGCCAAGGAA 000560
000561 ATTCAGGTGG CGGGTGTCTG ATGGTGATGG GAAGAAATTA TTTCAAGATC TGTTACTGTT CATTCAAGGA GAACACGTGT 000640
000641 CGAATTCTCA AAACAAAGAG AAATCCCAAT TATTGCCTAG AACTCTCCCC TCTCCCACCC CCATAGCCCT CCGACCCACA 000720
000721 GCCTGAATTG CACAGAAATT TCCATTCCAT TAAATGACAG ACATTTACCT AAAATGTCAC AGTATTCTCC CAATTATGCA 000800
000801 ACAAATTTGA AATTCACAAG TGACCCTGAG AAGGGTGGCT CCTTGGCCTC TTTTCCTCAG CCTAAGTTTG GGAGACCCCT 000880
000881 CAAGCCCCCA TCTGACGGCT TGCACCACCA GTCCCATAGG GAGTAGAAAG TGGTGACTAT CAGGACAGCC AGCAGCGGAT 000960
000961 CCATAAGTCC CCAGGCATGA GCTCTGCATG TCTGACTTTG GATTGGAGCC TCCAGTAGGT ACCTCCACCA TCACACAGGT 001040
001041 CACCCCCCAG CACATCCCAA GCCCCTACTT GGAAGACACA GTGCTCACTC ATATGTGGCA TTCACGGTCA GCAGCAGCCT 001120
001121 CCAACTGAGA AGGCCGGGCT GGTGGTCGGC TTCCTTGTGG CCCCCTTGGA ACTGGGCCCT GCAAACACCC TCTTACCTCA 001200
001201 GGGCTCCCTC CACACCTCCA ATCTGCCACT GCCCATGACA GCTTCATTCA GTACATTCCC TTTGATGACC CAAGGATATG 001280
001281 ACATATCAAA CCAGCTCAAC CCCAGGGTTT CTTTGAGGAC ACAAGGTTTG GTGATGAATC TTATAACTCC AGTCCTGTCA 001360
001361 CTGACCAAGA GCCAGCTCAT GGAAAAATGC TGATGGTGCC ATTTGGAATC CACAGAGCCT GATAGCCTTG TCAGGGAATA 001440
001441 AGAGAGGCCC GGCCTTGGCT GATCCTAGCC CCTTGTGGCC ATGGGTCCAG CTTCCCAGAA ATAGAGAAAA TGGTGGCTTT 001520
001521 CCTGACCAAA GAGACAGCTG TGTCATGAGA GGATGGCAGC CTGACGTGAG GGGCAGCCCT CACAGACACG CAGAAAGCCA 001600
001601 AGTTTCTTCC CCAAGCCCAC AGGGCGAGGG CACTCATGAA ACTCAAACCA AACTCAAAAA GTTTGAAACT GGGATTCGGA 001680
001681 CCAAGAAAAG TTCAAAGAAG TAAATGAAAA CCAAACAAAA AACAAGACTG TATTTTGTTT GATTTTCATC CCTGTGAAAT 001760
001761 CAGAATCACA GCTGCCAGAT ACAGGTACGG ACAACAATGA CTTAAATCTG AGTGGGTCTG ATGAGAGCAC AGCTCTGCAA 001840
001841 CCACAGAGTC TGCTGAGCAT GTCCTCCAGC GACCTGGGGC TGCAGGCTGT CACAGGAAGC ATGGGTGGGA GAATGGAGCT 001920
001921 CCAGAAACAA GATCTGGGGG TGGGGATCAG AAGAAGATAA AGCAATCAAT GACCTTGGAT TCATTTACCT TACAAAGCAC 002000
002001 AGTGAACTCC AGCATTCTGG CTCTTGGCCA GGGCACCAGT ACAGAGATCA GCAAACACAA ACCAGTTTCC CCAAAGAATC 002080
002081 CCAAAGCTCG CAGCTCCTTC CTGTTGCAAA GCTGGGAGGG TCTAGTGTGT CCCTGACTCC AAAATGTTTG GACCCTGCTG 002160
002161 CCTCCGAAGC TCAGATGCAC ACGGCAGTCC CTTCTGGTGA CCACAAATAG AGGTGAAGTG CCTGCAACCT GAAAGGTCAC 002240
002241 AGGTCCCTCA GCCCAGCCGG CAACAGTGCT TTTCCAAGGA CTTCCTTGTC CATAAATCAG GCACTGATGA CAAAAGCAGG 002320
002321 CTGAAGTCAG CCCTGTGTGG GTGTCTGTGG GCATGCAGTC CATCCCGGCC CAAGTGGGAG GTGGTGAAGG AGGTACCCAC 002400
002401 ATGTCTGTGC AACAGCAAAC AATGATTTGG GCGTTTTCTC CTGACACCAG TCAGCCATTG TGCCTGGGCT TATTAAGTTA 002480
002481 ACGAGTAAGT TTTAACAAGG AGCTTCAAGA AGACAAAGAA GGAAGACAAA GAAAACAGCT GCTGCAGCAA TGAGGAAACG 002560
002561 GAGGCGGAAC AGCAGCAGGA AAACTGTGCT GACCCCAGAC TGGAGAATTC AGGCTTCTGT GCAAACAGCC CGGAAATGAG 002640
002641 TTGAGCAACA GCCAAGGATG TGGGTGCTGG AGGACCCTGG GTTTAGTTGG GAAAAGGTAG ACATAAGTCT GAGAGCTGAG 002720
002721 GTGAGGAGCT GGAACCTGGC CACCTGCGTG TCTGGCCTCT GTCCCTGGGT CCCTCTCAGG TGGAGACGAC AGCCATGCAT 002800
002801 CCTCCGGGCA GCAGATGAAA GCTTGAGAGC AGAAGAGAGA CACCAGAAGG TTGGCAATGG AGGTAATGAG CCCAGGTCCT 002880
002881 GCAAAGAGAG TGATGTCTTC AAGATCAAGT GACACAAAAC CAGTGCCCCT CTCCTATCCG GCTGAGCCAA GGCGTCCCAG 002960
002961 GAAAGTCAGA AACTTACCAG CACTTTCAGT TCTGTGAAAC TGAGTGAAGT GGCCCCTTGG AAGGTTGATA GTGGTGGAGA 003040
003041 GAGGGACACA GTGCTCCCGC TGTCCCTGAC TAACCAGAAC CGAGGGCTCT TGGCACCAGA CTTTACAGGC TATGGTGTTC 003120
003121 ACCACAGGGC AAGAGTAGAG TGAGCCAGCA AGCTAGAGGG GTCTTTGAGG GAAGTGGATG CAATAGAAAC CCTCCAGGCA 003200
003201 AGTCCCTGCA AGTGAGGGCT GTGAGGATCC TGGGCATTGA ATGGCGGTGG CATTTCTCCT GCCAGTGCCC TGAGGGTGGA 003280
003281 GGCAGAACCA GCCTCCCTGA GGTGAGCGTC TGCAGCCCAG AGGCCCCCAG GCAAGAGTCG CTGTCCAGCT CAGCACTGGC 003360
003361 TGATGGCCTT ATGGTGTCCA CTGGTGCCTT TTATGGCAGG AGAAAGTGTG GCTGGACTGA AAGCCCTCTC TTGGTAGGGG 003440
003441 AAGGGAACAG TGCCACGTGG GCTCCCCAGG CTTCTGAGCA CTCAGACCTC ACTTAGGGTT GTCACCAATG GAGCTTCCAG 003520
003521 CCTTGAGTCT CAGGCTAACC CCTTGGAGTC CAAGTCCTTA AAGGAAGTGG GGGCAAAACT GCCCTTCATA TTCACTTTGT 003600
003601 TCTACTTCAT AGAGAGGACT CCAAGTGTGG CAGACTCAGA AAAGAGGCTC AGAAGTGCTT TCAAAGTGAT TGAAAGTTCG 003680
003681 CAAGAGAAAC TGGCTTCCCC CTGGCCAGGA GAGCAGACCC TGCCACCTGA TGAGAATGGA AGAGGTGTGT TGAGTGTCTA 003760
003761 TTGAGCTTCA GGAGTGCTGA CTCCCTGGAG GAGGTGGAGG GACTGAAGGC TTGGAGGGGC CAGGTCCACC TTCCAGAAGG 003840
003841 CTCTGTGTCT CTGAGAGGCA GGGACGACAG GGTCAGAGTT GGCCACTCAC TGTCTTTCTC CAAGGATGGT ATCTCACGGG 003920
003921 AAGAAAATGA GCATCTGGTG TCCAGATTCG TATGACCCTA GCAGAGAGGA GAGACTGTGA
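
Each sequence line above carries a position counter at both ends and groups the bases in blocks of ten. The sketch below strips that layout from the four lines covering positions 2641 to 2960 and translates the open reading frame behind the predicted small protein listed later in this record. It assumes that the `2657:2911` in the protein name gives 0-based, inclusive transcript positions (an inference from the sequence, not something the record states) and that the standard genetic code applies; `strip_layout` and `translate` are hypothetical helper names.

```python
# Four lines copied from the Sequence block (transcript positions 2641..2960).
excerpt = """\
002641 TTGAGCAACA GCCAAGGATG TGGGTGCTGG AGGACCCTGG GTTTAGTTGG GAAAAGGTAG ACATAAGTCT GAGAGCTGAG 002720
002721 GTGAGGAGCT GGAACCTGGC CACCTGCGTG TCTGGCCTCT GTCCCTGGGT CCCTCTCAGG TGGAGACGAC AGCCATGCAT 002800
002801 CCTCCGGGCA GCAGATGAAA GCTTGAGAGC AGAAGAGAGA CACCAGAAGG TTGGCAATGG AGGTAATGAG CCCAGGTCCT 002880
002881 GCAAAGAGAG TGATGTCTTC AAGATCAAGT GACACAAAAC CAGTGCCCCT CTCCTATCCG GCTGAGCCAA GGCGTCCCAG 002960
"""

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_layout(text):
    """Drop the numeric counters and the spacing, keeping only the bases."""
    return "".join(tok for tok in text.split() if tok.isalpha())


def translate(nt):
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    return "".join(CODON[nt[i:i + 3]] for i in range(0, len(nt) - 2, 3))


fragment = strip_layout(excerpt)
offset = 2641 - 1              # 0-based transcript index of the first excerpt base
orf_start, orf_end = 2657, 2911  # assumed 0-based, inclusive (from the protein name)
orf = fragment[orf_start - offset: orf_end - offset + 1]
protein = translate(orf)
print(len(orf), protein)
```

With these assumptions the slice begins at an ATG and ends on a TGA stop codon, and the translation agrees with the amino acid sequence given under Predicted Small Protein.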

Predicted Small Protein

Name NONHSAT082198_smProtein_2657:2911
Length 85
Molecular weight 9725.0399
Aromaticity 0.0833333333333
Instability index 58.2702380952
Isoelectric point 8.45977783203
Runs 12
Runs residual 0.00887170154686
Runs probability 0.0430651239475
Amino acid sequence MWVLEDPGFSWEKVDISLRAEVRSWNLATCVSGLCPWVPLRWRRQPCILRAADESLRAEE
RHQKVGNGGNEPRSCKESDVFKIK
Secondary structure LEEEELLLLLEEEEEEEEEEEEEEHHHHHHHLLLLLLLLLLLLLLLHHHHHHLLLHHHHH
HHHHHLLLLLLLLLLLLLLLEEEL
PRMN -
PiMo -
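
The Aromaticity value can be checked by hand from the amino acid sequence. The sketch below assumes the field is the fraction of aromatic residues (Phe, Trp, Tyr) in the sequence; the record does not say which tool or definition produced it, and `aromaticity` is a hypothetical helper name.

```python
# Amino acid sequence copied from this record (the two wrapped lines joined).
protein = (
    "MWVLEDPGFSWEKVDISLRAEVRSWNLATCVSGLCPWVPLRWRRQPCILRAADESLRAEE"
    "RHQKVGNGGNEPRSCKESDVFKIK"
)


def aromaticity(seq):
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


print(len(protein), aromaticity(protein))
```

The listed sequence has 84 residues, of which 7 are aromatic, giving 7/84 and matching the Aromaticity field. The Length field of 85 may count the stop codon as well; that reading is an assumption.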