NONHSAT134620

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT134620

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5222 nt

Genomic location

chr9-:125874493..125879825

Exon number

2

Exons

125874493..125878096,125878208..125879825

Genome context

Sequence
000001 CCTGGAGTTT GCTGTCTTCC CTCCAAGTTG GACCCGTGTA TGGTAAGCCC TGCTTGGATG GCTTTGGGGT CTCTAAATTG 000080
000081 CTGCCTGTGA ACCACCCTGT TGAGCGTGTT GGATTCTTCT GCTGCTGTAA CCACTTTACC AGCTCCTGGG CTAATGAATG 000160
000161 ACAGCTGGGT CTCTGCTCAG GCGTGGGTGG GTGCGGGGAT GGCTGCGGGA TCCTTTTCAG GACAAAGTAT TCCTCATCCA 000240
000241 CCCCAAGTCA TCCTCTTTCT GGGCACTTGG CAATGTTTCT GTGCACCCTG CTGTAATCCT GGAGGGGAGC AGGAAAAAGG 000320
000321 AACCTTTGAA TCCCAGGCTG TCGTGTTCTA GAGCATGACT GTGTCAGGGA GAATGAGCGC CTGGCATTCA TAGAACACAG 000400
000401 TCCTCTCTCA GGCCCTTTCC CGTTTGCCAT TTCATGACTT GCTGTACTGG TGGCCCTTGG AGAAGCAGCA GTGCATAGAT 000480
000481 CGTTGTTCCC ACCTTGCAGA CAGGGAGAGT AGAGCCCAGA GATCTCTCCA AACCACACTG CTTATTTGTG GCAAAGCATT 000560
000561 GTGGCCCCTA ATGAGTTTTT AACCACCAGA AGAAATAACA TTTCTTAAGA CTGTTTTTAG AGGTAACAGC AGCCCTTTCT 000640
000641 GAGGAATTAG TTTTGAATTT CACCACAACT GGAATTAGAA ATTTTCTCCT TTTACAGATG AGGAAATAGA TGCTTAGATA 000720
000721 AGGTGACTTG CCCCAAATTA TAGTGCGGTG AGTGGCGGCG CTCACCCCGT CTGAGGCTAT GCTGTGTGCA GGTGGTTTGC 000800
000801 GTCGCTAGTA GGCACCAATG TCCGTTCTGT CTTCCAAGCC CACTTGGCAG CAGTGGCTTC TCTCTCTGTG ATGTGTTCGG 000880
000881 CTCACCTTTC CTGTAGGCTT CTGGGCTATC CCCTGTGCGC CTTGAAACCA AGCAAAAACC AACCCACATC TCCCTGATCC 000960
000961 TTCCTTAGCC TCCTAAAATC CAGACTTTCT GGGCTCGTTC TTTTTCTGTT CCTTTGTTAG TGTGCTACTG GGCAAGGTGA 001040
001041 CCAACTGGTC AGCCCTGGGC CAGGCAGCCT GTTTTTTTGT TTTGTTTTCT TTTTTGCCAG CTACACAGTT GGGGTTAATT 001120
001121 TCTACAGACC ATCATTTTAT ACTGTAACTG AACCCTATAA TAAAGTGCAA ATTCAGTAAT TGCTACTTGG AGAGCTCCTC 001200
001201 AGGGGTGGGA AGGAGGACAG GGGATGTGCT GTTGGAAGTA TGTGTTTTAT GTTCCGCCTC ATCCACACTT TGGTTTTTGG 001280
001281 CCACATCCTC CGTTCCTGGC TCGGCCTTCA AACATGTGCT CTAACCCATG GGATCACCGT CACAGTAACA CATTTTCCAG 001360
001361 AGCAATCTGG TGTTGGGATC CCCTTGGATT CTTCCATCAT GAGACTTCTC GGAAGTCTCA GCTTTCTAAA AATCCCTCAT 001440
001441 TCCTGCAGAG GCTACGTTAA AGGAATATGG TAGCCATCAT CTGGGAGGTA AAAAACCTAT GGAAATTGGC ACCTGGGGAA 001520
001521 CTCAGGAAAG CTGAAAGCAC CTTGAAATGA GATGGAAGGA CCAGAAGGTT AAGGCTGGTG AAGAGGAGAC CTGGAGAGAA 001600
001601 CGTGGGCACT CCTGCCAGAC GCAGCCTTGA GAGAGCAGCT GAGGTGCGAC ATGAGCTCCC TGTAACCAGC ATGTTCAAGC 001680
001681 AAGAAGCCGG TTGTGTGGAC GGCAAAGAGG AAATGCTGTG GAATGTCAGT TTGGACGAAC TCTGATGTTG ACTGGTGATG 001760
001761 CTGCCAAGAC CATGTCTCCC TTCTTGGGCA CGTAGATTTA TTGAGAAAAA CCATTCACTG TCACCTACTG CTTCTTACCA 001840
001841 GGCTCTGTGA TGGGGGAGAA GGAGCAGTTG GAGAAAATGT CAGAATCTGC AGCTTAGACA ACTGAGTAGA AGGTGAAATC 001920
001921 ATTGACCCAG ATGGAGAGTA CAGGGAGGGG AAGAAAGAGG TGCAGAGGAT TTGAACAGAA TGAGAGTCCT CCTCTGTGCA 002000
002001 GGGAGAAAGG CTTCAGTGGG TCTCCCCCTC CCTGCTAGGG ATCCTCCTTG CCAGCCCTGG GCTGGGCGCT GGCAGGCCCG 002080
002081 TTCCCAGGTC TCCCGCCGAG GGCAGGAGAG GAGCCAGGCC TCAGGAGAGA AGTGCTTACT GAGTAATTTA CCAAGAGCGA 002160
002161 TAGGCTTCCC ACTGAGCCAG ACGGGATCCC CAGGAGGGGA GTCAGAGGCG GGCCTGGCAT GCATGCACAG AGAGAGTGTG 002240
002241 CCCACTTGGG GAGTGTGAAG GGCCTTGCTG TGGGACCCGA GAGGATGAGC AGAGTCAAGT GGCAGGTGGG ACACACAGTC 002320
002321 TGCAGTCACT TCTTAAAAGG CCCTGGGTGG TGTGTCCTTC AATGATGACC ATTGTCCACA ATAGTGAGTT CCAGAGGGTC 002400
002401 GATTGGGACA GTTGGATAGC CAAGGACCAG AGTCAGGAGG CAAGGCTGAT GAGCAAGAGG CTGTTGCCAT TGGCCAGGCG 002480
002481 AGGCGTGAGG GAGGCTGATG GAGCGCAGCA GCAGAGGGAT GGAGAGGCAG GAATGGCATC TCCTCACAGG CCAGGCTCCT 002560
002561 CTGCCGGATT GGGTAGGGAC GGGAAGATGA TGGCCCCCAA CCCTGGGCCA AGGAACAATA GCACCCATCC GGGGGCGGGG 002640
002641 GACCTGCTGG AGTACTTCTC CTGGACCACT GCTCAGTGGG CAATAGAGGG AACAATGGTG TTTCTGACCA GGACGGGGGA 002720
002721 AGGCAGGGGA ATGGAGGAAG ATACTGACAG GGTACAGAGG AGGTAGAATG GAGCGAGGGC CATCTCCACC GGGCTAGGAA 002800
002801 ATGGGGCTTT GGTGGGCCAC ACTCCATTGT GCTGGAGCAC TCTGGCATCT CCATTGGCAG TGGGGAGGGG CTTGTGACCT 002880
002881 ATCAGCATCT TCACCCTTTC TGAGTTTCCG CCGGGCTGGG CTCTCCAGCC AGGAGTCACA GGTACCGGGT GGTTGTTCAT 002960
002961 GCTGAGGCCT TTTGCTTGGC AAAAAGTTAA TCCATTTGTG TTCATTTTTA ACCTAAGCAA GGGTTGGAGC TTGTTGGATT 003040
003041 CAGCATCTGT GGAACACAGT CTGTAGCCCA TGGGAAGACG TGGACATTCT TTCCCAAGTT CTTGAAGGAC GGAAAGGACC 003120
003121 ATGAACTTGT GAGGTGACCG GAGGATTGGT TTTGGGTGTC AGTTTTGTGT CTGTCCCAGT GCTGCCACTT TTGCTTTGAA 003200
003201 ATGGACGGGG AGAGCTGAAG GCCAGCATGG GTTTCTGCAC CTGTTTCCTA CAGCTCTGCT GGGGACTGTG GCTCAGGTGG 003280
003281 CAGCTGTGGA CACGGATCCT GTCAGCCTTT GGGTGGCCAG AGCTCCAAGG CCAGGTGTCT CTTAGCTGCA AGCAGCCTTC 003360
003361 ACTGTTAATC AACAGGCTAC ACAAATCTCA CCTCTAAAGA GGTGAAATGT GAAAATATAT GGGAAGCGCT GCCTTAGCCC 003440
003441 TGTGTCCACA AATCCTTTCT GTAAAGGGCC AGATAGCATT TCAGGCTTTA TGGACCATGT GGTCTCTGCC ACAACTGCTC 003520
003521 AGCTCTGCCA CTGTAGCACA AAAGCAGCCA GAGACAGTGC GTAAATAGAT AGCATGGCTG TGTTCTGATG AAGCTTCACT 003600
003601 TACAAATACA CAGGGCTGTA GTTTGCCAAC CCCTGCAAAG TAGGGAGTAT TTTGCCAAAC TTGGCTGTAT TTCACCAACC 003680
003681 CCTTAGACTA TAGAGATGAC TTACCCTTTG GCTTAATCAT GACTGGAAAC TTCCTTCCCA TCCACTGCCT GGATCCATCC 003760
003761 ACCTCCCCAC AAAGGCTTCC TGAGCATCTC CTCCGTGTGA GCACTGGCTC AATGCACAGG ACGATGTAGT TCCTGCCCAT 003840
003841 ATGGAACTTA GGTTTTGAGT GACACGTGGG ACATTGGAAG AGGAGAGCAG GTGGCTAAAG AGCATCATAC AGAACAGGGC 003920
003921 TCAGGGAAGT CCCAGGTGAT AGAATCTCAA AGAGAAGGGC TGAGGGTGAA TCAGGAAAGC CCTGAAGGCA GGCAGCTCCG 004000
004001 GAGTCGGGTT TAGACGAGGA GGCTGCAGTT CTCCAGGAAG GGAAGCAGGA CCAGATAACG GATGAGATTG TGTCAAACAA 004080
004081 TGAGGAGGCC TGCATAGTCA CACAGGCGCA TGAGGTCTGC TCGTAAAGGA CCAGGAGGGC CAGGCTCAGG GGTGTGGACT 004160
004161 GGATTCTCTG GGCTGTAGGG AGCCACTGGA CAGCTGTGCT CAGGAGGGTC ATGTGATCAC AAATGCTTAT TAGAAATGTC 004240
004241 ACTGGCTATG GAGGACCATG TCTGTGGTTG TAAGTTACGC CTTTCCGTTT AGTCTGGACT TCCTGGTCAA GGTACTTTTG 004320
004321 GTATTAGAGT GGGGAAGAGA GGCCCAGAGA GTTGGACAGG TTTTCTGACC ATTGGCTTAT AGTCAGTGAG CACCCCACCT 004400
004401 TTATTTTGAC ATGACAGAGC TTTACAGAAT CTTCCTTCTT CAGGCGTTTT CTTTCCTGCT ATCCTAAAGA TTCACGTACG 004480
004481 AGGGCAGGAT AAATGAAACA TTCTGTCTCC AATCTAAGTT CGGCTTGGGG CCTCAGAATA TGAAGGTAAA GGGGCGGGGA 004560
004561 CGGACTGACT GCCCCTTGCT TCAGAATCAA GTCACATGGC GGGAACGCTC TATGGAGCAG ATTGCTTCCT TTGTACATGC 004640
004641 CCTAGGCCCG TGTTCAGCAG GCCTCAGAGA AGCAGAGGTG CTGGCCTCTC GTCTCCAGAG GCTGCCAGCT ACTAAGGGAG 004720
004721 GTGGAAGCAA CTACCCCTGG GGGTGGACTG CTCTGGGCCG GGCCCGACGT CAGGCGTAGG GGACTGAGGT GAATCAGACG 004800
004801 TGGTGGCTGC ATTGAGGAAT GCTCAGTGTA GTTGGGGTGT TTGTAGCGCT TTCTGCTGAG CATGGTCTTT CCTCTCAGAG 004880
004881 GGGTTTCTAG GCACGTTGAA GGGAGCTGCC GTTCAGAGCA TCGGAGAGGT GCCAGGGACG TGAGCTGCTG CCAGTCTGAC 004960
004961 GTGAACAGAA GGTGGGAATA GCTGGCGCTG CCGTGCGTTG ATAGGAGCAC AGTGTCCGCT GCGAAGGGGC GGCTAGGGCT 005040
005041 GGAGAAAGAT GGCCTGATAG AAGCGCCTCC ACAGGCGCTG ATTCAGCGAA GCCCCAGCAG CCCCGCGGGG CAGGCGAGGA 005120
005121 TCACATGCAG CTCGGAGAAG AGGAGGCACT TGCCCCTCTG CCTGAGCTGG GAGGTGGTGG CCAGCTGCAC ACAGGGCTGT 005200
005201 CCCAGGATGC TGACGAGGCG GG
[back to top]

Predicted Small Protein

Name NONHSAT134620_smProtein_1979:2257
Length 93
Molecular weight 9839.1725
Aromaticity 0.0434782608696
Instability index 74.9347826087
Isoelectric point 10.1276245117
Runs 15
Runs residual 0.0278385986863
Runs probability 0.0404816581288
Amino acid sequence MRVLLCAGRKASVGLPLPARDPPCQPWAGRWQARSQVSRRGQERSQASGEKCLLSNLPRA
IGFPLSQTGSPGGESEAGLACMHRESVPTWGV
Secondary structure LEEEEELLLEEEELLLLLLLLLLLLLLLLHHHHHHEELLLLLEEELLLLLLELLLLLLLL
LLLLLLLLLLLLLHHHHHHHEELLLLLLLLLL
PRMN -
PiMo -