NONHSAT128044

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT128044

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4718 nt

Genomic location

chr8+:102184675..102206979

Exon number

12

Exons

102184675..102185089,102185400..102185828,102187525..102188123,102189948..102190741,102193084..102193501,102194667..102195175,102197261..102197622,102198755..102199243,102205019..102205124,102205782..102205943,102206528..102206733,102206751..102206979

Genome context

Sequence
000001 GTTTAAAAAA GAAAATGGGG GGAGGGGAAT GTATGTGGGT TTGTTTAAAT GCTGAAGAAA ACTGTTCTAA GCAGCTGGAT 000080
000081 GAATAAAAGG GAGGAGCACT AAAGAAAGAA TGCGCTAAAT CTAAAGATAC AAGAACAACA CCCAGGAATT GATTGAAAAT 000160
000161 GCTTTGGTGA CAAGTGAGGC ATGTAATGAC AAGAAAGCAG CATGTGTGCA TATTGAGGAG TGCATGTGGA AAACAGGTTT 000240
000241 CTCCTTTTAA TTAGCTTTGC CGCTGGGAGG ACCTTCTCTC TCCACTTTCT GTCCAGCTCT TCCATCCCAG AATTCCACTC 000320
000321 TGGGTTCTTG TGATGTGCTA ACGAGGCTGC AGGGCAGCTC TGATTGAGTG GGCCAATAAT ACTGGCTCAA GGCCACCTAG 000400
000401 AGAAGTGTTC TCCCCGTGTA TGTAAGGGGA AGTGCTGGGT GCAGCCTCTG GGGTGGGGTT ATTCCTATCC TCCTCTTTCC 000480
000481 TGCCTGGACC ATGCAGATGA GGGCCATCCC CTAGAGATAA TGAAGCAATA AGATGGGAGA TACCTGGCAG TCCTTGACAC 000560
000561 CATGGACCCG CCATACTAGT CTTTGGCTGC AAACTCTCAC GCTGTCATGG AAGATAAATA AACCTCAATA CTGCTTAAGC 000640
000641 CATTGAAACT TTAGGTCTCT GTTGCACACA GATGAACCCA TAGTCCAACA AATGTATCTT CTTGACTAAA TCTTACCTTC 000720
000721 TTACTCTGAT GATTTATATT CAGAGAATAA ATTTAACTTT GTCTATATCT GGGGTAAAAA AGAACAGCTC TTCAATAAAT 000800
000801 CTGCTTCTCT CCAAAAATAT TCCAGAAAAT TCATGTGATT GGAATTATGT TGAGAAATTT TTATTTCGTA TTTTTATCTT 000880
000881 TTATTTCAAT TTTTCTACAA ACTTTTTGAA GTCCCCTGTG TGTATGTATA TGTACATTCC TGAGCTGGAG TATAACATTA 000960
000961 CTTTGGACAA AATTCTTTAC ATTAAAATTC ACTTCTAGAA TAAAGAGGTT CACAACTATA GGTTTCAAAG AATATCAGAA 001040
001041 CAAGAAAGAG AACAAGAGAT CTCTCTCCAG GTTAAACATC ATTCCACTAT CGATGAGGTT TGAATGTAGT TGCTCAAGCA 001120
001121 GCTGTATATT CACCTAAACA GAGTAACAAC CCAGTCTACA TTTCTCCACC ATATCCATCA TATAATGTGA GGATTTCGGA 001200
001201 GTGATTTTGT CAGATGTAGG GTTAAACTCC ATTGCCAGCA CCCCTCAATG TACCCAAACA GGAATAGTCT ACTTTTCCTG 001280
001281 ATTCCTTTTT AAAGTCTCAC AAATTATCTT TTTAATTAAG CATTCTATAA TTTTTTCCCA GGAAACAGCT TCGATTTTAT 001360
001361 CAGCTGGTAT TTATTGTCCA TGAGTCACCT TTTGGACTAC TGGGTTGACA TTTCCTCAAT TCCTGTCTTT TGGCCACTCT 001440
001441 CATGAGAGCT GAAAAACATT TTTACTACAT CTTGCCTCTC GCAGTTTGAA ACTGGGGTTA GGAAACATGG CCTATGGATG 001520
001521 GTAGTGGCTT GTGGTCCCTG GTTTTAAACT GCTCCCATGG TGGAATCACT TGAGTCTCTC TCCTTTCATC TCCACACAAA 001600
001601 TATTCTCCAT CCCACCTCTT CGTGCAGGCT TCTGGCTTTC TGTGCACTTT CAGACATTTT GGACATATAA TGCTCCTCTG 001680
001681 CTATCTGTTC TCTCAGGGCT TCCATATTCT AGTCCTGAGC CCTGTGTTTC AGGGGTATCT AGGAAATCTT GATTTCTTTC 001760
001761 AGGTGTTAAC ATCTCAAGTT CCTTTGCTTC ATATTCACAC TTGAAATCAG ATATAACATT CCTTACTTTT TCCTCCTGTT 001840
001841 ATTTTTGCAT GCAAATTAAG AGATTCTCCA ATTGCTCTGT GTCATACGTT TTCTTACGAA TAACAGGGCT TATAGTTTCA 001920
001921 CTTGAGTAAT TTTTCCACCC TCTTAAACTG CAGTTTAAGG TCTTGATGAG GTTACCAAAT CTCTGACAAG CATATATTCT 002000
002001 TTCCAGTCTT CACAGGGCAC GCTTCGGCCC TTGCTAGGAG TCCATCGTTC TGCCATAACA TTTGCTCCCA GGTCATCTGC 002080
002081 TTGCAAGAGG CAATCGAGAG TTTGGGAAAC CCTTACAAAA GAAATGACAA TAGTTTATTT TTAGCTCCTG ACATTTTGTG 002160
002161 CTCCTAATAC CCAGTAAGCC TTTCTGCCCC ACAGTCAGGA TGCTTTAATT GGCAGTAAAT GGTGAGGCGG TAACCCATCC 002240
002241 AGAAAAACTA ACTTTTATTT GATAGTATGT ATAAAACCAT CTGTTCATTT CCCATACCTC TTCCAGACAT GACCTTCAAC 002320
002321 TGGAAGGGAT TAAAACAAAC AAACAAACAA AAAAAAAAAA CACAAAAAAC ACTTGTGCCC CAGAGCCTTT GGTTTACCAC 002400
002401 CCAGGGCTCT GATCACTCGA GATACATTCC AGAATTCATT TAACATGTGT TCCCACCGGA GTCAGCAGAA TGGGTAACCA 002480
002481 CGGGCAGACA GCACATTCTG GCAATTCTCA GGCCCATGCT CCAGAAAGAC AGCCTTAAGT TCATACTCAG GGTTTATAAA 002560
002561 TAGTGAGTCA TCAACCACTA CCAGTTGTCT TGGTACTAGT GACATAATCT TGTAACTCCT GGTGACTAGA GAGCTTCATT 002640
002641 AATTTTGTGG AGAATTTTTT TTTTTTTTTT TGACAGTGAT GGAGTCTCAT TTTGTTGCCC AGGCTGGAGT GCAGTGACGC 002720
002721 AATCTCAGCT CACTGAGACC TCCATCTCCC AGGTTCAAGT GATCCTCCTG CTTCAGCCTC CTGAGTAGCT GGGATTACAG 002800
002801 GTCTGCACTA TGAGGCCCAG CTAATTTTTG TATTTTTAGT AGACACAGAG TTTTGCTATG AGGCTGGTCT CGAACTCATG 002880
002881 ACCTCAAGCC ATCCGCTGGC CTTGGCCTCC CAAAGCGCTG GGATTGTAGG CGTGAGCCAC CGTGACTGGC CCAAACATTT 002960
002961 AACAAAGAAA AACAGTGTAT CATCTTTGAG AGACTAATGA AAAGTCAATA AAACAAATCT GGTAGAAGTA TAAATCACAG 003040
003041 AGCAGAATTG TATTTTACCT TTCATTAAAA GAAAAAAAAA AGGATCTTAG TGTTCAGACA GTAGCTTGGG AAGGAGATCA 003120
003121 ACACTGACTG ATATCCAGTC ATTGTCAGTG TTCTGTTAAA ACTGTTTTTT TTGAGACGGA GTCTCGCTCT GTCGCCCAGG 003200
003201 CTGGAGTGCA GGCATGTATT ACTTTTATAA GTAAAAACTA CCCCCAACAT GACTACTCAA GAATGAGATA CTATGAAGAA 003280
003281 AAGTTCCAGA GCATTTACAA GCTAAGAGAG GCATTTCCCA TAGCTATTAA TTCAAAATAT AAGAAACCTC AAGTATTTTG 003360
003361 AACACTTATA CCTCTCCAGA CTGGACACCC CGATGCAGCA ATGTATTTTC ATAACCACCC CTTCCAGAGA GTATTACCTT 003440
003441 TTAGATAACT AATGAGCTTT TTTTAAAATA AAAAGTTTCT ATTATGTGCA GCTATAGAGC TAAATGCTCA CAACTACTCA 003520
003521 GTAAAGAGAA ACCTAGGCCT CGGAGGCTTT AGCCACTTGC CGACAAACTG AATGACAGCT TCTGAATGGC TCCTCTATCA 003600
003601 CTTCCCCATA CAAAAACCTT TCCCACAGAC ACCTGGCCTT CCAACCTACT TCTACCAGTG TAGAAGCTTT TGTGCATACT 003680
003681 TTTAACCCAG ACAGTTCATG TCTAGGAATT CTCCCAAGAA AATAACCACG ATCACTTTTT TTGACTGAAA AGAATAAAAT 003760
003761 GGATCTCGGG GTTTAAACTG TTGATCAGAA AGGTTATCAC AAGTTCAGTA TTATTTTCCT GTTCAACCAA CAGTGCGTAA 003840
003841 GTGGCTCAAG GTGAGTTCTA TTTTCATGCT ACCAATAAAA TCCAGTGCAT TTATATATAT TAACATAATT TATAACTTTA 003920
003921 TTTTGTAATT TTATAAATTA TTTTTTAAAT ATTACTTCAT GTTTACGTGC ATTTTATAAA ATTTATATAT AAAGTTACAT 004000
004001 TTTATACATA ATGAGCTACG AAAAAGCACA AGCAGAAATG GGCAAATGTG AGAAGGGTAT GTGTAAAGAC ACACTGTGTT 004080
004081 AATTTCAGTA GAGGGAGGAG GGATAGGAAG ACAGTGGGAA GTGTTACTAT GAAAATCCGC ATACATAAAT ATTAAATAAT 004160
004161 AGAACCTAAT AAAAGACAGA TCCCAAACAT GAAAGGTTAT TTTGTTACTG TAACTTACTA AACTGACTTA ATCTAATAAA 004240
004241 AATACTATTT CTTCATAGTA GTCAGGTAAC AATAAAAAAC ATACAGGAAA ATGGTTAAAA AAAACTACAG GTAATCTACC 004320
004321 CAATGGATCA TTATGCAGCC ACAAAAATTC CACTTAGCAC AGTTTATAAC ACGAAAAACC CACCTGTTAT GTTGTTAAAT 004400
004401 GAAATGGCAG AGTACAAAGA CATAATATGA AAAAAGTGAC TACAAAATTC AAAAACATGC ATTAATTTAA AAAAATCTAT 004480
004481 GTGGTAAGAA GACAGACATG GATACTATTC CTCAAAATGT TTTGTACTGG TCAATTTGTA AAATCCAACC TTACCACAGT 004560
004561 CTTCACATTT TTATATATGA GATATATTTC AAAATGTGTT ACTTTTATAG TGAAGTAGAA GACAAGAAAT GTCGTGTATA 004640
004641 GCCACACTAT AGTCTGGGGT TATTTAGTGC AGGGACAAAC ATCCCTAAAT ACAGCTTTTT GGGCATTAGT AGTATCAC
[back to top]

Predicted Small Protein

Name NONHSAT128044_smProtein_728:997
Length 90
Molecular weight 10707.5241
Aromaticity 0.179775280899
Instability index 51.4731460674
Isoelectric point 8.92816162109
Runs 14
Runs residual 0.0141000220313
Runs probability 0.068835642365
Amino acid sequence MIYIQRINLTLSISGVKKNSSSINLLLSKNIPENSCDWNYVEKFLFRIFIFYFNFSTNFL
KSPVCMYMYIPELEYNITLDKILYIKIHF
Secondary structure LEEEEEEEEEEEEEEEELLLHHHHHHHHLLLLLLLLLHHHHHHHHHHHHEEEEELLLLLL
LLLEEEEEEELLLLEEEELLLEEEEEEEL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHH
HLLLLLLLLLLLLLLLLLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTT
Toooooooooooooooooooooooooooo