NONHSAT103120

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT103120

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

6411 nt

Genomic location

chr5+:107717801..107724211

Exon number

1

Exons

107717801..107724211

Genome context

Sequence
000001 CTCGGAGCCG CAGGAGCGCG CGGCCCGTTC CGGCAGGGGG CGGAGCCAGT TGCGGTGGCG CGCTCGTGGC GGGGACGCCG 000080
000081 GCACCCGGCT GCTCACCCGG GACGGCCGTC ACACCGGCAC TACGGGAAGA AGCCGGCCGG CTAGGGCACG GAGCTCTGGG 000160
000161 CTTGCGGGGT GCGGCGCACG CCCCTCCCTC CCACCGCCTA AGGTTTTTCC GAAGCTGCTG GGCGTGCGCC GCGTTTCTGA 000240
000241 GGCGGGAGGA GACACTTCCG AGCCAGCTGT GTGACATCCG TTTACACAGC CGCGCTGACG TCTCGAGGTG TAAGTTCAAG 000320
000321 AGCTGCCATT GGCCGGGAGC CGCTGGGCTC TTCCCCTCAC CTTCCACCCC TGTCACCACC CCTGACTCGG AAACCGAGGC 000400
000401 CACTTCGAAG CCGCGAGGCG GGTTCCGCCG GAAGGGCCAG TGTGGGCGCT GACCGCGCTT CGGGCAGAGT GGCAGATGCG 000480
000481 GCCCCTGGGT GATGCGCACC GAAGTCATGT GAGGGGTTGA TTGGTTGTGC GGCACCACTG TTTACCCACT TGTGTTTTCT 000560
000561 TAATTGTTGC GTACTACTGC CTTTCCACAA CTTTATTTCA GAATGAAAGC ATATATCAAA TTACCTGTGC ACAGCAGAAA 000640
000641 GATAACAGCT TCCCCACCCA GTGAGCCATT CAACACCCAG GCTACAAACT GTAGATCACT TTATTGTCTA CACCCTCGGA 000720
000721 ATCCTGTCTA GCCGTCTGTT GTATATTTTA AACATGACGG TGCAGTGCCT GCTATATGAC AAACGTTGGT GATCCAGATG 000800
000801 AGAAAGAGGA AACAGCTATG CAGGCAGTCC TTTAGAAGGT TTGACTATAT CGAGGCTAAA GAGCTAGGGC AGCTGCTAGA 000880
000881 GCGGTACCTG GTGCGGAGAG GTTAGCTTTC ACAAGCGAAA AATTAGAGGA GGCAAATACC ATGGGAATGA GCCACTCTAG 000960
000961 TGAGAGAAAT AAGAGGAGAA GGTGGAATGG AGAGTGCTTT AGGAAGGATA CTTTCTTCAT TGTGATTACA GAGAATGAGG 001040
001041 AAAGGTAGAT AAAGGACGGA TACAGAGAGG TGGATGTTTC TAGCAGGAAG GTGAAAGAGT TCCTTTCTGT TGGTTTCTGT 001120
001121 TTTGCTTCAG ATAGAGCTTG GTTACCTGCT GAAGGGTGGA AGAGAAGTGG CAGGTGTTTA AAAAAAAATC TGCTAAGGAT 001200
001201 AACTTGAGAG AGCTGACTGA GGAAGAAGTA TTGCAATATT CAAGGCCCAG TGGCTTGGAG AAGACCACTG TTGGAACCTA 001280
001281 CATTGCTATT CCCACTATAG TGGCCAGCCC CACCCAGCCT GCTCTCAAAC ATGCTGTCAA ACAATTGTTA ATTCTGGCCT 001360
001361 TTTTGTCAGA AATGACCAAT CTTAAGTCTG TCACATTCAG GGATCCTTTC AGAGCCTAAT GACAGTTATG TGGAATTCTa 001440
001441 gaagcaaata tttgtcaaaa gatttcacag acatacacgc tattcactat caaatgcatt gcctgaacga catttacgta 001520
001521 cacatataat ccacataact ttcatccatt tgcctgtagg agaatcttcc agggccccag agcAGTGCTT GAGGACTTAT 001600
001601 TACAGATCCT TTGACATCTC TACTAGGCTG CACTTTCACA AAAGGCAAGA GTATTATAAA GTGTTTTGTA ACTTTATGTC 001680
001681 AAAAATATGA ACCTGTCAGA gccagatgtg gtggtaccta tcattcaaac tacaacaggg ctggtgaggc gtgaggatcg 001760
001761 cttaagccca gcttgttatg attatgcatg tgaatagcca ccgtagtcca tcttgagcag cgtagcgaaa cacagtctct 001840
001841 taaaaaaaaT TGCGACCTAT ACAtttcata taagttaaat acacatcttc cctatgagcc agcaatccca catctaggta 001920
001921 tttatccaga gaaataaaag catatatcga taaaaagaca tgtacaaaaa tgtccgtaac aactttattc atggtcacca 002000
002001 aaaactataa actgcccaga tgtccatcag tgggagaacg gattaataaa ctgatatatt catctaatta aatactactc 002080
002081 agcaacacaa gagaaaaagc atggatgaat ctcaaaaaGT TAGGATGTTT TATGCTTTCT GGATACGGGG CCCTCGAATC 002160
002161 GCTACCTGGT AACAGGCTCC CTATGTGATT CCTGTACAGA CAGCCTGGCA CCATGCAAAT GTATCAGAAC TACACATacc 002240
002241 atccaacttc aacttcttgt ttttagagat gaagcaactg actctgagag ttctgaagtc gttgctaaaa tttacaATAT 002320
002321 TTGTTTCCTT ACTCCCAGTT CCTAAACCAT TCAACCACAA GCAAATTGTC TTTGGTCTAC CTATTGGACT CTGTGCTGAC 002400
002401 AAAAGATACT TGTTTTTTTA TAATTTAGAT ATTTTGCATG TCTGTTCTCA TCACCCACCA TCATCACACC ACTAGTTCTC 002480
002481 CCCAGAAAAA ATTTCAGATT CTTTGAGGCT TAGGATTATC TCTAACCCTT CTTTTTATTT CTTTACCACA GCAAATAGCA 002560
002561 GAGAGAGTCT TGTATAGTTC ATCAGTGATT AAATAAGCTA CTACTATTAT CAGGAGAGCT GTAATGAGTT TTGGAACTCA 002640
002641 GTACCCTAAA TAATAATACT AATGTTGTAA ATGTTAAAGA TACAGAATTA TGAATAAGAG TCATAAATTA GAATTATAGA 002720
002721 GACAGTTGTA GCAGCTGATG GCTTTGCTAG TTGGGGGGCA GAGAGGGTGG AGTTGCagtg gaaaccaact ccggacagcc 002800
002801 aggagctgga ctggcactat tagctaggca tttgtggtac tggcagctgt cctggcattc tcagttctca ttgttttgtc 002880
002881 aaatataaac agtcccacaa aacatcaaca tcagttatag tcattttatg actacaatag gcaagacaaa aataagaccc 002960
002961 ctccacaatc atgtctgaac acagcaaata caagaacagt ctctaaacca caaaaatgac caaacattct ggttaatata 003040
003041 agtgaccact gtttcttggc caattacagg cttagccttg atccattctt cctgccttct agatagcaaa tattaaaata 003120
003121 ttaagttata ggatcacctg tttcctgcca gcatccaatc cagagcaaaa ccttgacact tccccaaaat cacctaacaa 003200
003201 cacactcatt cctgtaatgt ttttcacaat accaatacct tcttactgag agtgctcctg cccaacttcg ctcctcattg 003280
003281 ctctctgcct ccctgcaaca agacattaaa cccaccttgg tttgaatcta gttgttttcc tgctgctctt tagctagagg 003360
003361 gcattagcaT TTGTTATTAT CAAATGTCac caacctatct aaagttaatc cctcctttgc cctttcctct taccgttttt 003440
003441 attttcttta tagctcatta tcacattttt aaaaattgtt ggtttgttca attagaacac aggctccatg aggacaggga 003520
003521 ttttgttcag ttttTGGGAA GATAATCAGG CTACTGTGTG GAATACGGAC TAGAGGAGAA GCAACCTCTT AGAAGGAAGA 003600
003601 CCAGTTAAAG TTGTTCAAAG GGTCCATAGA GAAGCCTTAA AAGTATGAAT TTAGggccgg gtgtgatggc tcacgccttt 003680
003681 gatcccagca ctttgggaag tcaaagcggg cagatcacct gaggtcagga gttcaagacc agcctgacca acatggtgaa 003760
003761 atctcacact actaaaaata caaaaattag ttgggcacag tggcgcatgt ctgtaatccc agctactcag gaggctgagg 003840
003841 caggagaatc acttgaacct ggaaggcagg ggttgcagtg agctgagatt gtgccattgc actccagcct gggcaacaag 003920
003921 tgcaaaactc catctcaaaa aacaaaaagc aaaacaaaaa aaagaaTGTA GGCATTCTGG GAATAATGAA GTAAGGTTGT 004000
004001 TATTGAGAAA CATTTCTGAT GTGGAGTTGT CAGGATTTAA TTCTGAATTG GATAGAGGGA AGGGAATGAT GAATATAGGG 004080
004081 AGAAAACCCC CAAATTGCTT GTTCTTATGA TTTATTTCAA TGAATAACTT ATGGTGAAAT AATGGATTCT ATAACGTAAA 004160
004161 ATAGTTACAT ATTTGAATAA TATTCCAACC TGAGGTAGAT TTGAGTTATG GCATACTGAG ATTCACACTG AAAACCAGTA 004240
004241 TATAAATTTA AAATGTGGCA AAAAGGTAAA AAGAAAAGTT TTTTCTTCTT TGCTAACATA GGTTATACTA AATTTGAAAT 004320
004321 TCACATTTGC TTATAAAAAG ACAGCCTTTT GTAAAAGGTT TTCATGCTGC TTGAAGTTGG GTAATGGTGG TTGTGGTAAA 004400
004401 TAGCTAAAAC ATCCTATAGC ACCAGTGACG CACAAGGAGA ACAAAGTTCA CATGCCATTG ATCCCAGTAA TTGTGGTAAA 004480
004481 TGTGACAGAT GGCCTAGAAA GACATTGTCT GATGCATCTT AAAATCATGG AACACCCAGG TGTTCAGTAG ATTTAATTCC 004560
004561 ATATTATATT GTGTTTTTGC TGAAATACTC AAGGCCAAAG ATGATAGTGA AAAACAAGTT TCCAGAGAAT TCCCTTCAGG 004640
004641 ACAATCTATA TTCCTGAGTA ATTTGTTTCA GACATGATAT GAAAGTTCAG CTGctgcctg gcctgttctt ccctaagctc 004720
004721 ttctcttgac tgattcattc tgagtttcca gatgatatca gccctaatat cacctcctgt catgagagct cttccCCAGT 004800
004801 CACTCTGTTA CACTCCCCTG TTCATTTCCT TCACAGCAGT TACCACGGTC CGCAGTTGTG GTATTTCTGT GTTATTTGAC 004880
004881 TATCATTCCC ATCAGAATTA ATTCCTACTT TGTTTCCTCT CAAATTGCCA GCACCTAAAT TTGTTAATCA ACTGAATTTT 004960
004961 AAAGGCAGat tttcggagtc tgagacagga ggaacactta agtccaggag ttcgagacca gcttgggcaa catagggaga 005040
005041 ccttatcttt acgaaaaatt taaaaattag ccggacatgt tagctcacac ctatactccc agctaatcta gaggctgagg 005120
005121 cggaagggta gcttgagccc gggaggtcaa ggccacagtg agccatgatc ttgcaccact gcgctccaga acctgggcga 005200
005201 cagcgtgaga gacaccctgt ctcaaaaaga aCCAAATAAA TAAGTaaagg cagattatcc taggtgggcc tgacataatc 005280
005281 aagtgaactc ttaaaaagag ggtcaaggaa cggctgggcg tggtggctca cacctgtaat cgtagcactt tgggaggccg 005360
005361 aggcaggtgg attgcctgag ctcaggagtt cgagaccagc ctgggcaaca tggtgaaacc ccgtttctac tgaaatacaa 005440
005441 aaaaaaaaaa aaaaaaaaaa aaagtagcct gtagtcccag ctactcagga ggctgaggca ggagaattgc ttgaacctgg 005520
005521 gaggtggagg ttgcagtgag ccaagatcgt gccactgcac tccagcctgg tgacagagcg agactccaaa aaaaaaaaaa 005600
005601 aaaaaaaaaa aaaggtcaag gagaagtcag ggagattgaa agcaacagca gatgcttttc ttttagccgt gaagaagtaa 005680
005681 agtttcatgt tatagtgaga cctatatggc agaaaatggc aagtggcctc taagaactga gagtagcccc tagctgatgg 005760
005761 tcagcaagaa agcaggcctt ccttcctaca ggattcttaa ttctgccaac aaccacatga gtttggaaga atacccagaa 005840
005841 ctacagaagg gagcacagct tggctgacac cttgatttca gcctggtgag aacctgagca gagaacccag ctgagcccat 005920
005921 gccaggactt ctgacttaca gaactacgag ataataaata tgttatttta agtcacttca tttctggtaa ttcgttTTTG 006000
006001 CATCAAATGC AAATGTTTTT GCATGGAATA AAGTGTGTAT TCAAATACAG TCATATGGAT ACCAAAACTT GATGTAGTTT 006080
006081 TAATAGCAGA AACCAAATGA TCAATTATCC ATAACTTCAA AGATTAATTC AACCTCATGA AAAGAAACAA CTAATCAGAA 006160
006161 GTTAATACCT GCACAGACTA TTTGTTGTTC TCCCACTTTT GCATAAATTA GATAATTGAG ACAAAGATAT GGGCTTATTT 006240
006241 CTTTTGCTTA AGACTCACAT ATTTATCTTT TCCTTAGATG GAATTAGTGT AACACAAAGA TGTTCCACTA TTTATATTCC 006320
006321 TTCAATTCTG TGTGTGTCTA TATTCTATAA TTAGTATTTT CTCATTTTCA ACTGACAGAA ATAGTGCTTA TAGAAAAAAG 006400
006401 AATTAAATTA C
[back to top]

Predicted Small Protein

Name NONHSAT103120_smProtein_3383:3544
Length 54
Molecular weight 6519.6405
Aromaticity 0.207547169811
Instability index 53.520754717
Isoelectric point 7.99774169922
Runs 9
Runs residual 0.00954901058444
Runs probability 0.038970342892
Amino acid sequence MSPTYLKLIPPLPFPLTVFIFFIAHYHIFKNCWFVQLEHRLHEDRDFVQFLGR
Secondary structure LLLLLLLLLLLLLLLLEEHHHHHHHHHHHHLLEEEEEHHHHHLLLHHHHHHLL
PRMN LLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLL
PiMo oooooooooooooTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiiiiiii