NONHSAT103321

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT103321

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4776 nt

Genomic location

chr5+:115074730..115095533

Exon number

4

Exons

115074730..115074811,115090392..115093563,115093739..115093903,115094177..115095533

Genome context

Sequence
000001 tgaatcccca tagtatacaa tttatttctc catgggcaac aagacatgcc agcaactgca cggctacttc tccatttagc 000080
000081 catagggaga ccccatctct aagaaaaaaa attagccaag tgtggtggca aatgcctgtg gtcctagcta ctcaagatgc 000160
000161 tgaggtggga ggatcacttg agcccaggag gtcgaggccg cagtgagcta tgattcacat cactgcactc cagcctgagt 000240
000241 gagagggaga ccctgtctca aTCAATCAAT CAATAAACAC AGAAGCAAAG TACTGAAGat aatgctgtat atttctggat 000320
000321 ctatccacac ttgttctgca gaagcccagc cccttcaaat tcacccaaaa ttgtttttga agtctgttgg gggtgtccca 000400
000401 tgtgtATGAA ACCAGCTCTT TTGTCTACCT CCCTGGTCTT CCATTTCCAT CCTCTCCTAC AGTGATGCCC CCTGCCAGTG 000480
000481 CCCCATGGGG CAGCTCTCTT CACATGGCCA TGCTAGTGAG GAGTGCTCCT CATTCCTGCT GGTGAGTGCC TGCCAGCATC 000560
000561 TCAGGCAAGG GACTGGAGTT CTTACAAGAC TACCATCCCC TACCTGGGCC CAAACTGACC TTCTTCTATT ATTTCTCTTT 000640
000641 TGCAGCTATC CCAGAGAATT CTGCCACCTT CCAGAGCCAA CTGTTTTCCC CTTTCTCTGC GTTTGATAGA AAAGAAAGAT 000720
000721 AAGAAATAAA TAGGGCTTAA TTTCCCAAAC CATTTATGTA GTTTTTCCTC CTTTTCAGGA CAGAGAATAA AGGGTCCCTG 000800
000801 TATTCATGCC TTCCTGGAAA TCTCAGTGCA AGGCCCTATC AGGAGGTTTC CTGGAGCAGC TAAGGGAACA GCTAGACCAC 000880
000881 CCAGGAAGAA ATGACAGAAG CCTACCTGAC TTCCAATCTG CTTGCTTCAT GGTGTAATGG GGGCCTAGAC AGAAGCACCC 000960
000961 CAAGGACACT TACAACAACT GTTCTGCCAG CTCTAGTCCA TTGGAGAAGA TGTGCCAACA TACCTTCTCA CCATCCCTAG 001040
001041 AGCCCAACCT TGCTATGGAA AGGTCAAACC CTTACCATTG GCCCTGCCAA GGATCCCCAG ACAGAGAAGC ACAATGGAGG 001120
001121 CCATAAATTA TGGCTTCTTA TCATGGCCTC CACATAACTT AAACTCTGGA TGGTGAATGA AATCTGGGGC CAAGTCTTAA 001200
001201 AGGGTCAGGG TAGCCAATGA GCATAACTCA ATATTTGTGT TTCTAAGAGA AAACCACAGA ACCTCCATAT CCATGgcaaa 001280
001281 ctggccagcc acaggctgga tccagtgtgc agtagtgttt tgtgtacacc attcactatt tgtctacttg ggttaaaaat 001360
001361 gttttatatt agttagcaat gtttaaacat tgggagaATT Aaagggaaat gaaaacgtac ttcccacaaa tacttgcaca 001440
001441 taaatattca cagcagcatt gtttataatt gcccaaaact agaaacaacc caaatatcca tcaactgatg aatgataaac 001520
001521 aagttgtggt atattcatcc aataaaacaa tattcagcaa taaaaataaa caaatcaggc acttaatata attgaatatc 001600
001601 aaggtcatta tgttgagtga aagaggccag aaaataaaag tatacactat ttgattccat ttatataata ttctagaaaa 001680
001681 tgaaaaccag tctatactta cagaaaacaa ggtggtagct tacagagatg gggtagaggg agggagagat gacagagagg 001760
001761 cataaaaaaa ctttccggac aatgaaaatg ttctgtattt taattgtggt gatggtttca tggatgtaca agttcatcaa 001840
001841 atctcatcaa gttaaatata tgcatttatt ttatattaag tgtacctcaa taaagcagaA AAATTTaaaa ttggtaatat 001920
001921 ttatatacaa attgagatat ctatatttct tgaaaatcaa aatgtctgac atcatttggt ctacatattt atctgccaaa 002000
002001 aatacaggag atagacagta gctgtccttt agaaggggca taccctttcg agtttgcaac aggccttcat acttcctgct 002080
002081 gcttacccag cATTCATTAC ATGTTTATTG tcatttattt aagttactta ttgtgaacct aatgcatgta tggcctcatt 002160
002161 ctaggtactg gagactaaac atgcaaacac acctaactgg catggtttcc atttgcctaa ttgtcacggg gtaccatgaa 002240
002241 gtttacattt tagtgagatt ggagaaagat agacatttag agttaataag ttggataaat agtatattag gtggaaagaa 002320
002321 gtataagaaa aaaacataca gccaggaaga agtatagaca gttccagagc taaaatgtta gccatggatc ccaggaagtc 002400
002401 ctcactgaaa ggtgactttg ggaggaaacc tgtagaaggt gaggaattga gctatgcata taacccaggg aagagcattc 002480
002481 tagacagaca tgacagtcca tgagaagatt ctaggcaagg gaggtagccc agggtcttca accaacagca aggaggctaa 002560
002561 tctagcttgg cacagagtaa caagagatga gatcagagaa ataacagggt agagatggtg tggagtctta tagattattg 002640
002641 aagggagttt gtcatttact ggatgaaata ggaagccttt ggagggtgtt gTTAATTGTA ATCATGATAG GGTTATTACT 002720
002721 TTTCTTATAG TAGAGTTCAA AAGAAAGGAA AACTTATTTT TGTAACCAGG TCATTAAcct ggcccctgta gacatctgag 002800
002801 tttgtaattt ctgTCATATT TTATCTAAAT ATCATGATAG AGAACAAAAT ATCTGACTGG TTCTGTACTG ATAGGCAGTC 002880
002881 AGTCATGCTG GCTAACTTTC AGAAATATGC CATGTTTGGT GGTCTCCTTT CTCTCTGTTT TCTCTAGGTT GAATATGGGG 002960
002961 ATTCAACCAC ATCACTTCAC TCTCTCACCC CTTATAGCAC ACGCTTCTCT CCCTTTCCTT GGCATCCCCT GGCCCTATCA 003040
003041 TGACCTGAGG TCTTCCCTAC CACTTTGGGG GCTCCCATCT AAAAATGCAT TCTAAGTCCT CTACCTGGAA CTAATTTATC 003120
003121 TCCAGGTCTA CACTACTGAT AACAAGATAT ACGATCCCTT GCAAAGGAAC TTGTAACACC CTGAAATTAC TCAGAAATCA 003200
003201 CACCTCAcag gagattgaga ccatcctggc taacacggtg aaaccccgtc tctaaaaaaa aaagaaagaa aaGAAATCAC 003280
003281 ACCTCACAGG CAAACTCTGT AAGTGGGCTC CTGTAGGAAA GCAGCCTGTT GCATGGCAAG AAAAATGCCA TCTTGAAGTG 003360
003361 AAATCGTCAT TAATTCCTGC ATATCCAGGC ATTCCCACAG CAAGGTCAAG AAACAATGCc tttcgggggt agatctgcat 003440
003441 atacctactc acccagaaca TTTGGTGAGT CAGTCAGGAG CTAAGAGGCC AAGAGACAGG AAAGGGGAAT GAAGCATCTG 003520
003521 TACGGGAAAT CCCAGGCCAG CCACGTCTAT GTGGGATGGT GTGGCACGTC TGTTAGATGC ATTTGGACTC TGTGTGACTA 003600
003601 TGGAGACACC CTGAACAAGC TGGTTGGCCT AGAGTGGTTA TTAACTGAAA GGCATATGGT GCAGTGTGGT AAGGAAGGGC 003680
003681 AGTGGATAGC CACTGCAGTG GGTTGGCTGC TTCTGTGGGC CATTTGTTTG TCAACACAGG TTTAGCTAGC AATAGAAATG 003760
003761 AAAAGCAAAT GGCTAGAGGA AAATTGCACC AACCTGATAG AAAAGGACAT GTGCCTTTTC ACTTCTCTGA TTGCGTCCAG 003840
003841 TGTAGCTACA TGTTTTACCa aaaaaaaaaa gaaaaaaaaa aaGCCTGCTA TAGAAAGAAT ATTtcaggag ggtaggtgtg 003920
003921 gaaatctatg gtctctcagc tgcccaaggc attgcttctg ttcataggtt cctatcaaat gtttctttct gagaaactgg 004000
004001 atttgtcagc ctcttttctc tggcttccca gcttccttgg cctttgaggg ggagacctgc atatatctgt tcactgcaga 004080
004081 acaGCCCCAT ACATGAAATG AATTTTGAGA TTAGAGGAAG AAGTTTCATT TATAAGCCCA AAGCAAACAA TTAGGAAAAA 004160
004161 AAATTATGCT CAGGAAGTTA GGGACTAAGA ACTAACCTAA TTACAGGGCT AAAGAAGGAA ATTTGGCCAA AATAATCTGA 004240
004241 CAGAATGCAA TTTCAACAAA GGCAAATTAA ACCAAGGTAA GTCACTCCAT GTACATTTGT GTCTAAATTA ATATCCCTAG 004320
004321 TCACAAATAA CAACTTAAAC TTATAGAAGT AGTTGATGAA ACCAGCTGAC ACAGAGAGCC ATGTATCAAT CTTGATTATA 004400
004401 GAACCTTCTG GAGATCATGG TTATTTCAAA ATATATGGAC ACTGAGGGTG CATCAAAAGG GCTAAAAGAT TATGGCTAga 004480
004481 agataaaata gatgtgtctc tctattcctt tctctagata tagctaaaat ctctggacat catacataaa cataaaaata 004560
004561 ctcggaaagg tggagacaaa aaaacaggtc agttagggac tttgaaacct taaaaaacag catggcactg aggtctctag 004640
004641 atttcctcgt gccttatata tttcatactg gatgctagag aagcctgcaa cccaaaatac caacatacac tgataaaaga 004720
004721 aaaaaaagcc ccaagaaaac caaggaaggg tcaaagggtc agcctagtga aaaaga
[back to top]

Predicted Small Protein

Name NONHSAT103321_smProtein_1748:2020
Length 91
Molecular weight 11132.5108
Aromaticity 0.166666666667
Instability index 48.6245555556
Isoelectric point 10.1321411133
Runs 12
Runs residual 0.00652557319224
Runs probability 0.0464435023259
Amino acid sequence MTERHKKTFRTMKMFCILIVVMVSWMYKFIKSHQVKYMHLFYIKCTSIKQKNLKLVIFIY
KLRYLYFLKIKMSDIIWSTYLSAKNTGDRQ
Secondary structure LLHHHHHHHHHHHHHHEEHHHHHHHHHHHHHHHLLEEEEEEEEEEEEELLLLLEEEEEEH
HHHHHHHHHHLHHHHHHHHHHHHHLLLLLL
PRMN LLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLHHHHHHH
HHHHHHHHHHHLLLLLLLLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooooooooooooooooooooTTTTTTT
TTTTTTTTTTTiiiiiiiiiiiiiiiiiii