NONHSAT030100

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT030100

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3158 nt

Genomic location

chr12+:97917741..97921511

Exon number

2

Exons

97917741..97919858,97920472..97921511
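
The Length, Genomic location and Exons fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this explicitly); the function names are illustrative and not part of LncRNAWiki or NONCODE.

```python
# Exon coordinates copied from the "Exons" field.
# Assumption: coordinates are 1-based and inclusive on both ends.
EXONS = "97917741..97919858,97920472..97921511"


def parse_exons(field):
    """Turn 'start..end,start..end' into a list of (start, end) integer pairs."""
    pairs = []
    for part in field.split(","):
        start, end = part.split("..")
        pairs.append((int(start), int(end)))
    return pairs


def spliced_length(exons):
    """Sum of exon lengths, i.e. the length of the mature transcript."""
    return sum(end - start + 1 for start, end in exons)


def genomic_span(exons):
    """Distance from the first exon start to the last exon end, introns included."""
    return max(end for _, end in exons) - min(start for start, _ in exons) + 1


exons = parse_exons(EXONS)
print(spliced_length(exons))                        # spliced transcript length in nt
print(genomic_span(exons))                          # footprint on chr12 in nt
print(genomic_span(exons) - spliced_length(exons))  # total intron length in nt
```

Under that assumption the two exons sum to the 3158 nt given in the Length field, and the outer coordinates match the Genomic location field.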

Genome context

Sequence
000001 ctacttggga gactgaagcg ggagaatggc gtgaccccgg gaggcggagc ttgcagtgag ccgagatcgc gccactgcag 000080
000081 tccagcctgg gcaacagagt gagactctgt ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aTCACATAAC TGTATGGTTG 000160
000161 AATATCCATT GTGTTCTTTA CAAAGTATGT CATTCTTTCT GTAACATCCG TGGTGGAATA TAACCTTAGT ATTACACAAC 000240
000241 Caccatcagc agcagcagca tttgagagct tagcagaaat gtagaatgtc gggctccatc tcggacctac tgaatcaaaa 000320
000321 tctgcatttt agcaagatct ccaggtgata tacaggatca ttaaagttag agaagtcctg GTATACAGGA TAAGTCTTAC 000400
000401 CAATTGCTGT ATAGACCTAT GTTCACTTCA TGGATAACAT TTGCTCTAGA AAAGAAAAAT TAACCAGTAG AGAGGGCCAG 000480
000481 CAACTGTTCA TACCCTAGAG CTGTGATTGG CCACCTAATA GTGTCTAAAA TTATGATGCT GACAATGACA CTGCTGCTGC 000560
000561 TGTGGATAAC ACCCATTATC AGGTGTTTCC AAGTGTTAGC CATTTTGTCT AATATGTCAA CTAATCTGTT TAACTATCAT 000640
000641 CTACCTCCTG AGTTTTATGC TCTTTTACAG ACAAGACAAT TGAGGCAACA GGTTGCCTAG GATCACCACC CCCTTGTAAA 000720
000721 TACAAGCATT CTGATGGCTA GCCTCAGGTA TGCCCAAGGT AGAGTCTGTG ACTGGGAGCT GAACTTTTTA AAAGTAAGAC 000800
000801 AGACTATTTG TTTCAAATAG GAAGAAACTA TGACACTTGA AAGCTTCCTA TGAAGTTGTC AATTGAAATT TGTTTAGATA 000880
000881 ATATTTGTAA TGAGCAATGT GTTCTCCAAT TTGCATATTT GAAAAGATAA GGAAAGCCTT CTGTTTTAAG CTCATATCAC 000960
000961 ATGTTCAGGC TATCAGATCA CAATTTATTG TGTATTATGT GCCATGTGGC ATCTTAAGCC CCATTAGAGG CCCTTAGCCC 001040
001041 TGGGCATTCT ACTGGCATCT AAAGGATGCC TTGAGAGTGA GCTGCTTTTC TGAGGACAAA TATTTGTACA TTTGTAACAT 001120
001121 GGGGAAAGGA CTCCTTACTT GAAATGGCTT TAGAATTGTA GAGAACATAG AAATGCGGTT TGTCTCTTGT GTGCTAAATT 001200
001201 AAATCCTTTT AAAGGAAGAC ATCCTTTGAG AAAGGAGATG CAGGAATACC AACATTTTGA TGTTTATTGG GTGACAAAAA 001280
001281 TCAACTCAAT CAACAAATGG TTTTGAAAAT AACATTGAAA TGTGAATTGG AAGCAAAAGG TAGTGAGGTC ATGTTATGGT 001360
001361 GCAGAAAAAC AATGCTGTGT GTACTGCTGA TCAACTAGGT AGGCATTAGC GGGTGGATCA TGGCCACTAG CATGGGAAAG 001440
001441 GGAAGAGTTT TTATAAATGA ACTAGAATAG ACTGCATATT CAAGTAGGAA TAATGACTCT TTTTAAAAAT TATTACATTT 001520
001521 GTTCCTGGTT TAAAAGGAGA GCTACATATA TAACATAAAT TTATACAACC TCAAAGGATA TACTGTGAAG ATTAGTTTCT 001600
001601 TCTCCTCTTC TGTCCCCAGA CCCAAGTTTC CCTACTGACA CAACCAGTGT GTGCAGCTCC TTGTATATTT TTCAGAAATA 001680
001681 TCCTCCAAAT ACCTCATAAA GAGTTATGAA ATATCTTCAT AAATAAACAG TATATGCAAA AGCCTTGTTA AACAAAAGGG 001760
001761 TAATATACca ggggccccta atcccctggc cacaaaccag agcacaggtc tgtaacctgt taggaaccag gctgcacagc 001840
001841 aggaagtgtg tggcctgcaa gtgagagaag cttcatctgt atttacagcc agtcaaaatc acttgcatta caacctgagc 001920
001921 ttcgcctcct gtcagatcag tggtggcgtt agattctcat agcagtgcga accctgttgt gaactgtgca tgcgagtgat 002000
002001 ctgggttgtg cactccttat gagaatctaa tgcctgatga tctgtcactg tccccatcac cctcagatgg gtccatctag 002080
002081 ttgcaggaaa acaagctcag ggctcccact gattctacGG AAGCTATtta ttactattat tattgttgtt gttattaCTG 002160
002161 ATTtgagtga gaaagctagg ataggaattc acccaagatg cttgtctttg gatcccttgc tcttgtttgc tacactgtac 002240
002241 tgtttctTGG TGATTCTGTT GCTACTTTGG CAATGAGAGT Gcatttattg gatatctacc atgtgctaag cagatgacat 002320
002321 gtgttagtca cagtcttcat aatagcacca tgagAtgttg ttgttgttgt tgttgaacag attcttggta aggtttacta 002400
002401 acttgcccaa agtcccagag ctagtaagca aggagctagg aatggcctgg aagcaggatg tctgactcTT GTGACTCACG 002480
002481 ACAGAGAAGG GAGTTTCAGG TATTGATTCT ACATCTAACA GGCTGGAAGG CAGAAATAAA ACTAATATAC ATATGAGGCT 002560
002561 TAATTAAACG TATTGAAGTG ATAAGAGGGA TAGGAAGGGC TGTCACAAGG ACCAACGGAA GACAGTTGGT GGGGCACAAA 002640
002641 GATAAAAAGC ATCTGTAGCA GAGAGGAGCC GTGTGGATCC AGAATGCACT CTTCTCTATG GAGAATGTTT CCAAATCTAG 002720
002721 ATTGCTTTGT TCTCACCAAC TGCTTCAACC GTTAAAACAT TCCACACCTG TACATTGCAT GTATACCTTC AACAACCCAT 002800
002801 TGTCATTAGA ATGGCAGCAA TGAGAATGGA ACATGTGGTG CTTGCATGTC CTAGAGAAAG AACAGCCTTA GGTGACTGCT 002880
002881 CCGAGTTGGA CTTTGTGGCC ACAGGGAGCA GAAATCAGCA AGTGCCCCAC ATTAGAAAAT AAAAGTAAGA GTGCTAAGAA 002960
002961 AAGTAAGAGT GCTGAGATTC TTCAAGGCAA TGCCAATTTC TCCAGGTCAG CTTCTCCTAT TTTAATCTGT TTCCAGCAGT 003040
003041 TCTGCATCTT TGTAGGTATG CCATAAGCAG CCATAAAAAA GGCAAAATTG TACGATCTAC AGGTCTAGTA AAATTATAAG 003120
003121 TCATGTAGGT CTTATAGAAA TTATAATTTT AACTGAAG
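
The sequence above is laid out in rows of 80 bases, split into groups of ten and framed by position counters. A minimal sketch of how such a block could be flattened into a single string is shown here; the function name is hypothetical, and the sample contains only the first two rows of the listing.

```python
def parse_numbered_block(text):
    """Drop the numeric position counters and whitespace; keep the bases as written."""
    bases = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                # Leading and trailing position counters such as 000001 or 000080.
                continue
            bases.append(token)
    return "".join(bases)


# First two rows of the sequence listing, copied verbatim.
SAMPLE = """\
000001 ctacttggga gactgaagcg ggagaatggc gtgaccccgg gaggcggagc ttgcagtgag ccgagatcgc gccactgcag 000080
000081 tccagcctgg gcaacagagt gagactctgt ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aTCACATAAC TGTATGGTTG 000160
"""

sequence = parse_numbered_block(SAMPLE)
print(len(sequence))
print(sequence[:20])
```

The letter case of the original listing is preserved, so any downstream comparison should normalize case first if it does not carry meaning for the analysis.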

Predicted Small Protein

Name NONHSAT030100_smProtein_2036:2275
Length 80
Molecular weight 8921.8923
Aromaticity 0.0632911392405
Instability index 44.5962025316
Isoelectric point 9.03948974609
Runs 11
Runs residual 0.00104040228889
Runs probability 0.0532591414944
Amino acid sequence MICHCPHHPQMGPSSCRKTSSGLPLILRKLFITIIIVVVITDLSEKARIGIHPRCLSLDP
LLLFATLYCFLVILLLLWQ
Secondary structure LEELLLLLLLLLLLLLLLLLLLLHHHHHHHLLEEEEEEEEELLLLLLLEEELLLLLLLLH
HHHHHHHHHHHHHHHHHHL
PRMN LLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLH
HHHHHHHHHHHHHHHHHLL
PiMo oooooooooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiT
TTTTTTTTTTTTTTTTToo
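
The Aromaticity value in the table can be reproduced from the amino acid sequence. The sketch below assumes the common definition of aromaticity as the fraction of Phe, Trp and Tyr residues; the page does not say which tool produced the value, so this definition is an assumption. The sequence is the two wrapped lines of the Amino acid sequence field joined together.

```python
# Amino acid sequence from the table, with the two wrapped lines joined.
PROTEIN = (
    "MICHCPHHPQMGPSSCRKTSSGLPLILRKLFITIIIVVVITDLSEKARIGIHPRCLSLDP"
    "LLLFATLYCFLVILLLLWQ"
)


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y) in the sequence.

    Assumption: this is the definition behind the table's Aromaticity field.
    """
    aromatic = sum(seq.count(aa) for aa in "FWY")
    return aromatic / len(seq)


print(len(PROTEIN))          # number of residues in the listed sequence
print(aromaticity(PROTEIN))  # compare with the Aromaticity field
```

The listed sequence contains 79 residues, and 5 of them are aromatic, which gives 5/79 and agrees with the tabulated value. The Length field of 80 may include the stop codon, but that is a guess and is not confirmed by the page.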