NONHSAT127201

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT127201

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4260 nt

Genomic location

chr8+:72756338..72968547

Exon number

4

Exons

72756338..72756657,72875158..72875283,72964774..72967426,72967437..72968547

Genome context

Sequence
000001 AGGCGGGGAC CGGGTACTCC CGCTGCAGCC CCCGAAGCTC CATCTCCTCC GGATCACTCA CCGAGCCCGT GGACATCCCG 000080
000081 TTGTCCCCCT TGCCCACACG CGTCCTCTTT CCTCCCCCCT GGCCAGTCTC GCTGTCTCCG CCTTCCGCTC CCTGGCGGAG 000160
000161 GCGGAGGCCA GAGAGCGCTC CAAGGAAGAC TAAAAACCCA GGCCGGGAAG CGCGGGGTGA GAAAGCGAGG TGGGTGGCGA 000240
000241 GAGCGTGAGC GCCCCTCTGC TGACCCCGGG GAGCGTGGAC TACGAGTTGG CGCCCAAGTC CAGAATCCGC GCGCACCGCG 000320
000321 GAAAATATTT GCAATACCTG CCACGTCCAA GGTTCAGCTG AGGCCAATGT AAAGCCAGTA TTATAGGACA CTTCTCAGAC 000400
000401 TGAGAGCCCA ATGTCGTTGG AAACCACTGG GCCACAGGAA AGGCAGACTT CAAATCATCC CAACACCAGA GAACACTAAA 000480
000481 TGACAGTGGA CAGGAAGACA GTGTACCTTT TGCTCCTGAT GATCGTAAGA ACAACCTCCT TCCTCTTATT GTGAAGTGCA 000560
000561 AGGTGCAAAA AGGAGGCCTG CTGCTTGTTC AGGACTATGT CAGCATTGTG GCTCAGAAGA AGCGCAACGG CTTTGGCGTG 000640
000641 GCCTTCCCTT GCAGCAAAGT GAAGTGCAGT GTTCTTTTGA AGAAAAACAG ACACAGAAAA CGTGGTGACA GTGTCTAATC 000720
000721 TGTCTTATTT CACTGCTTAA CTGCTTAAAA GACATTAGCG ACACTATTTA ATACCAAAAT ACATAGCACT TAAAACTGGT 000800
000801 GTTCAGAATC AAGCTTCAAA ATAATGAGTT TATAGTTCTA GCATGGTGTG TGTGCATGTG TGGGAAGTGG ATCTGTGTGT 000880
000881 GTGTGTGCGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGATA GATAGAGAAA GAGAGAGAGA GAGAGAGAGA 000960
000961 GAGAGAGAGA GAGAGAGAGA GAGAATGAAT TCCAAGGCAT TTGTTTTGTG TTGCTTTTGC AATGGGATGA TTCCACAAGG 001040
001041 ACAAAGGACA AGCTTTCAGG GTAAAGATGT TGGGTTGAAC ACAGAAGAAA ATCCTCCTGA GATTGATCAT TTGTAGAACA 001120
001121 GTGGCTTACA GATTTGGGGC TTCACAAGCT TGTAAATGTT TTTTAAAATC AGAGACTGTT ATAAGTTTGC CCATTTTTAG 001200
001201 TTTCTCCACT TAAAGACTTT TTTTTTTGAA AAAGGCAACA ACAGTCTTCT ATCCCCATCA TTTTATGAAC AAACGTACAT 001280
001281 TTCAAAAGTT ACTCTCCCCA AAGGAAGGAA AGACATAATA GCACAATAAA GGGCAGCATT TCAAATCAAT ACACTTCAAA 001360
001361 ATAACACTTA TCTTAATGAA AACTCACCCC TGCCTTCATT TCTCTGGACC TACCAGGGAG TAAACGCTTT ACCATCAAAT 001440
001441 TTCATTGGCC TGTGGACCTC GCTGCTCTGT GAGACTGGAG GTTTCCCCAA CATTCACCCA CCCCTTGGTA TTCTCCCAAT 001520
001521 TCTGTTGCCT TTTGATTCAT ACTTCACCTC AAGAATCCAC TTGCTAAACA CCCAACAAAA GGCTGGGAAT AGAGCAAGGA 001600
001601 AGGTAAAAGG TGTCTAGGAG AGTTTTCTGG CAATGTTGGA AGTTTCTGCT ATATTGAGAT TTTGGGTAAG CATGAGAACC 001680
001681 GCAGTACATA CCCCGTCTTC ATCCAAGCGA TCTGTGCACT TCAAATTAGT ATCAAGAATG ACCTTCATGG TCTGAGTGTA 001760
001761 CCCGCCCATG GACGCATGAT GCAAAGCTGT CCAGCCATTG TGGTCACTGG TAAAGAGTTA AGAAAAGGTG TGTGCATACA 001840
001841 CTTAACAGAT GAAATGCGGG ATGGGATGCC ACTTGGCTTA TTCTAGGTAA ATCTGATGAC AACAAAAAGC ATTATTGAGA 001920
001921 TCTGGCAACT AAATATTTAT AAAACATTTA TAATAAATGA GCTAATTCCA AGCTCAATGA TCTTAAATGA CCTATTTCTA 002000
002001 ATGTGATTCA TACACGTTTT TTACTTAGTG GTTTAAAATA TGAAAGTATA TAAACACATT ACCAAACACT TTGGGGTAGG 002080
002081 AGAAGTACGC TTAGCATGTC TGTGTCATAC CTAATTTTTT TAAAGCTATA ATGAACTGAT ACACATCAAT TTAAATCTTG 002160
002161 TTTTATTCTT TCAATGAACA CTGTATCAAA ATTTTTTTAT GAAGTAATGT TAGTCTTTGT GGTCAATAGT TATTGGTAAT 002240
002241 ACTACTAGTA ATATGGTACA TGGCTATAAG ATGCTTAGCT TAGTGTCAGT TATTATTCTA AGAGTTTAAA TCTAAAAAGC 002320
002321 TTCCCTCTTG TGCTCTGAAA AATGCTTACT CAATTGCTTT GAAAAATGTT TATCCTTTTC CACAGAAATT CAAGATGTAA 002400
002401 ATACAACAAA GCCTTGTCAT ATTAGCTATC TTTTTTCAAA TTAGACAGAA AAAAATTCGT GACACTATTT AATTTCAGTG 002480
002481 CTAGTGAGCA GATGTTTTAG GGTGCAAGAG CCATGTTGCT ATTTCAGTTA TGATGTAATT AAGCAACAAC TGTCTGGCCT 002560
002561 GGGTCTACTT CGGTCCCCTT TGCAGGTCAG TTGCTTGGTG TTTCTTTGTA AACCAGAAAA AACTGAGCAA AACATTCATT 002640
002641 CATGTAGCTG TAGCACCAAT TTGCTGGCAT CAATATTATG GTAAGGAAAT GTGTTTTTGA ACATTAGCCT AATAAGCTCT 002720
002721 TTGCTACTGC TATGGTAAGA TATCAAATGA AAAACCTCTG TAAAACATTT CCAGGTTTTT TCTTTCTTAT CACAATATCA 002800
002801 TTTAATTATG CCACATATTT TATTATACTT ACATATTTTA ACTTTGTTTC TTGTACTTGA GTTCATTTAA GTATAATAAA 002880
002881 ATTCAGAAAT TTTTACCATC TCCTGTTGTC ACTTAAAGGG AAATTTTTAT GCCTACCATT CATCAGATTG AATGAAAAGT 002960
002961 AGAGTATTTG AACTTGTATG TTTTGTGTGT GTGTGTACGT GCATACACAC ATACACAACC ACAGTTTTGT ATCTATATAG 003040
003041 GAATATGGCA GCATGATGAG GTAAAATATG AGAAACTATG TTTCAAAGAA AGGTTCTGAG AAAGACTGCA CAAAGCCAAT 003120
003121 TATTCTGTGA CCTTGAACAA GGTTCTTCAA CGCTATAAAC TTCAAAGTCT TTGTCAGTTG GAAATAATAC TGACTTCCCT 003200
003201 TTCAGTATTT TTAGAAGTTA CATATGTTTT GATGAAAAAA TAAATTGGAT AAAAGTACTT AGTGAGATAT AAAGTTAGAC 003280
003281 CTCATAATTA AAAATATAAT GAAAAGATAG CCTGAAAATG GCTAATTAAA GAAATTATAT AAACACTCCA ATCATATATC 003360
003361 CTCACCTGAG AAACAATGCA CCTTTTTTCA GAAGAAGCTG AACTACTTTA TCATGTCCAT TCTTTGCTGC CAGATGGAGA 003440
003441 GGAGTCATTC CATGAAGGTC ACCTTCATTC AGAAGCCTCG TATCACTTAT GTCTTGTAGG AGCCTCTGAC AGGTATTGAT 003520
003521 ACGCCCATAA CTTGGAAAAA ATTAGGTTTC ATATTTTCAA GGCAAATATT AAACAGAAAG AAGACATGTT CATACATTGC 003600
003601 TAGACTGACC CTTACCTGGC TGCAAAATGC AGAGGTGATT TCTTATCTTT GCTTTTGGAA TGAATGGACA CATTAAAGCC 003680
003681 AAGTAGGTTA TTTACAGAAC CAGGGCCCCC CTGTCTACAT GCATAATGTA GAGGAGTACA CCCATCGTTG TCTTCATCCA 003760
003761 TTACCAGCTC TTTGATCTGT TGCATCTATA GGAAAAAATT AATATCATAC AAATAACTCT GCTATTATGC TAATATGATG 003840
003841 GTTTTCAAAC TATTCTGTAG GAAAATATTT TCCAACAAGG GTCAATATGT ATTCCTTGGG GCCAAAATTG TAAATAACCT 003920
003921 AAAGGATTTG AGAGTAATAT AAATATGAAT GAAGTTGAAT GCTCTGTCTT CATGTAGTGT ACATTACCAT ATTAGTGTAC 004000
004001 ATTACTGTAT TAAAGGATCT GAGAGATCAT GCCAATAGGA AAATGTACTT AACATTTTTA ACTCAGCATT TTTCAAATAT 004080
004081 ATTTGATCAT GGCCCTTCCC TCTGCCCCTC CTCTTATTTC CTCCCACCCC CAAATAACAC CCAACAGCTT CATGTTAATA 004160
004161 TGTTCTGTTA GGTAAAATGG ATGGGATGTA AACATTAGGT TTATTTTATG TTTTATATGT ATAGTAAATA AAAAATGAAT 004240
004241 GAAAAAAAAA AAAAAAAAAA
[back to top]

Predicted Small Protein

Name NONHSAT127201_smProtein_1775:1885
Length 37
Molecular weight 3977.7834
Aromaticity 0.0277777777778
Instability index 45.8861111111
Isoelectric point 6.51177978516
Runs 7
Runs residual 0.013986013986
Runs probability 0.0485779897544
Amino acid sequence MMQSCPAIVVTGKELRKGVCIHLTDEMRDGMPLGLF
Secondary structure LLLLLLEEEELLLLLLLLLEEEELHHHHLLLLLLLL
PRMN -
PiMo -