NONHSAT084857

From LncRNAWiki
Revision as of 09:55, 13 October 2014 by 73.162.128.239 (talk)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT084857

Source

NONCODE4.0

Same with

,

Classification

sense

Length

3093 nt

Genomic location

chr22+:31556194..31602999

Exon number

6

Exons

31556194..31556289,31588670..31588688,31591455..31591567,31592922..31593143,31597484..31597601,31600475..31602999

Genome context

Sequence
000001 GTCGGAGGTC TTACCCAACA GATTGACGCG GCGTTAGTAT TGGCCGTGTA CCCGAAAAAC TGATTGACTG GGCTGGCGTT 000080
000081 AACTGTGCGG AGGCAGTTGG CCGTGTTTAC ATCAGTGGTT GGAGACCAGA CCTAACAGAC AGGTGTGTCC TGTTTGCAAA 000160
000161 GCTGGCATCA GCCGAGACAA GGTCATCCCC CTCTATGGAA GGGGCAGCAC TGGGCAACAG GACCCCAGAG AGAAGACCCC 000240
000241 TCCTCGTCCT CAAGGACAGA GGCCAGAGCC GGAGAATAGA GGGGTGAGGA ACATTCTAGG AGAAGCTTCT ACAAATGAGC 000320
000321 TGATGGCTTT AGAAGTTTCC TTCTTAGCCT TACAAttttt ctccttgatt ataaaagtaa aggaaaactt gtaaaataca 000400
000401 gaaaggaaaa gagggggaaa aaaatcactt ataaccacac caatccagag GGATTTCAAG GATTTGGATT TGGAGATGGT 000480
000481 GGCTTCCAGA TGTCTTTTGG AATTGGGGCA TTTCCCTTTG GGATATTTGC CACAGCATTT AATATAAATG ATGGGCGGCC 000560
000561 TCCTCCAGCT GTCCCTGGGA CACCCCAGTA TGTGGACGAG CAGTTCCTGT CACGCCTCTT CCTATTTGTG GCCCTGGTGA 000640
000641 TCATGTTCTG GCTCCTGATT GCCTAATGCT GGGCTCCTGC CTACATCCGT GGCAGGGCTC TGGACTGGTG ACGTGCCACC 000720
000721 CCAACTCCTG GTGTTTGGCT TCCTGGCTAA TCTTGACTCC TGGAATCAGT GGGATCAGTA ACACATCAAG GAGTCTTGTT 000800
000801 TCTTCATCAG AGCTTTGGAA CTCGAGACCA GTTGGCGATG ACCCCTGAAT ATCGCCACCG CTGTAAACAC TCTATAACTT 000880
000881 CAGGCCTTGG CATTGAGTCA TCTCTCATGG GTGACACCAT GAAATCTTGT TTCAGCCAGT TCTGCAGGTC CTGACTCTGC 000960
000961 AGAGGGAAGA GGCAGAAAGA GAGAAACTGT CAGAGTATAA TTTCACCTGA GTTTAATATT ACAGAAACAA AGGGATGCAC 001040
001041 CAAATGGTAT TTCTGGAAAT TTTCATGTCT TTAAATACCC CTTGGTAAGT TGCTTCTGAA GCCAGTGGGG GCTCCTCAGA 001120
001121 TAGAGAGGTT CCCCTTTCAA ATCCCAGTGC CGCTCTGTTC TCTTTCCTTC CCCTCCCACT CCCCCTCTTC TTCCTCTGTA 001200
001201 GAGATGCAAG AAATTGCTGT CCCATAAAAA TCATAATTGC AGTAGCTAAA GCTGGGGTCA CTTCGTGAAT TCACCAGAGA 001280
001281 CTCAAAGATC TTTTATTGGC TCTGGGCTGT GCTCAGTGTC TTTGGCCTCA GAGAACAACT TGAATGACTT CCTGGTTTCC 001360
001361 TGGCATAAAT TATTCCTGGT GAGACATGTG GCTTAACTCA CAGGTTTCCC ATCAGCTTTC TCCCTAAAAC TATGTTCATC 001440
001441 TGCCTCTCTC TGCCAGAGAA CATACAGCCG AGAATACTGC CGAAGCTGAG ACTGACTACT GTGCATTAGG AAAGACCTGG 001520
001521 AGTCAGGACT TTGGTGGGAT TTGGAGCTCC GAGGCAGTAA TAACTGAACA AGCAGCCCTG TCCCCTAGGC TGCAGAAGCT 001600
001601 TGAATGCATC CTCTCCCAGA ACCTGCCACA GGAAACTGGG GGCTTTGTCA GGTCAGCCCA ACTGCATGCA AAAGACCACC 001680
001681 ATCCTCAGAA GCCAAGTTGT CTTTTATGAA GAGGCAAGGA AAGGGGAAAC CCACATGTGA CCCTGATTTT GGTATGGCTT 001760
001761 GATAGAGTTC CCTGAAAACT CCTTGTATGT GTGCTAAAAC CAGGGAAGCA TGTGACTGCC AAGCAGGCAA CCCCTGATGA 001840
001841 TTTGTAAAGC CAGGTGGCAG GGCCTTGGGG AGCCCCAGCA CAATGATATT GTGTGGTCTT CCCTCCTGTG GAATCGAGGG 001920
001921 GAAATTATTC TTCCCAATAC CTTGATTTGA TTTTCAGTTT CATAAGCTTC TTCCTCTGAA TCTTATTGAG GGACTATGGT 002000
002001 ACCAAGCAGG TAGGActgtt cacctggtgg aacagttctt gctctgcctt ctaggcttca tcccagaaat ccagcctctt 002080
002081 tctggagacc ccaaagctgg agggagatgg gctttcctct gggcctctct cctactttgc catccACACT GCTCCTGGCT 002160
002161 AACCCCAGCA ATAACCAACA AATGGTAGGA AGCCCCATCT ATTGCTTTTT CTCAATTATG ACTGCATAGT TTATGGAAAC 002240
002241 AAAGATCTTG AGGAAGATGA GGGAAGCCCT CCCCTCTTCA CAGTCCCCAT CTCTTCTCCT TTGTACCTGT CAACACCAGA 002320
002321 GTTCAGTGTT TAAACAGACA AAATATAAAG TATTGAGTAG GTGGTTCATA TGCCGAATCC ACTTGGTAGG AAGAAACCAC 002400
002401 CAGCTTTATA GTGTGCCTGA TAGATTTTAA ACATTCCTGG GCACCCACTC AAGAGTGCTC TTTTTATCAC CTTCTGGAAA 002480
002481 TCCGCAAAGT TGCAGGGGCC TCTGGAGTGT CTCTTCTCTA GAGAGAATTG GTGGGACCCC CCTCAGTGCA GTGGCCCCAA 002560
002561 CTAGTGGAGG GAGAGAGGAC TTAAGTCAGA TGGACTCAAC AGAAATGGGT TTCCAGAAGA ATAATGAAAA GTTGTGGGTA 002640
002641 GGAAAATGAA TCATTTGGAC TCTTCAATGA AATGGAGTGA GCCCAGGAGA GCTCAGCCAA CAGAGGCACT CTGGGAACCT 002720
002721 GTTAGTAAAG CCAGGCTGGC CAAATGCCAT TTGATTTTGA ACCTCGTAGG TCCCCACTCA CCCTCTGCCA GGAGCTAAGT 002800
002801 AAGGCAGGAG AGCTGACTTG GGACTCCTGG CTCGGCCCCA ACAGGGAGCC CCCTTCCCAC CATCCCTCGG CAAGCTCACC 002880
002881 ACCTCATCCT TCTGCCAAGG CAGCTTTCCT TTCTTTTGTG TGTTTTCTGT GTTCTTAGCC TCCACCCTCC TCCTGCCACC 002960
002961 CTTGTGGACT AGGACCAGGT CCTGACCCCA GTCAGAAAAT GATGATATGT ACAGTGGCAC ACCTTAACCA GTCACTAATT 003040
003041 TTCACTGTTG TGAAAGTGAT TTGATTTAGA ATTAAACAAA TGGTTTTACA TTA
[back to top]

Predicted Small Protein

Name NONHSAT084857_smProtein_665:847
Length 61
Molecular weight 6409.183
Aromaticity 0.0666666666667
Instability index 59.6683333333
Isoelectric point 5.95648193359
Runs 12
Runs residual 0.0502659574468
Runs probability 0.0467599952894
Amino acid sequence MLGSCLHPWQGSGLVTCHPNSWCLASWLILTPGISGISNTSRSLVSSSELWNSRPVGDDP
Secondary structure LLLLLLLLLLLLLEEEELLLLHHHHHEEEELLLLEEEELLLEEEEELLEEELLLLLLLLL
PRMN LLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLL
PiMo ooooooooooooooooooooTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiiiiiii