NONHSAT119684

From LncRNAWiki
Revision as of 06:29, 17 October 2014 by 124.16.129.48 (talk)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT119684

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3987 nt

Genomic location

chr7+:27186782..27195547

Exon number

3

Exons

27186782..27187031,27191194..27191300,27191913..27195547

Genome context

Sequence
000001 TTCTCTTTTA AGAAGCCAAA GAAACTTTGC TGGTTCTTTC ATCCTTTCTT TCTCTCTATT CCTCTTTCTA CTCTGTTTCC 000080
000081 TCCTTCTTTC TTTCTTTTCC TGCCTCCTTT CCCCTCTCCC TCCACTGTCT TGGGATATGT CTTACCCGCG CAGGAGTTCA 000160
000161 TCCGCTGCAT CCAAGGGTAA ACCGGGCTCG TGTACTTCCG GTCGGCGCCT TCGTCATGGA GTGCTTTGCC CTGCCCGCTG 000240
000241 CTGCTGTCGG AATTCCACCC ACGCACCTAT TCCCCCTCCC AGCGCTTCAG ACACCTCTCT CATCGAAAAA CCGGGCGAAG 000320
000321 GGAAGCCGGC AACGAGCGGA GAACTCTGGC TGAATTAACG GTGGCTCCCA GAAGCTCCTG CCCCTCTGAC AGCTGTCGCT 000400
000401 TGGGCAGCCC GAGAGAAGAA TTGTCCTCTT TCCTGGTGCC AGAGGACGCA GGAAATTAGC CAGGTTGCGA GTTGCAAAGC 000480
000481 TGCTGCCGCG GCGCCGGGAA CGGAGCGCGC CCAATCTCCA GCGGGAGCCG CCAGGCCTGG CCTGGCCGGG GCTTCCCTTC 000560
000561 GCTCGCCATC TCCGGACAAA GCACAGCCGA GCCCGGCTGG AAGGCAGAGC TCCGAAGCAG GCAGGACGGA GCGGAGCAAA 000640
000641 AGAATGCGGC TCTATTCTCG CAAGGGAAAT TATAAAAAAG TTCATGTTCA CGGTTCTCAT CCACATGACC GACAGCGGCC 000720
000721 AATGGAAGGG CCGAACAACT CATAAAGTTG TATTGCAAAG TTGTAAATTT TCATAAACAA CAACGGATTT ATGACCCTTT 000800
000801 CCCCATCACT GAGAGGAGGC AGCTCTTACA CCGGCGCCAT CTTACCACCG AGGCCGCCCC GACTTGGGGC CTCAGGTTTT 000880
000881 ACAGACCCTT TTGGGCCAGG TTTTACTAAA AGAGCCATAA GAAGCGGGCC CAGCCCAGGC AGGAGACTGG AGACGAGGTC 000960
000961 TTGCAGGCGG AACTCAGGAT GCTCTGAGCT GCCCGCACAA CCCCTGGACC TTCACCCCTC GCCCCTTCCC CGCATCCAGC 001040
001041 TGCCCCAGCC CCTGCCCAGG CTGCGTAGCC TAGCGGGGGT CTGCGGTCCT AGCCCCTCCC CGCGCCACCT ACTGCAGTGC 001120
001121 CGGACCCTGG GGCCCCCTCG CCTGGTCTGC AGGCGGGGTG GGGACCTTAA ATCCCATTTC CTAGCCTGGG GCTGGGTTCA 001200
001201 GGGCGCATGC GAATCCGGAA TCAGCTCTGG GTAATGCCCC TTTCCAAGCC CACTGCTCAG CCTTAGAGGA AAGTGTGGAT 001280
001281 TTGAAATTTC CTCATGGAAT TGATGGAGGT TTTTAGGTAG ATTCATAGAA TATAACGTAT CTACCAAAGA TTCCGTTTTC 001360
001361 AAGGGATCTA GAAGATGTTA GTGCACACGC AAAAACCAGA CAAACGTCTC TACACGGATA AAGGCACATA TACAATTATG 001440
001441 CACACAGGGA AGGGCATACA CTCTATTGTG GGCACAGAAT GACATGCAAT TATGGACACA CAAAAACACA TGCACCCAAT 001520
001521 TATGGACACC AAAATATATA CAATTGTGGA ATTAGGTAAA AACACACACA CAGAAATACA TACACAGAAA AATAAGCACA 001600
001601 TACTCATACA AATACACACA TAAAAATACA TTAAAAAGAT ACATGACACC AATACATGGG TACCCAACAC TTGGACCATC 001680
001681 ACAAGGACAG CCACCCCACT TTTGCTTCCC CACTGCCCCC TGCCCTCCAG CCATACTCAC CTCCCCTTTC CCAGTCCCCT 001760
001761 CTGGATAAGG CAGTCCACAT TTTTCTTTGT CACCACGCAT CTTTATTTTC GGTTACATAA AACACAGCTG GGCTGGGAAG 001840
001841 TGTGCCTTCC CTGAACCCCA GGATGGAGCT GAGCAGGGTA CAGGACAACA CAGGAGATGA AGGGCATTGC GGAGGGCATT 001920
001921 GGACCTCCCC ACCCACTACA GTTAACTCAA GACAACATAC CATGCTACAA AGTCACCCCA TTAACACATC CTTTCCAAGT 002000
002001 CAAGACACTG CCTTACAAAT GAACTCCAAG ACTATAGAAA TGATAAAAAA AAATCTTGTT CAAATATACA GTATCTGCTA 002080
002081 TTATAGGAAA CATCAGGGCG TACATATTTA ACACAGCTGA ACAGTAAGAT ACAGGAGCCA GAGGAAAGGA CAGCGAAGCT 002160
002161 GGAAGCATCT CCACAGTCCT GCTAAGCAGA AGCTAACCCA CAGATCTGCA GCCAGCTCAG GAACATTCCC CTCCAGAAGT 002240
002241 GGGGGTTGAT GGGCCTGAGC TGTGGGTGCC AAGCCAGAGA AGGAGGGATT GATTCTAGGG TGCAAGCACT TAGGATGCTT 002320
002321 TTTGGAATAA ATATATTATT TTTCGATTTA AATAGATGCC AATACCCTGA TCCTGGACCT CAGCACATTC TCAGGGCAGC 002400
002401 CTCAGGGACC CCAAAAGCTG CGGGCTGTAA GCAGCAGGGG ACTTGCCTGG GAGCAGTCGG CACTAGGTAG CAGGCAAGCC 002480
002481 AGCCAGCACA AAATAGGTAG TTTTAGGGGA GTAGGTAGTA GTGAGATTCA CTTTCTTGCG GGTCTGGGAG GGTGGTGCTG 002560
002561 GGTGTCTGCC AGTGTTGGGA TACATAGGGA CTTCCTGGGA ATGGAGGCCC TCTGGGGCTG GATACATAGG TAGTTTGGGG 002640
002641 GTGCCTCGAG CAGAGGCCTG TGCTAGGTAG TATTTTGGAC GCGCCAGAGC AGGGCCGGCT GGCCTGGGGT TGGGGGTGTC 002720
002721 TTTTGGGGTC CTCGGAGGCA GAGGGAATCC AAGGCGACCC AGTCTCTGCG GCCGCTCAGT CCACAAAAGT TGGGAGCTGG 002800
002801 AGTAGGTGAT GGGGGTGGGT AGAGTGCAGG TTGGGGACTG GGTTGCTTTT TTGTTTTTGT TTTTGTTTTT TACATTTTCT 002880
002881 TTTATTTTTC CCATTTTTGT AAGTAAAACC AGTGAGTCTC TTAAAGACGC TTTTCCGACT GTCCGGTGCA GAGAGGGCCC 002960
002961 CGGATCGGCC CCTCATTCCT CCTCGTCTTC CTCTTCTTCA TCATCGTCCT CCTCGTCGGC CTTGTCCGCG GCAGCAGTGG 003040
003041 CGGCGGCAGA GGGCACGGCG CCCTCGGGAG CTGCGGCGGC AGTCGGACCT TCGTCCTTAT GCTCTTTCTT CCACTTCATG 003120
003121 CGGCGGTTCT GGAACCAGAT CTTAATCTGG CGCTCGGTGA GGCAGAGCGC GTGGGCGATT TCAATGCGGC GGCGCCGCGT 003200
003201 CAGGTAGCGG TTGAAGTGGA ACTCCTTCTC CAGCTCCAGC GTCTGGTAGC GCGTGTAGGT CTGGCGGCCC CGCTTCCTGT 003280
003281 CAGGTCCTGA GAACAGACAT GCAGACACAT GAACACAAGG ACAGACAAGT AGACAGGGCA CTCGTTAGGC TGCTGTCCCA 003360
003361 GAGCCCGCAC CTTCCTCCTG GCCTAGTCCC CAGCGAGCAT CCCCCTCTGC CCCAGGCCCC GAACTGAGCT AGGGGAGGAG 003440
003441 GGGGAGTGTT AGGGAAAGAC CCCAACTGCA GTGCCAGACG CGCAGGCAGC TCTGTAATGA GCAAAGGCAC AGAATCTCAA 003520
003521 CTTTACAACC GACCTTTCCA GCCGGCTAAG CTTCCACAAT GTCCTGCTTC CTCTGACAAA GGAAAACTGT AAATATAGAG 003600
003601 TGTGAGCAAG TGGGAAACGC TGCACTTTTG CCATTCAAAG ATGAGCCCGG CCATTCCCCT GCCTTGCTAG GCAAGTGGGC 003680
003681 GACTCTTCCC AGCAGCCTGA GCCCTCATCC CCAGGACCTT CCTAGGGCAC CCCGACCCTC TGTCCTCATT CCCTCGCCCC 003760
003761 CATCTTGAAA TGGACCCTGG CACAGGGTCG GGTGAGAGGC CCTGGAGGGC TTGGCTCTCC TAGCTTTTGA GAAAGAAATG 003840
003841 TCAGGCAGCA AGGAAAATGA GGAGAGAGAG AAGAAGAAAG GGAGGGAGGG TGACAGAGGA GGGAGAAAGA GAGACAGAAT 003920
003921 AGCGAACAAA CTTAATGTTA AAATTCCAAG ACAAATGGAG TTAAATAAAT TTACGAGGAT CGAACCC
[back to top]

Predicted Small Protein

Name NONHSAT119684_smProtein_1862:2110
Length 83
Molecular weight 9354.4369
Aromaticity 0.0609756097561
Instability index 51.9207317073
Isoelectric point 7.87847900391
Runs 14
Runs residual 0.034830007391
Runs probability 0.0480480480481
Amino acid sequence MELSRVQDNTGDEGHCGGHWTSPPTTVNSRQHTMLQSHPINTSFPSQDTALQMNSKTIEM
IKKNLVQIYSICYYRKHQGVHI
Secondary structure LEEEEEELLLLLLLLLLLLLLLLLLEELLLLEEEEELLLLLLLLLLHHHHHHHLHHHHHH
HHHHHEEEEEEEEEEELLLEEL
PRMN -
PiMo -