NONHSAT135504

From LncRNAWiki
Revision as of 12:41, 13 October 2014 by 73.162.128.239 (talk)

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT135504

Source

NONCODE4.0

Same with

(none listed)

Classification

intergenic

Length

4041 nt

Genomic location

chr9:139149260..139161211 (minus strand)

Exon number

7

Exons

139149260..139151294,139151966..139152110,139154157..139154308,139155469..139155644,139157802..139157889,139159230..139159357,139159895..139161211
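As a quick consistency check, the exon intervals listed above can be summed and compared with the reported transcript length. The sketch below assumes the coordinates are 1-based with inclusive ends (the page does not state the convention), and the helper name `exon_lengths` is purely illustrative.

```python
# Exon field copied verbatim from the "Exons" entry of this page.
EXONS = (
    "139149260..139151294,139151966..139152110,139154157..139154308,"
    "139155469..139155644,139157802..139157889,139159230..139159357,"
    "139159895..139161211"
)


def exon_lengths(exon_field):
    """Return the length of each 'start..end' interval.

    Assumption: both ends are inclusive, so length = end - start + 1.
    """
    lengths = []
    for interval in exon_field.split(","):
        start, end = (int(x) for x in interval.split(".."))
        lengths.append(end - start + 1)
    return lengths


lengths = exon_lengths(EXONS)
print("exon count:", len(lengths))
print("total exonic length:", sum(lengths))
```

Under the inclusive-coordinate assumption, the seven intervals sum to 4041 nt, which agrees with the Length field, and the first start and last end match the span given under Genomic location.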

Genome context

Sequence
000001 ataaacattc ttaagctttc ttcgggactg cagttattta gaagtaattt ggtcctttca gacttttcat taatagtgtg 000080
000081 ttaggtcgat ctggatcagt gctcagcctg gggctaatca tttccgacag tcaaggcaag aacgttctgt gtacactgcc 000160
000161 tgcccagtga cctgggagtc ctaaggtcct ccagcctgac tactggaaac aggcgtcatt accaggcttc cagtcccccg 000240
000241 gaacgcttct gtcccagcat ctggtagctt cttgacacgc attcatccat cagttctcta ctgaatactc cagagggaca 000320
000321 cttgatacat ctcagaggtc ctctctctgt gcgactctct cctctctgac actctgcccc cacaaactcc agctgccttg 000400
000401 gtcctcccag actctctgct gtctcctcta cccagggagt cctccagctc agccagcttc ccctccctca ccatagcctg 000480
000481 gcaactctct gaggcagtaa gccgtgagca tcgcaagtct cctcccctcc tttgttttct gtctctcagg gatcactgtg 000560
000561 ctatgttgcc tcaagtccag tgcctggaaa actgttattt catatgtGTG TGTGAAAAAa tatatatata tgtatacaca 000640
000641 cacatatctc tatataaaca cacatgtgtg tgtgtattta CGTGATTTTT TCAGTTGTTG CAGATGGGAT GGCAAAGCCA 000720
000721 AAATCCAGTC CCTTTTGCTC TGTCTTTGGT GGGAGGGAAC GTCCTTAAGG ATCGCTTGAA GTACAGGCTC CTTTTTCCAT 000800
000801 CACTCAGAAC AGAAGCATGG CCAGCAGAGG TCGGCAGCAG GTCAGCTACT TCAGCTGGGG ATCTGGGATC TGCTGTCACC 000880
000881 TGCCAGCTCC TTGGTGCTAG ACCCCGACAG CCCTTCTCCC TCTCTGGGCC TCACTCTCCC CTCCTTTAGA AGCCTGTAGA 000960
000961 CGCAGAGCAG CTGGGCTGTC TCCCTGAGCT TCCTCTACTT TTGGTCTGGT CTTTCCCATG ATCAGTAGGG TCACAAACCA 001040
001041 CAACTTCACA TCAAGAGACT GCCTGTCCCT TCCCGCGTGG TCTTGTGGAT TCTGCGTGAA GCCCTGTGTG TCCGGGATGG 001120
001121 GATGGGGACA CTAAACTGCC CTAGACCTCC TACTTTTATG AGGCCCTCCG GTAGAAATGG CCCCTGACAC GGGGCACTCA 001200
001201 GACCCCCTAC ATGACCCAAC CATGGCGCTT CCTTCCTCAG AGGTGGAGGG CCCTGACGGA GCCCTCTCCC AGCTTCCCCT 001280
001281 GGCCAAGTTC TTCCCTCCGG ACAACCCCAC CCACCAGATG CTGGAGCGGA GCCTGCGGGA GGAGGAGCTG CGAGCACAGC 001360
001361 ACCAGGCCGC GCTGCTGCGC CTGCGAGAGA TGGCGCTCCA GGAGAAAACG CTCGCGGAGC TGGCCTGGCT GGAGCATCGA 001440
001441 CGAGGGTGCC TGGACAGCAA GAGGGACAGA GCTGTGCTGG CCGCGCTGGT TGAGAAGCAG CAGCAAGCCC TCAGCAGATT 001520
001521 GGAGAAGGAG CAGAGGGAAA TCCAATACCT GAGACACACG CAGCTGTTCA GGCACCGGGA CAGGAAGCTG CTTCTGCAGC 001600
001601 ATCAGAGGGA CGTCGTCTCC ATGCCGGGGC CTGTGGACAT CCTCCCCATC CCGGGGCCCA CGGACGTCGT CCCAGCGCAC 001680
001681 GAGCTGCAGG CCCAGGCCAA GCTGCAGCAG GTTCCAGCCC CAAAGTCAAG GCCGCCTGGG AGGGAGGCTC AGAGACAAGC 001760
001761 CAGCAGCCAG AGGCCTCCCT GTGTCCCCTG ACCCCATGCA GGCCCAGCAG CTCCACCAGC CATCGCCCCC AGAGCAGCCC 001840
001841 CGCAAGCTCA AAGGCCACGC GCCCTCCCAC TGAGCAGCAG GATGTGACGC CACCCCAGAC AACCTCCGAT GCAGATGGTC 001920
001921 ACCAGCAGCC TCCGAGGCCA GCATGGGGCG AGGACACACA CGACCCCCAG GGCCCGCTGG TGGAGAGTGG CAGCCATGTC 002000
002001 AGTCAAAGCC TGGGGAGCAG CCTCGGGCCC CTCTGCTGGG CCTGCAGCAC GTGAGCCCCC CGGACGGACA GCGGCTGGGC 002080
002081 CCAGCCTTTC CCGTACGCAC TCTCCCTGGA AGCCCTTTCC CCAGCACCGC CTGTTTCAGG GACCACCCCC CCCCAACATT 002160
002161 CTGCCAATCT GTGACTTGAT CTTCATCTTG GTACCCTCCC CCCCAACATT CTGCCAATCT GTGACTTGAT CTTCATCTTG 002240
002241 GTTGGGGCCC CTTGGTTCCT GTGGCCGGTC CTTGGTTCCT GCAGTGGGTC CCTGTTTGTC CCTGCctgtc cccatctgtc 002320
002321 cctgtccgcc cccggctgtc cccatccacc ccagtctgtc cccatctgtc cccatcCATC CTGGTCTATC CCTGGCCCTT 002400
002401 GGTGCCCTCC CTTCAGGGCC CTGTGCCATG TCCACTCCCG TCTCACCCCC CAGGCCGAGG AGGCGGAGGG GCGCCTCCCC 002480
002481 ACAGCTCAGT GCAGGTCTCG GGAGGTGAAG GAGCCACCCC CAGGTAAGCA CCCTGGAAAC ACCGCGTGGG CCTCCACGGC 002560
002561 GCCAGACACT TCCCCCACAT CCAGCAGAGC CTGTCCGAGG CTGGAGCTCA GCCACTGGAC AAGGACGGAA AGCAAAGAAG 002640
002641 ATGGGCCCTG GTGGCCCCCC ACCCTGGCCT CAGACCCCTT CACTGCAAGA AGAGTGGCCG GCTCCATCTC CTACCGGAGG 002720
002721 TGACGCAGCG GCTCTTCTGG AGAAATTAAA AGTCATAACT GGCTGGGATT GTTTTAGTCG CAACTTCGTT CATCTGGGGC 002800
002801 AGATTTTTAA GTGTTTGGAA GCCAGCACTG CCTGAGCGTC TTCAGCTGAC GGAGGTTTCT GCACCGATTA CATTTTTCTC 002880
002881 TTTCTGAACC TTACACTTGG CCGGGGCCCC CCCCGAGAGC CAAGTCCCCA ATTTGCAGGT GGGGCGGGAG CAGATGCACT 002960
002961 CACAGTGTCA GGGAACCCTG GTGGTGCGTC TGGTAGCCGC CGGCTGAAGA GTGGGGGGTT GTGTGCCCTG CGATGCCACC 003040
003041 TCGGAGCTCC CTGACCAGCC CTGGGTGCCG AATTCAGAAG GCCCCCACCC AGTGTCCATC CTCGGAAGCA GGGAGACTAT 003120
003121 GCCACCCAAG GGGGGTGGCA TAGACTGGGG GGTCCCTGCC GTTGGGGCCA GGGAGAGCAA ACCTCAGGAC AGATGGGTTG 003200
003201 AGACGGAAAG AGATCAGAAG CACTTGTGGG GCTGCCCAGG AGAGCGTGGC TGGTGGGGCT CCCTGGAGAG GCCAGTGACA 003280
003281 CTGGGAGGCG GAGTGAATAC AGAGCGCTTT CCCTCTGTGT CCCGCGTCAC CTCCGGGCCT GCGCTGTTCA GGCTTCTGCG 003360
003361 GCCACCGTGA AGCTTGCTCA GCCTGAAGCC TCCCTCGCCG CATGGCCACG TGTACCAGCT GTCACCGTTC GGGGGGCCAG 003440
003441 CCAGGCGCCT GCACCCACCA TGGGTCCTCG TTGCCTCTGG AGGTGGATTC ACTCGTTATG CCCACGCAGC AGAGGAGGAG 003520
003521 CCTGAGGCCT GACTTATGTT CTCACGTGGT CTAGGGTTCA CCCTGAGTCT GCCTGACTGC GGTGCCGGCC GCTCTCTGAT 003600
003601 CAAAAGACTG CGCTTGCAGG GGCTCCCTGG GTGACGCCCC ATTGTGGTCG CTGGGGTCAC CCGGAGCCAC GGACAAGAGG 003680
003681 GCAGGTGTTC CCACTGATGG AGCTTCCCCC TCCTTGGGCT TTAGACAAAC AGCTCTGGTT CCCTTAGCCA GGTGGCCTTG 003760
003761 GCTCTCAGAG TCCCCTGACC CCACAGAACG AAAGTCTAGG GAGATGGAGG TGCAATGCTG GGGGGCCTGG TGCTCCTGGG 003840
003841 GAGGGCTCTG CGGTGGCCCC ACCCGTCCCT TTGTCCCCCT TTGTGATCCC CGGCGGTGCA CTGGTGGCCT TGCTGTGTCC 003920
003921 GAGGCCTGCC CCCCACCGGC CCTCTGCACC ACCCCTGCCT CACACCAGTG TCTGTCGGAC GGCACCGAGG GGGAAGAACA 004000
004001 AGCTCTGGCA AAAGGTTTCC CCCAAGGGAG GGAAAGAGGC C
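The listing above interleaves position counters with 10-base groups, so it has to be cleaned before it can be passed to other tools. The following is a minimal sketch of one way to do that; the function name `parse_numbered_sequence` is hypothetical, and the sample is just the last two lines of the listing.

```python
import re


def parse_numbered_sequence(text):
    """Drop position counters and whitespace, returning one uppercase string.

    Note: converting to uppercase discards the lowercase/uppercase
    distinction present in the original listing.
    """
    groups = []
    for line in text.splitlines():
        for token in line.split():
            # Keep only tokens made of nucleotide letters; the numeric
            # counters at the start and end of each line are skipped.
            if re.fullmatch(r"[ACGTNacgtn]+", token):
                groups.append(token)
    return "".join(groups).upper()


sample = """\
003921 GAGGCCTGCC CCCCACCGGC CCTCTGCACC ACCCCTGCCT CACACCAGTG TCTGTCGGAC GGCACCGAGG GGGAAGAACA 004000
004001 AGCTCTGGCA AAAGGTTTCC CCCAAGGGAG GGAAAGAGGC C
"""

sequence = parse_numbered_sequence(sample)
print(len(sequence))
```

Applied to the full listing rather than the two-line sample, the same function should return a string whose length can be compared with the Length field.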

Predicted Small Protein

Name NONHSAT135504_smProtein_3401:3553
Length 51
Molecular weight 5345.0458
Aromaticity 0.0
Instability index 104.594
Isoelectric point 7.75604248047
Runs 7
Runs residual 0.0246783625731
Runs probability 0.00720468367527
Amino acid sequence MATCTSCHRSGGQPGACTHHGSSLPLEVDSLVMPTQQRRSLRPDLCSHVV
Secondary structure LLLLLLLLLLLLLLLLLLLLLLLLLEEELEEELLLLLLLLLLLLHHHLLL
PRMN -
PiMo -
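
The predicted peptide can be reproduced from the transcript sequence. The sketch below rests on two assumptions that the page does not confirm: that the `3401:3553` suffix in the name gives 0-based, inclusive indices into the transcript (in the listing, the ATG begins at 1-based position 3402), and that the standard genetic code was used. Aromaticity is computed here as the fraction of F, W, and Y residues, which is the usual definition; whether the page used exactly this definition is also an assumption.

```python
# ORF copied from the Sequence listing, 1-based positions 3402..3554
# (assumption: "3401:3553" in the name is 0-based and inclusive).
ORF = (
    "ATGGCCACGTGTACCAGCTGTCACCGTTCGGGGGGCCAG"
    "CCAGGCGCCTGCACCCACCATGGGTCCTCGTTGCCTCTGG"
    "AGGTGGATTCACTCGTTATGCCCACGCAGCAGAGGAGGAG"
    "CCTGAGGCCTGACTTATGTTCTCACGTGGTCTAG"
)

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(orf):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y) in the peptide."""
    return sum(protein.count(x) for x in "FWY") / len(protein)


peptide = translate(ORF)
print(peptide)
print(len(ORF) // 3, len(peptide), aromaticity(peptide))
```

The ORF is 153 nt long, or 51 codons including the stop, which would be consistent with the Length field if that field counts the stop codon. The translated peptide has 50 residues, matches the listed amino acid sequence, and contains no F, W, or Y, in line with the reported aromaticity of 0.0.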