NONHSAT100204
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT100204 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
3281 nt |
Genomic location |
chr5+:6583415..6588868 |
Exon number |
3 |
Exons |
6583415..6583914,6585888..6586140,6586341..6588868 |
Genome context |
|
Sequence |
000001 TAAAGATGGG GCGGCGCGGC CGGGGCTCGC CAGCACCAGC CCTGGAACCC CATGGGGGAC CTCGACAACA GCGCCTTTGC 000080
000081 GGCGTGGGCC CCTTTCTGCA GCTCCTCCAC CCGCGCGCTG TCGGCTCCGG GACACTGGGG CTTTGAAGGG CGCCAGTTCC 000160 000161 CAGCCGCACC TGCACGCCTA GGAGGGCGCA GGCCGCGTGT CCCTGTGGGA GGAAGGCCCC GGCAACCTGC AAAGACCTCA 000240 000241 CGTCACAGAA ACTTGCGGCG TTTGCCCCCG ATGCGCGCGT CAGGTGCTCC GTCCCTTCCA GGTGGCAGAG GAGAGGGAAG 000320 000321 GGTCGCGCCT GGTGGGGGCC TGAGAGCCAC CGGGCTGCGG TGCGCGAGTG TGGACACCGC CCCGCCTTCT GCGTCTACAG 000400 000401 CCCTGGGTTC TGGGACCTTT GGAAACCGCT TTGGACGAAG CTGAGCATCT CTTTCCCCAC GCTGACCTCC CCCATAGACT 000480 000481 CCGGATGGGA GTAAATGAGG TCCTGGGAAA TCCAGAAACT CTAGAGAAGC TTGTTTGCCA AACCTTGCTT TGGGCGGTGG 000560 000561 AGACAGAATA TCATGTCCCA GATCAGCGTG TGGTGACTGT GGAGGCATTT ACACACAGCA CACTCCAGCT CTGCCACTGG 000640 000641 GCCGGCTGAG GTGGCTGCGG TACACGGCGC CCATCCTGAG CCCCAGGTTC CTCACATGTC AACAGCATGA AAAGTGCCTC 000720 000721 TAAGACATCC ACAGGTGTGC AGTGTTAAAG CAGGCAGCGC CTCAGCTTGG TTAGCTCCCT TAAGGACTGC GGCTCCCTTA 000800 000801 GGACAAGGCT CTGTCCCTCA GAGCACTGAG TGCGATCTCA TGCTGGCATC TTAGAGACAT TGTATTCACC AGGTAAGATG 000880 000881 GATTCACATC TGCTGGGTGC ACGAACACTT CCTCCTTCCT TGCCACCAGC CAGCATCACA ACCAGCCTGG CCAGGCTCCC 000960 000961 GTCCCAGCCC AGCCTAGCAG CTAACCCCAG CTCCATCCCC CAGCAAGCCC CTCTGCCTGC ACCTCAGCTT CTTCATGGAA 001040 001041 GACGAGTCAG GTCATGATGC CACCAGGGAG CGTGTTTCCA TGTTGAACTG AGACACTAAG CCCTTGACAA ATGTTGGCCA 001120 001121 AGAAAAAAAA GGGGCTCCAA GAGGGGAGCT GTGGCTCAGG TAGAAGACGG TGAATGAGCA TTGCCTGAAT CAGAGTTAGC 001200 001201 TACTCTGGTT ACACAGGTGT CCCCGGCAGG TTGGTCAAAA GCCCTTCCCT GAGTCCCGTG ATAAGTGTGG GGAGGGAGGC 001280 001281 AAATCAAAGA CCCCTACACA CCTCCTGCAT GGATGGCCAG AGCGCTGGTC CCACCTCCTT CCTTCCAAAC TGCAACCCCG 001360 001361 ATCAGCACAG CAGAGAAGTC TCCTGTGGCA GGGACCAGTA ATTCAGTGCT TCCCCTGGAG AGTCTGGGAT TGCACAGAAC 001440 001441 ACCAGAGGCT GGGGATTTAT TTGTTCACCA Acattcattc aacaaacact tactgaggcc aacactggga atggagacga 001520 001521 acttccctgt cctcatggag cttgctttca aaagggaagg aggcagacag taaacaaaac aagcaagtca atatacagca 001600 001601 tgatgcagaa aaggtgtgat gCCACCCTCT TGCCTGAGTT TATAACAAGC AGATGATGTT GTCCTAAGAA TTAATTTCAG 001680 001681 CCGGTGAAAG AGTACAGGCA AAAAACGTGG TCTCGGACTT TCCGGTGATG CTTAAGAACA TAGTTGCAGT TTTTGTTCTG 001760 001761 ATTCTTTGTG CATCTCGCGC TAATGAAGGA TTCTGCCATG ATTGCCACCT CTGGAGACGT CACAGCCAGC ACAGCCATGA 001840 001841 CCAGGGATGA ACTTGGCCAT TGTCCAGCAG CACCATGGGA GGACCAGCTG CTGTAAGTGA AGGTCATCAT CCATCTGATT 001920 001921 TGCTGGCTTT TTGCAACTAC TGCGGTGGAA GTCTTAGAGT AGCCACCTTC TCTGTACTCA GACCTTCACT TCGCCAGATG 002000 002001 TCAGCACAGC AGTGTCAGCA GCTTCGTCAA ACACAAACAC CACCACCTCT CTGGATCAGA AGCACATGGA AAGCCCACCT 002080 002081 TGCAAGAAGA CCCAGCTATT CCAAAAGTAC CCATATCCCT GTAGCATCCT CTTAGCTGAA TGTCTCTACT CCTTAAACTT 002160 002161 GTACCACTTT GAAGAGTAAA AATTTAAATT CATTCATATT ATCATTTTTT AGTTTTTAGT ATAAGTGGAG Ctgtttatcg 002240 002241 gccatttgta tgtccttttc ttttttttgc aaattactcc tttgccaaat tttctctcaa atgactcatc tttttcttat 002320 002321 tgatttgcag aagctattta catattatgg atgtggggtg tctaacatac acagcaaata ctttgtccca gtgtgtcact 002400 002401 ttttaaaaaa attcttataa tgtctttCAT CTTTAAAAAA TTATTAAGCT ATTGTGAGCT TTTATGCCTT TTCAGTTTAT 002480 002481 TTTCTTACAT AGATAATTAT ATGAAGTGCA AATAGGGGCA GTGTTAGTCT CATCTTCCCA AGGCTGCACC TCAGATGTGG 002560 002561 AATGTTGACA GCAACATCTG TTACCTTGTT TCTGTTTTTA GGATTTTAGC GTTAAggcca ggcataatgg ttcatgcctg 002640 002641 taattccagc actttgggag gccgaggtgg gcagatcact tgaagccagg agttcaagac cagcctggcc aacatggaga 002720 002721 aaccctctct atttttaaat acatttaaaa tGTTAACAAT TAAAACAAaa tattttaccg ttagtatgat gcttgataca 002800 002801 ggatttctgg tacatacgcc ttatcaaatt aaataagttt ccttttactc ttgtttattg ttatttttta aaataaggaa 002880 002881 tgaatcttgg attttaGTGT ATCAATAATT ATATTTTATT CTTTAACTTC ATAATATTCC TTATAATTCC TTATGTCATA 002960 002961 TTAAAGATGT AAATTCATAT ATATTTTATC ATACAAATGA TTAATAAATG TATAAGCATG TCATTATATG AGAGTATACT 003040 003041 TCCTTATGTT TCATGAATGC ATCGTGACAG GCTCattttt ttaatcactg ctcgatttgt gccactgatg ttttatgagg 003120 003121 cctatttatg tcggcgatgg taagagtgtt agtgtttcct ctgtgggtac cttcatctac cgtgagtctc ctggatatgc 003200 003201 tagcatcaca gaatggcagg cggaaatttc cagctttttc tatagaatga gtttaacaca gaaaatagat ctttcttgaa 003280 003281 a |
Predicted Small Protein
Name | NONHSAT100204_smProtein_1874:2134 |
Length | 87 |
Molecular weight | 9445.7083 |
Aromaticity | 0.0697674418605 |
Instability index | 75.4465116279 |
Isoelectric point | 10.4332885742 |
Runs | 11 |
Runs residual | 0.0117146824019 |
Runs probability | 0.032567326685 |
Amino acid sequence | MGGPAAVSEGHHPSDLLAFCNYCGGSLRVATFSVLRPSLRQMSAQQCQQLRQTQTPPPLW IRSTWKAHLARRPSYSKSTHIPVASS |
Secondary structure | LLLLLLLLLLLLHHHHHHHHHHLLLLEEEEEEEELLHHHHHHHHHHHHHHHHHLLLLLLE EELHHHHHHHHLLLLLLLLLLLLLLL |
PRMN | - |
PiMo | - |