NONHSAT099079
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT099079 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
3068 nt |
Genomic location |
chr4+:165798150..165820752 |
Exon number |
6 |
Exons |
165798150..165798559,165799371..165799424,165800072..165800163,165816568..165818722,165819878..165820134,165820653..165820752 |
Genome context |
|
Sequence |
000001 CTTGGAGCTT TTGGAGTACA CTTCCACTAA AGTTATAGCA TGCTTGAATG GTTTATTTCA CCAATATTTG CTTATGGAAA 000080
000081 TAAAGGGGAG TGGCCGAGGA GAAAGGAGAA GAGGAGTGGA GGAGGGGTTT GAGGCTGAGG GAGGCTCTGA CCACAGCACA 000160 000161 GAGCACCGGC AACTTTGTCT AATGTGATCA TTAACCTTCC TGCAAAACAC AGCTGGCAGT TCTCTGAGGT TTGTCACTAG 000240 000241 AATGTGAAGA CAGCCACACA GATATTGCAC AGACTATTTA CAGATCGTTT GGTTTACATT GAGAGTCATT GCTCTACTTT 000320 000321 TGTGCGGTAG GAAAATGAGA TTTCAGCAAT TCCTTTTTGC ATTTTTTATT TTTATTATGA GTCTTCTCCT TATCAGCGGA 000400 000401 CAGAGACCAG AGAAATTAGA AAGTTAAAGT TTTCTGAACC ACAAACCCAA CTTTAAATAA AAAGTTAATT TGACCATGAG 000480 000481 AAGAAAACTG CGCAAACACA ATTGCCTTCA GAGGAGATGT ATGCCTCTCC ATTCACGAGT ACCCTTTCCC TGAGGTATCT 000560 000561 CTCTAGCTAA CTTTACTGGA TCTATCAGAA GAAGAAGAGG AGTGAAGGAA AGACACCCAG CCACACAAAA GAACTTCATG 000640 000641 ATGCCAACAG CGTGATTGCT TAGAAGTTCC TACACAAAAA AAGGATCATT TGAAAGCACC TGGAATGGTT TATTAGCTTC 000720 000721 ACAGGATTTT ATTCTTCTTG GCTTCTATTT GGAGGGAAAA TAACATAAAT TCAAAAGGAT TCCAATCTGA AGCCCAAATC 000800 000801 GTTTGCCTAC ATAACAAAAA TATCTCATCT TTTCCTGCAC ATTATTATTC TTTTATGGGT TAAAAAGAAA AATACCTTTT 000880 000881 AGTGTTTTAG AACTCTCTCA TGGTAAAAAG TGCAAGAATT TAAAATGTTG CTTTCATATT CCTATAATTC TCCAAAAGTA 000960 000961 TTAAATTCGT ATATGTTTGA GTGATTTTCT AAAAACTGCT CAACCTGAAA TCAATTGCAT TGACCATTTG GCTTCGCACA 001040 001041 ATAGGGAGAA AATAATTGGT TCATTGATTA TATAGAGAGA AAGACTAAGA AAAGCTATTA ATTGCTACCA ATTTTATGAT 001120 001121 AAGCTTTAAG GTTTATGAAA GTATGTTTTT TTATTTAATG AGTAATGTCC ATTTGAAGTT GAAAGAAAAC ATGAAATCCT 001200 001201 AATTGTAGTT CATTTTATGT TCAAATGAAA CCATTGTTTT TGTTTTTGTT TTGAAACAGA GTCTCACTCT GTTGCCCAAG 001280 001281 GTGGAGAGAA GTGGCACGCT TTTGTCTCAC TGCAACCTCC ACCTCCCGAG TTCAAGTGAT TCTCGTGCCT CAACCTCCCA 001360 001361 ATTATAGGCT GGGATTACAG GTGTGCACCA CTACACCCAG CTAATTCTTG TATTTTTTGT AGAGATGAGG TTTTACCCTG 001440 001441 TTGCCCAGGC TGGTCTTGAA CTCAGGCTGG AACCATTCAT TTTTTAACCT TTCTCATCAT GTAATTATAG GAACCCAACG 001520 001521 TTTGATTTCC TTTGAAGTTT TGTTATGTCC TTTATTATTT TGTATGGATA ATTTCTTTAA AAGTCTTACT TAAAGTTGAC 001600 001601 ATCTAAAATA CAGTTATGCC AATGAAGTCC CACTCAGGGT GATATCTGTA TCTAAAAGAT GAGTGCTCAT CATCCTATTA 001680 001681 GGCTTTGTCT TGGTGGTGTT CATCCTGAGA TGCTGAGACA TGGAAATAAA AAATCAGAAG GAATTTAGGG ATATGATTAC 001760 001761 TCAAAAAAGA AACTATCCTG TCTAAATTTG AATTGTGTTG ATAACTAGGT GTTCCCCAGA TGCTAAGATG TTCTTAATTT 001840 001841 GTATTTATTG AAGGATTGTT AGCTTAGTGC CACAAAATTT TTCTTACTTT ATGTTAATTC CAGATAAGAA ATTTACAAGT 001920 001921 TTATATCTTT TTTTTTCTTT TTTTTAAGAT GAGATCTGGC TCTATCACCC AGGCTAAAGT GCAGTGGCAT GATCTAGGCT 002000 002001 AACTCCCTGG CTCAAGCGAT CCTTCCACCT CAGCCTCCCA AGTACCTGGG ACTACAGGCA CTCACGGCCA CACCTGACTA 002080 002081 ATTTTTGTAT TTTTTTGTAG AGATGGAGTA TCGCCATGTT GCCCAGGTTG GTCTCAAACT CAGGCTGGTG AGCTCAAGTG 002160 002161 ATCCGCCTCC TTGGCCTCCC AAAATACTGG GATTACAGGC ATGTGCCACC ATGCTGGGCC ACAAGTTCAT ATCTGGAGTA 002240 002241 GAAGTTTTAC TTTGTAAATA TTATAAAGTA GAAGAAACCA TAAACCATTT TGCTAAAATG AAAGGTTGGG GTTAATATAA 002320 002321 ATGTAATTTT AAATAGAAAA TCTGACAACA CTGTCGAGTT TGTCTTCCTG TCAAAGCTTA TTAAAAGTGT CTTTGCGGAT 002400 002401 GAATGGTACT TTCCACAAGT GCATTTGAGT AGAAGCATAA CCTATTCTCA GTTATATTTA TGTTTAAAAC ATGTACTGGT 002480 002481 TTGTATATTT TGTACTGAAA AAGAAAACAC TTTATAGTCA AGATACATCT CATTCAATAC AAGTCTAAAC TCTTTCAAAT 002560 002561 ACAAATTCGC ATATTCACAG AAAAAGTTAC AAATCAGTTT TACTATTGTA AAGTAATGAA ATGGTTATAC ATTTCTTAAT 002640 002641 TGTTCAATAA AACACTCAAT GATTTGCATG TCTGGCTGTC CTTACTTTGT ATAAGAAATG TTCAGTGTTG ACTTCCTTTG 002720 002721 AGGAATGAAA ATCATTGTTT GCTGTACATT TGATCAGAAG AAAAAAGAAA AACTGAAATT AGTGAAGTCA GTGAAGTCTT 002800 002801 CGGTCCCGTC CTGAATCATT TTTACCCTCT GTTTAGGGAC AGGTCTAGAA CGAGTGAGCA CAAAACTGAA GAGTGTGCCG 002880 002881 AAAATCTCAG CAATCAAGAT AACATTTGAA CTCCATTTTT GAAAAAAATA AAAACTAACA CCCACGAAAA ATACATGATG 002960 002961 AATAAAACAA TCTCTTGAAC CCAGGAGGAT GAGGTTGCAG TGAGCCAAGA TTGCACTACT GCACTCTGGC CTGGGCAACA 003040 003041 AAGCAAGACT CTGTCAAGAA AGAGAAAG |
Predicted Small Protein
Name | NONHSAT099079_smProtein_2102:2314 |
Length | 71 |
Molecular weight | 7899.3079 |
Aromaticity | 0.1 |
Instability index | 44.8814285714 |
Isoelectric point | 9.30157470703 |
Runs | 12 |
Runs residual | 0.0265912305516 |
Runs probability | 0.0476211505623 |
Amino acid sequence | MEYRHVAQVGLKLRLVSSSDPPPWPPKILGLQACATMLGHKFISGVEVLLCKYYKVEETI NHFAKMKGWG |
Secondary structure | LLEEEEEEELEEEEEEELLLLLLLLHHHHLHHHHHHHHLLLEEEEEEEEEEEEEEHHHHH HHHHHHHLLL |
PRMN | - |
PiMo | - |