NONHSAT004289
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | NONHSAT004289 |
Source | NONCODE4.0 |
Same with | - |
Classification | intergenic |
Length | 3137 nt |
Genomic location | chr1+:87597609..87602352 |
Exon number | 3 |
Exons | 87597609..87597935,87599297..87599401,87599648..87602352 |
Genome context | - |
Sequence |
000001 GGAACATCCT CGCGGCCCGA GGCGCGGTCG CAGCCGGGGA GCACTCGCCA CGGTGCTGTG GAATTCTCTG GTTTTTCACG 000080
000081 CAAGGTCAGG CGTCCTGCTG GCGCCCTCTC GCCACCCTGC CCTCCCGTCA GAAGCCCGGC TCCTCGCCGG GGAAGGCCGG 000160
000161 ATGCTGGCCC GCCGGGACCT GGGACTTGTG CCACATGGAG TGTCGGGAGT CTCCATTGCC GCGAGTTCTA CACCACAGGG 000240
000241 CCAGGCTGTT TGCTCCCCAT CGGTCGCTGC CCCCAGCACC CTGTTGTTAT TAAGGACTCA TTTGCTTGGA GCGGCATCAT 000320
000321 TACAAGGCAC CAGATACTGC TACACCATCT CATCGGCATG GACCTATATG TGGCAGCAAG TCCATCTCAT CGCTTTTGGT 000400
000401 GAAAGTCAGT CCAGTTTGTA AAGATTCTCA TTGTCACTGT AGACGAGGAA CTGGGACACC AAAAGGAGAA ACTCTGGCCA 000480
000481 CGCTTGCACC CTGTTCCCAA TCCTGGTCCA GTGTCACCCA CAGATGGTAA GGAGCTCTAG AGACCTCACC AGCCCCTGGG 000560
000561 ATTGGTCACC TCACTCTTCT ATGGACAGAG ATTCCTGCTG GGATCCTTTG AGGGCAAGCA GACCCTTCTT CCAGCTCGGA 000640
000641 CTGTGAACTC CACTGCAGCC GTAAGGACTG TCTGTGACAG TGAGCCCGAG ATGACTGGGC TCTGTGCTCC CTCCCGGCCC 000720
000721 TCCAATCCTT GGCCTGCCAC AGAGAACTGA GCTCTTTTAT TAGCACCATG AATGTGACTG ATACAGCTAG CCATTCCCTT 000800
000801 GTGCGAATGA CTCAGTTTAT TAATGCTCTG CTAAAGATGG CTTCTTTGCT TGCCAGCAGC CTTAAACAGT ATTTCATTAA 000880
000881 AACTGGCTTA ATTATTTTGA GAAGACGGCC CAATTAAAAG CTATACACTC CCTCTATGTG AGTGTTTATA CATAGAGCTG 000960
000961 tatatataat acatatttgt aagtgtgtat atatatatgt gtgtatgtat gtgtctataa atatataGGC TTAGCAATTT 001040
001041 CATTACATGG GATAAATTGT TGGAAAAAAT ACCCAGGAGC TGGTCCCCTT TCTGTTGCTA GATTCAGAGT AGAGGCCACC 001120
001121 CCTCCACTCT GGGAGAGGCT GGTGTTGGTG ATCTCTCAAT GACTCTGCAA TGGAAGTCCC AACTGCACAG AGCCCTGCCC 001200
001201 CAGTTTCAGG AGCCAGCAGC CTCGGAGAGG CGGATCCTGA CCTCTGCTCT GCTCTTGGGA TAGCCTTTCC CTTCCCAGCA 001280
001281 GGGTTGAGAT ACTTGGGCCG GGAAATGTTG TGGCAAAGTG TTTGCCAAAG CTCAGGAGAG ACACAGACTT GGGGCTTTTG 001360
001361 TTTCTTGAGC TGGCTGTCTA GCTTTCCTAA TGAGCAAATA TGTTCTCTTT AAGGAAACAA ACAAACAAAG CAAAAACACC 001440
001441 AATTCATCTG GATTTTATTC ATTTGTTTTA AATACAAACA AACAAAAGGA GAGTGGTTAT TTCTGCACCA ACTATTTCAA 001520
001521 ATGCAAGTTA CTCCATCGCT CGGGGTGGTT GGATGGTGCT TGTCACCATA GGACCCACAG GGCTAGTTCC AACTGTTATT 001600
001601 CGGTAAGGCT TTTTTCTTTC CAAAATTCCC AGTGTTCCTT TAAGGCCCAT TTAGCTGCGG GTTTTGTTTA TTCTCCCGGC 001680
001681 AATCAGCATT TAAAATAAGA CAAACAAGCA TTTTTTCCTG GGCTGTGAAT CCCCCCGGCC AGCCTCCACC TGCACACCTG 001760
001761 AAGCCAGCAT GTCCAATCAA ATTTCTCTGT AACCCATATC CCCTTTAGAG ACTTGCCCCC GTCGTATACC AGGCTGGAAA 001840
001841 TAGAGAACTT AAGCAGGGCA AATGTAATTT TAAGAATTGC TAATGATGCT AGAAATCTGC AATGCAATTA GCGTCATTGG 001920
001921 ATTTGGCGCT CCTCCGAAGG CACAAAACTC CTTGTCATAG CGCAGTGGCA GCAGCGGCAA GTGCCTCCGC ATGTGCCGGG 002000
002001 CTGTCCGGGT ATGCTGGCAG CCGCTTTGCA CTGAGATGTG AGCAGTTGGT TAGGCTTCCT CTCTTTCTTT CTCACAGATA 002080
002081 CTGACTTCTT TGTCTCTTTT CTGGGTTGCA GAGGGATGGG TATTTTCCAT TGATTATTAC TTTAGCATTT GACCCTCCAG 002160
002161 TGGAGTCACC CTGTTTTTTT TTTAGAAAAC TGAGACTCTC ACTTTGTGAA TTCACTGTGC TCTCTGGGAT TTCAGTGCTG 002240
002241 TAGTTCAACC ACCAATCCCC CTGTCCTGAA CTCCAGTACT TCTGATGCTA TTAATTGGTT CCTCAACAAT TGTGGCCTTT 002320
002321 TCCATCATTG CCCACCATAG TATATACTTT TTCTTTCTCT CTCTTTTTTC TAATTTCCTT GTCTTCTTCA CTCTCCATGG 002400
002401 AGCCAGAGGT AGTATGAAGA GTTAAAAATA GGAATATAAA GAAAGCCAGA GGGACAGAGG GAGTGAGAAA GAAAAATTTT 002480
002481 AAAAAGGGAG GAAATGAATT ATTGGATTAA AAATAAACTT TTACTTTTTT GCAGAAAAAT TATTTTTGCT CTCTGGGAAA 002560
002561 ATAACATGGG CCAGGCATAA AAAGCATGTC AGCTGGCTAA AAGATTGCAA AATCCAGAAG ATGATCTCGA TGTGTCTGTT 002640
002641 CAATTTAGCA AGGGTATCTA CTAGGGGATC CTCTTTTAAA TATGGAGGCC CAAATCAGAA GCTTGTAGAG GGGAGCTATT 002720
002721 CTTCCAAGAT TCCAGATGTG TCTGTGAGAC AACACGTTAT GGGGCAAATT GATTTCACCC TTGGGAAACC AGGGAGATTT 002800
002801 TCAAAGTTAT GTCTGCAAAG CCAGCTAATG CAATTCCCCA TTAGTGCATT AAAGTGCGCC CTTATTAATT CAAACATAAA 002880
002881 GGCAACAAAA TAAGCtttta aatttaaaat ataatacata tataatGAGC ATGTGTGAAA GCCTTATTCA AATGAAAATA 002960
002961 CAGGAGTGTT TGAACTACTG AGGTATCTTT TGTATTGAAT TATGAGCATA TGTAATAGAT TTAATTATTA ATTTCCCCAT 003040
003041 TGTTCTATGC ACACAGACAG GGTTCAAGGC ACAGTCATTC TCTGGCTTTC ATAGATCTAA TTTGTATAAT TATTGCCTGA 003120
003121 ATAAAAAATT GCTCCAA |
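The exon coordinates and the transcript length listed in the table above can be cross-checked with a few lines of code. The following is a minimal Python sketch, assuming the exon coordinates are 1-based and inclusive (this page does not state the convention). The function names exon_lengths and strip_position_markers are illustrative only and do not belong to any tool referenced here.

```python
import re

def exon_lengths(exons):
    """Length of each 'start..end' block, assuming 1-based inclusive coordinates."""
    lengths = []
    for block in exons.split(","):
        start, end = (int(x) for x in block.split(".."))
        lengths.append(end - start + 1)
    return lengths

def strip_position_markers(numbered):
    """Remove six-digit position markers, spaces and table pipes; letter case is kept."""
    without_markers = re.sub(r"\b\d{6}\b", "", numbered)
    return re.sub(r"[^A-Za-z]", "", without_markers)

# Values copied from the Basic Information table.
EXONS = "87597609..87597935,87599297..87599401,87599648..87602352"
FIRST_ROW = (
    "000001 GGAACATCCT CGCGGCCCGA GGCGCGGTCG CAGCCGGGGA "
    "GCACTCGCCA CGGTGCTGTG GAATTCTCTG GTTTTTCACG 000080"
)

lengths = exon_lengths(EXONS)
print(lengths, sum(lengths))                    # per-exon lengths and their total
print(len(strip_position_markers(FIRST_ROW)))   # nucleotides in the first sequence row
```

Under the inclusive-coordinate assumption, the three exon lengths add up to 3137, which agrees with the listed transcript length. Passing the whole numbered sequence block to strip_position_markers should give a string of the same length.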
Predicted Small Protein
Name | NONHSAT004289_smProtein_1169:1411 |
Length | 81 |
Molecular weight | 8560.5923 |
Aromaticity | 0.1125 |
Instability index | 58.03875 |
Isoelectric point | 4.00238037109 |
Runs | 11 |
Runs residual | 0.00131578947368 |
Runs probability | 0.0473896356249 |
Amino acid sequence | MEVPTAQSPAPVSGASSLGEADPDLCSALGIAFPFPAGLRYLGREMLWQSVCQSSGETQTWGFCFLSWLSSFPNEQICSL |
Secondary structure | LLLLLLLLLLLLLLLLLLLLLLHHHHHHLLLLLLLLLHHHHHHHHHHHHHHHHLLLLLLEELLHHHHHHLLLLLLLEELL |
PRMN | - |
PiMo | - |
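The amino acid sequence and the aromaticity value in the table above can be reproduced from the transcript sequence. Below is a minimal Python sketch under two assumptions that this page does not state: the standard genetic code is used, and the open reading frame spans transcript positions 1170 to 1412 (1-based, inclusive), which is how the 1169:1411 in the predicted protein's name appears to map onto the sequence. Aromaticity is taken here as the fraction of F, W and Y residues. The names translate and aromaticity are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y)."""
    return sum(protein.count(x) for x in "FWY") / len(protein)

# Transcript positions 1170..1412 (assumed ORF), copied from the Sequence field.
ORF = (
    "ATGGAAGTCCCAACTGCACAGAGCCCTGCCC"
    "CAGTTTCAGGAGCCAGCAGCCTCGGAGAGGCGGATCCTGA"
    "CCTCTGCTCTGCTCTTGGGATAGCCTTTCCCTTCCCAGCA"
    "GGGTTGAGATACTTGGGCCGGGAAATGTTGTGGCAAAGTG"
    "TTTGCCAAAGCTCAGGAGAGACACAGACTTGGGGCTTTTG"
    "TTTCTTGAGCTGGCTGTCTAGCTTTCCTAATGAGCAAATA"
    "TGTTCTCTTTAA"
)

# Amino acid sequence as listed in the Predicted Small Protein table.
PROTEIN = (
    "MEVPTAQSPAPVSGASSLGEADPDLCSALGIAFPFPAGLRYLGREMLWQSVCQSSGETQT"
    "WGFCFLSWLSSFPNEQICSL"
)

print(translate(ORF) == PROTEIN)
print(len(PROTEIN), aromaticity(PROTEIN))
```

In this sketch the translation gives 80 residues followed by a stop codon. The listed length of 81 would be consistent with counting the stop codon as well, although the page does not say how the length was defined.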