NONHSAT104312
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT104312 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
4825 nt |
Genomic location |
chr5+:141785944..141790752 |
Exon number |
1 |
Exons |
141785944..141790752 |
Genome context |
|
Sequence |
000001 CATTTGAAGT AGTGAGGCAG GGATTTGAGC CCAGGTCTGG TCCCTGAGCC CATGCTCTCT GTTCTGCTGT GCACCTTGGG 000080
000081 TGATCAGGGA AGCAGACCCA AGCTAGGGAA GAAGTCATGT GGGCAGGGCT GAGGCACTGG AACAGAGTGG GTAACGCAGG 000160 000161 ATCTTGGGAA TCTGACACAC TGGGGTTCAA ATCCTGGCTC TGCCACTTCG TAGCTCTGTG AGCTTGGAAG TGACTTAATT 000240 000241 TCTCTGATCC TTGGGTCCCC ATCTATAAAA GAGGAGAGCC ATAACTACCC AGAGAGTGTA AGGATGACAT GAGAATGTAT 000320 000321 ATAGAGCCTT GGCATGGGCC CTGGGGTGTG ATGAGCATTC AGTAAATGCT GGCTGCTGTG GGCACATGGG AATTGCTGGG 000400 000401 CACTTTCTCA TCCACTTGAT CTCCTTGGCT CTTAGGGAAG GCAGACGCTG GACGGCAGGT CTCTCAGACA CAGGAGAGAC 000480 000481 CCTTAGCTTT ATTTATCCAC CCTGGGATGT ATCCCTGGGA GCTGTGTATC AAAATTATAG CCACTGAAGG TTGGAAGGAC 000560 000561 AAAGGGGAGG CTAGGCCTGG ATATAGCATC TTTACAGTGG TCCAACTTCC CTCTTGCTGG TGATGTCCAA AGAGAAAGGC 000640 000641 CAGCACAGCG GCTCAGATGG GCGTAGCGGG ACTGCCTGGA GGAGCCTATG CAGTGGGCTT GGGCAGGGTC CAGAGTTACT 000720 000721 GGTGGCGGGG GTGGACGCTG TCAGTGTCTC CCCCAGACTC CACTGAACCT TTTATATCAG GTGCCCTAGA GTAGCACCTG 000800 000801 TTTACCTAGA CCTGCCCTCC AGAGACACTG TCTCCCCACC TTGTGAGCTC CTGGCTGCCA TGACCTCCGC ACTTGCCCTG 000880 000881 AGAGGCTGTG AGGATGAAAT GCTGATGTAC ATGAAGCTCT TAGCCTAATA CTTCACATTC AGCGAGCGCC CACCTTCTGA 000960 000961 CTGTGGCCAT TAATTTGTGG ATTCAAGTAA TGTTTTAGGT TGAGAGGTCA TAGTTATGAA GAAAACAGTC CCCACCCTCA 001040 001041 TGGAGGTCAC ACGCTAGTAG GGAAGATGGA TGAGAAACAT GTAAGCAAGT AGATTTGATT GTAATATTGG AGTGAAGAAC 001120 001121 AGTAAAGCAG GGAGAGGGAA CAAAGACATC GTTGGTGTGG CTGGGAAGGA AGGCTTTTAT TGGGAGATAA CAAGTGAGCA 001200 001201 GAGACCCTGG GGATATGGAG GTGCCAATCA TGAGAATGTT GGGGGAGGCC AGGTGCGGTG GCTCATGCCT GTAATCCCAG 001280 001281 CACTTTGGGA GGCCGAGGCG GGTGGATCAC TTGAGGTCAG GAGTTCGAGA CCAGCCTGGC CAACATGGTG AAACCCTGTT 001360 001361 TCTACTAAAA ATACAAAAAT TAGCCAGGTG TGGTGGTATG CCTGTAATCC CAGCTACTGG GGAGACTGAG GCAGGAGAAT 001440 001441 CACTTGAACC CAGGAGGCAG AGGCTGCAGT GAGCGGATAT CATACCACTG CACTCCAGCC TGGGCGACAG AGCAGGACTC 001520 001521 TATCTAGACA GACAGACAGG TAGGTAGATA GAATTAATGT TGGGGAGCAT CCCAGCAGAG CCTCCAGGTG GGACTGAATC 001600 001601 TGGTACAACA AGACAGAGAG GAGGGATAAC TAGGAAATGA AGCCAGGGTG GAGGCTACCA GGTCACATAG GACCTTATTG 001680 001681 TTTTCAACAA TCATTATTTC TAGATGTCGC TGGGCAGGAT CAGCTGTTCC TTCTCTTGTG CTATTCTGCG TGGTATGCAC 001760 001761 ACAGCCATTG TAGCAAGTCA CCCTTTGTTA GGACTGTGTA TTTTACCACC ACACTGGGAG CTCTGGAGGC CAGGGACTAT 001840 001841 GGCTGTTTTG CATATACCCA GCACCTAGCA TGGGAACTGT TTATCGAAAT TATAGCCACT GAAGGTTGGA AGGACGAGGG 001920 001921 GGCAACCAGG CCTGGGTATG GTATCCTTAG AGTGGTCCAA CTTCCCTCTT CCTGGTGATG TCCAAAGAGG AAGGAGCATG 002000 002001 GAGTCTGTGC TTCACACATT TAAAGATAAT GAATCTTGGT GGCAATGGAT GCTGTCTATT GGGGGTTTCT GTGTGCTATC 002080 002081 CACAGCAGTT AGCATTTTAT ATTTCATTGT CTCATTGAAT CAATACAATA GGCTTGGCCC CAAAGTGTCA TCTCCATTTT 002160 002161 ACGGATGAAG AAATAAGCTG TAGAGGCCGT CCCATGCCCC AAGCACACAC AGCTAGGAAC TTGCTGAGCT GAGATTTGAA 002240 002241 GCCAGGACTC TAAGGACCAC ACTTTCAACT CTGACCAAGC TGGAAGGTCG ACTGCCTCCG GCAGGGGCTT GGCCCAGGCC 002320 002321 TGCCCTTCCA AGGGCTGTGG TGTTTACTGC CGAGGGTTTT GGCATCCTTC GGCCTTCTTG GTCCCTCAGG GGTCACCTTG 002400 002401 CACCCCAGCC TTCCCTGTAG TATCCCACGG AGCCTGAGGG GCTGTCCCTC AGACTTCCTC ACAGCTTTGT AATCCACAAA 002480 002481 TGGAAAAGTG GCACTTAAAA TGAATTTATT CAGGTGGGAG CTGTGTAAGA CATGAAAATA AGGCTTTTGA GCTCCTCCAC 002560 002561 ATAAACTTGA GTGTAAATGA AAAGCTTTTT ATTTGGTTTC TGAGCCAGAC TTTGAGAAGC CAGTGGTCCG TGCGGTCTGG 002640 002641 AAGCGCATGT TATGGGCGCT GGGTCGGAAC AGGGGCCACT TTGAATGGCA AGGAGGGAGA ATCGCTCCAG CGAAGCTGGA 002720 002721 ACGGCCAAGA AGCTGGGATG TCAATAAACA CAGGCAGGAC ACAGAAAGCC TTTTCCGAAG AGATGGAAGG CGGGGAAGCG 002800 002801 AAGGAAAGGC AGTGCCTCAC TCCCGGTGCA TTTGAGAAGG AAAATGCTTG AGGTAACCCG TGTTGGCAGC TTTTATAGCT 002880 002881 TCCTCTCGAT AATGTGCTGG GCAGACCCTG AACAGGCTGG TGTTTTATTT TCGTGTAACG AAGCCGAGTG GACCAGGGGC 002960 002961 AGCTCTCCAG CAATGGCTGG CCTGGGGCTG TGCTCACGGA GAGAGGCAGG GCTACCTGAG ACCTGGGGCA GGGGCCAGCC 003040 003041 TCCTGCCTGC TGGGCAAACT GTGGGGAAGG CGGGGACCAG GATTTCTGAC TTCCTCTGGA TGATGTTTGG TTTGTCACAA 003120 003121 TCCTTAGACG TAAGCCCCCT TTATGTCCAG TGCCAGTGAG GGAGGTGGCA GGCAGCCGTG GGAAGGCAAC TGCTCTGAGA 003200 003201 AGGAGGGCAC AGGTCCACGC TGTCTAAATG TTACTGGTTG TGAACTACAC AGGAACCTTA GCATGGGGAA CCCACAGAAA 003280 003281 GGTCTTTGTG CCCTTTGCTG CCTTTTGCCA AGGTACTTTC TGTGACTAGA TTTACCTTGC ATTTATATTC TCTTCTCTTT 003360 003361 GGTAATATGT ACATATGTGT GTGAGGTATA TTTGGACCTG GGATTATGTG TGTGGTTAAA AATATACATC AGTACGTTAG 003440 003441 TGACCTGCAT AGGTGCATTA AGCAATGTAT TTTGAGAGCC ATGGGCTGGA AGGAAACACC CAACTAGCTA CTTTGAGACA 003520 003521 CTAGCAAAGC CCAAAGTAAT TCTGTTAGAT GAGCGTATGA GCTACTGTTT TATATGTTTG TTACAGATTT TTTCTTCCTG 003600 003601 TTTTCTCCTG GATATTATGT GAAAACAATG TATGTATTTA AACAAAATTT TAAATTTCAA TCTTAAGAAT TTCAGGGTTG 003680 003681 GTTTGTTTGT TTGCTTTGAG ATGGAGTCTC ATTCTGTCAC CCAGGCTGGA GTGCAGTGGT GCGATCTCGG CTCACTGAAA 003760 003761 CCTTTGCCTC CCAGGTTCAA GTGATTCTCC TGCCTCAGCC TCCCAAGTAG CTGGGACTAC AGGCGCGCTC CATCACGCCC 003840 003841 AACTAATTTT TTGTATTTTT AGTAGAGATG GGGTTCCACT GTGTTAGCCA GGCTGGTCTT GATCTCCTGA CCTTGTGATC 003920 003921 CGCCTGCCTT GGCCTCCCAA AGTGCTGGAA TTACAGTCGT GAGCCACCGC ACCCAGCAAA TTTCAGGGTT TTTTTCAACC 004000 004001 CATGTTTATT TTGAAAAATT GAAAATATCA GGAAAAATAA GAAGATAAAA GTCTTCTGCA ATGCTATCAT TCAAATAATT 004080 004081 GCTGCTGTTC TTTAGGTATT TGTATGCTCT TCCAGACATG TTCCTAATGA GTATATATTA ATGTAAAATA AAATATCAAT 004160 004161 GGCGAACATT TGAGTGTTAC CATTTGCCAG CATTTTGCAT GTAGGGACTC ACTTCATCCT CAACAATTCA ATGATTGGTT 004240 004241 TTATTATTAA ACTCATTGTA CAGATGAGAA AGCCGAAGCC CAGAATGATT ACCTTTCTTA AGATCACATA GGTAAGTAGC 004320 004321 CAAGCCAGGA TTTGAACACA GGCAGTCTGG TTCCAGAGCC TGTGCTCTTA AGGCACTTGC TATACTAACT TGCAAAAGAA 004400 004401 TTCGGAGAAT ACTATCCATA CCAATTTGTA ACCCGTTGTT TTTCCCACTC AAAATATTAT GAACTCCCTT CCATGCCAAT 004480 004481 AACCTTATAT CTAGAACATC ATTTAAAATG GCTCCGTTGT GATGCATACA TCCCCAAAAT GCAATGCTGC AATGCTGTTT 004560 004561 CCCTTTGTGG TTGTTTTGGG TAATAATAAC AATTAAGCTT AAAAATAGCA AATATTGAGG TTCTGTGTCT GGTGCTGTCC 004640 004641 TAAGACCTGT TTACATCTCA TCTAATTTAA TCCTTGTGAT AACTCAGGCT TCTTACAGGG GAAGAAGTTA GGGAAGAACA 004720 004721 GGAAGAAATT TGCCATGACA TCTTCCTTAC ATAAAACCTC AAGGTTAGAT TTTTGACATC GTCTTCAAAT AAAATTATAT 004800 004801 ATGTAATAAA AAAAAAAAAA AAAAA |
Predicted Small Protein
Name | NONHSAT104312_smProtein_2843:3088 |
Length | 82 |
Molecular weight | 8750.7198 |
Aromaticity | 0.111111111111 |
Instability index | 52.4161728395 |
Isoelectric point | 4.35638427734 |
Runs | 12 |
Runs residual | 0.00596844084703 |
Runs probability | 0.0410336807397 |
Amino acid sequence | MLEVTRVGSFYSFLSIMCWADPEQAGVLFSCNEAEWTRGSSPAMAGLGLCSRREAGLPET WGRGQPPACWANCGEGGDQDF |
Secondary structure | LEEEEEELLHHHHHEEEEELLLLLLEEEEELLLLEEELLLLHHHLLLLLLLLLLLLLLLL LLLLLLLLLLLLLLLLLLLLL |
PRMN | - |
PiMo | - |