NONHSAT103903
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | NONHSAT103903 |
Source | NONCODE4.0 |
Same with | , |
Classification | sense |
Length | 2769 nt |
Genomic location | chr5-:134670791..134681677 |
Exon number | 2 |
Exons | 134670791..134670831,134678950..134681677 |
Genome context | |
Sequence |
000001 GCCCTTGGAA GTAGCTGGAG GTGAGGTGGG AAAGCCACCG TGCTTATCTG GTTGTGACAC ATGCAGTATT AGGAAAGCCA 000080
000081 TTTTGTGAGA GGGCTTAGAA TTCAGAGGTG TGTGGCAGCA CAGCCGCATC TTCAAGATGG AGGCCAAGGA CAGCCTGTAG 000160
000161 CTTTACATCT TAGAGTTTCA CCACATGCAG GTTGTCTGGG AAAATCACCT TTGCATTTTT TTTTATTACT TAAAATAGCT 000240
000241 CCTATATGTT TGATGTCTAG ATCACAAATT TTAAGAAATT ATAGGAGCTG AGTTTGGTAC ATCTAAGAGC TGAGAGTGAG 000320
000321 TGATACTGAG CCATCTGCAA AACTGCTGGT GAGAGATACA AATTAAAAAT CTTCAGCCAT CTGTGTGTGT AAGAGGAAAC 000400
000401 ATATAAATAC ACATTTCTCA TTTGCCAATT ACACCTTCCC TGGTATAAGG GGAAGAAAAC GACTTGGCAT CTGATGTGGG 000480
000481 GAGAATTATT GCTAAGACCT GGAGGCAGGG TACTTCTTAC TGCTTCAGGC CTGAATTTAG GTGGTCATTT Ggggtgtgca 000560
000561 ggacagagca cctcctaagg acctgagagt ccagtcagtg gggaggccca gctcccccac caactgtgtg aggacagatc 000640
000641 acccacttcc tcacttgggg ctctggtttt ctcttctTAC TACCCGATTC CAATGCCTAT TGTGTCTTTT AAATCCTCTG 000720
000721 TTTTAAAAAT GAATCTTAAG CTCTCTCGGA GGGTCCACCA GGCCGTGCAC CTCTATTCCC AGTGACAAAT GACACCCCTT 000800
000801 CCAGGCTAGT CTCAGGATCC AGTAGGATTC CCTTTGCTGA TCTGTGCCTT TGACCTCTGC TCTGCCACTT CTGTTTTACC 000880
000881 TGCAAGGGGC ATGACATACA GGGTAGTAAT TATGAAGTTC TCCCAGGCAG TTCCACCAAA GTCCCACATT GTACAAGGGA 000960
000961 AGGCATACAC AGTTTTAACT ACCTAGCCAG GGGCAGCTGG CCTTTCCCCA GGCCCTTTAA AACACCTGTT TGACCAGCCT 001040
001041 GTGGGTCCTT TCCCATCGGG GTTCCAGCTT GAGTCCTGCT GGTAGCCCTT GGCACAAGGC CTGGTAGGCA GCCTGGTGCT 001120
001121 GCCATGTACC AAACTCTGCT GGGGAAACCT CAGCCACAGC AGATGCAAAC CCATTAGGTA TAGCATGGTT CAAGATATTT 001200
001201 TTGCTTGTTT CAATAACACT TTTGTTCTTG ATTTTAAGGG CTATTTTAGA AACTGCTTTA AAATATTTTT gaaagtataa 001280
001281 agaggtgaat aaaacatact cctgttccac tgtgcaaaga tagtcactgt taatgtttct ttccagtcgt tttctatgca 001360
001361 taGTTTAGCC ATGACTTCTC ATGGGGACAA ACCCACTGGT TTAATTCAAC CCCCAATGTC AGGTTTTTAT AGGTCAGCCC 001440
001441 AGAGTTCTCA AGCCCAGTCA CAAAgcagag ccaggactca aattcacacc ctgtgactga gctgtttcca ctgtaccatg 001520
001521 agacactacc CACAAATCGC TCCTTCTCCA GTATCTATGA CACGACCATA ACAAACCATA CCCCTTGGAC AGATATTCCT 001600
001601 TAAAAAAAAA AAAAGTTTCA GAAATAGCCA ACCAAATACA AAAAGTTTGG ATGTTTTACT TTAGCAGTGG TGTGGAGACT 001680
001681 CTGGTGTGCT CTTGTATAGA AGCGTAGCAT GCAGCTAAAC CCAGTGCCCT GGAAACCTTC TTTGCAGAGC ATCTGTGGAC 001760
001761 AGGGGTTCTG GAACATGCTC TGGGAGACAT GGTACTTTGC AGTTTGTGGC AGACCCCAGG CCTGTGTCCT TAGATCTGGC 001840
001841 AAGGGAAAGG CAAGTCTTTG GTGGAGATGT ATAGCTAGAA AGGGGCCCCT CTTCTAGGAA AAGAAAAACA CAGTTCCTTA 001920
001921 AAAACTAATC TAATTCTGCC ACAAATAAAC AGGATCCGTT ATTCACACAG TGAGGCCATC CTCAGCAGGG AGAAGACAAT 002000
002001 GGCAGAGTGG AGATGCTGGT CAGGTTGCTT AGCAGATCAT TTAAAGCAGT TGCTTTCCTA GTAGCTTTGT CCTGAATTGG 002080
002081 GTTTGGGAGA TTTAGAGAAT ACATCATAGT AGAGACAAAT ACATCATATT GGTGTGCAAG TCGATAATTA GCACACCAGT 002160
002161 CGACTTGAAG AAGAAGGGTT GGAAACCCAA AACAATTATT TTGAGGCTGC AAATGTTCTG ACACGTTGAG TTTGCGTTCA 002240
002241 GTTGAGTTTT GATGGATAAG GCTCTGCCTC TGGGAGAACC GTGTATGCTT TCTGAGAGTG GAGAGAGGCT TACTACGAAC 002320
002321 TTGGTCAAAA TTCCTGGTGG TGACACATCA TAATGCCAAC TTTCCAAGGG TGTGGCTTGG CTCCAGCTCA CGTTGGAATC 002400
002401 ACAACGCAGC AGACAGCTGT GGAACTCCTC GGACACAGGG CATTGGAGGG CCCAGGAAGA TATACCGTGG CAGGGGAGTG 002480
002481 GGTCTGTGTT TGGCATCTTG TGCTTTGCAG TGAACCAGAG CTCACTGTTT CTGTTTCATC CCCGCACCAC TAGCTGCTGT 002560
002561 CAGCGCAGGC CATGGCCTGC CTGCCAAGTT TGTGATCCAC TGTAATAGTC CAGTTTGGGG TGCAGACAAG TGTGAAGAAC 002640
002641 TTCTGGAAAA GACAGTGAAA AACTGCTTGG CCCTGGCTGA TGATAAGAAG CTGAAATCCA TTGCATTTCC ATCCATCGGC 002720
002721 AGCGGCAGGA ACGGTTTTCC AAAGCAGACA GCAGCTCAGC TGATTCTGA |
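The exon coordinates and the numbered sequence listing can both be cross-checked against the stated transcript length of 2769 nt. The following is a minimal Python sketch, not a tool used by the database. It assumes the exon coordinates are 1-based and inclusive, and that the six-digit numbers in the listing are position markers rather than sequence. The helper names `exon_lengths` and `strip_position_markers` are hypothetical.

```python
import re

# Values copied from the Basic Information table above.
EXONS = "134670791..134670831,134678950..134681677"
STATED_LENGTH = 2769


def exon_lengths(exons: str) -> list:
    """Return the length of each exon, assuming 1-based inclusive coordinates."""
    lengths = []
    for part in exons.split(","):
        start, end = (int(x) for x in part.split(".."))
        lengths.append(end - start + 1)
    return lengths


def strip_position_markers(listing: str) -> str:
    """Remove six-digit position markers and whitespace, keeping letter case."""
    tokens = listing.split()
    return "".join(t for t in tokens if not re.fullmatch(r"\d{6}", t))


lengths = exon_lengths(EXONS)
print(lengths, sum(lengths) == STATED_LENGTH)

# Last row of the listing: it starts at position 2721, so 2720 bases precede it.
last_row = "002721 AGCGGCAGGA ACGGTTTTCC AAAGCAGACA GCAGCTCAGC TGATTCTGA"
tail = strip_position_markers(last_row)
print(2720 + len(tail) == STATED_LENGTH)
```

Letter case is preserved on purpose, since the listing mixes upper- and lower-case bases and the page does not say what the distinction means.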
Predicted Small Protein
Name | NONHSAT103903_smProtein_1355:1495 |
Length | 47 |
Molecular weight | 5027.5933 |
Aromaticity | 0.0434782608696 |
Instability index | 88.082826087 |
Isoelectric point | 10.9032592773 |
Runs | 8 |
Runs residual | 0.0125517598344 |
Runs probability | 0.0129173290938 |
Amino acid sequence | MHSLAMTSHGDKPTGLIQPPMSGFYRSAQSSQAQSQSRARTQIHTL |
Secondary structure | LLEEEELLLLLLLLLLLLLLLLLEEEHHHLHHHHHHHHHHHHHLLL |
PRMN | - |
PiMo | - |
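The predicted peptide can be reproduced from the Sequence listing by translating the open reading frame with the standard genetic code. The sketch below is an illustration under stated assumptions, not the prediction pipeline itself. It assumes the `1355:1495` suffix of the entry name gives 0-based offsets of the first and last ORF base (1-based positions 1356 to 1496), and that aromaticity is the fraction of F, W and Y residues. The bases are copied from the listing and upper-cased; the function names are hypothetical.

```python
# ORF bases copied from the Sequence listing (1-based positions 1356..1496),
# written in the listing's 10-base groups and upper-cased for lookup.
ORF = (
    "ATGCA "
    "TAGTTTAGCC ATGACTTCTC ATGGGGACAA ACCCACTGGT "
    "TTAATTCAAC CCCCAATGTC AGGTTTTTAT AGGTCAGCCC "
    "AGAGTTCTCA AGCCCAGTCA CAAAGCAGAG CCAGGACTCA "
    "AATTCACACC CTGTGA"
).replace(" ", "")

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(orf: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def aromaticity(protein: str) -> float:
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    return sum(protein.count(x) for x in "FWY") / len(protein)


protein = translate(ORF)
print(protein)
print(len(ORF) // 3, len(protein))
print(aromaticity(protein))
```

Under these assumptions the ORF spans 47 codons, of which 46 encode residues and the last is a TGA stop. That would be consistent with the listed length of 47 if the figure counts the stop codon, although the page does not state this.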