NONHSAT101661
Please input one-sentence summary here.
Annotated Information
Transcriptomic Nomenclature
Please input transcriptomic nomenclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID | NONHSAT101661 |
Source | NONCODE4.0 |
Same with | , |
Classification | intergenic |
Length | 4572 nt |
Genomic location | chr5+:60474133..60479163 |
Exon number | 3 |
Exons | 60474133..60476024,60476179..60476280,60476586..60479163 |
Genome context | |
Sequence |
000001 CAGGTAATGC CTTGGTTTGC CCAGCTCTTG GTAGAATTTT GAACTGACAG CGGATTACAT TGTGTCATGA GGTGGGAAGG 000080
000081 GCTGGGAAGG CAACATCCAC ATATCCCCAT TATGATCTAG TCATTGTTTC CATTTCAAAG GAGCGATATA TTCAGCTCAG 000160
000161 CACTTCCGTC TATTTAAGAT ATGTTAGCTC CATGAATCAG AGTGAATCTA CATATTCCCA TTGTGAACCT GACATTGAAA 000240
000241 TATACGTTTT CCAGTTTCAA TCTACTCACC TGTTATGAGT GAAGCTTACC TTTAAAAAAA AAAAAAAAGG AGCAGAGATA 000320
000321 TTTAATATTC CCAATATTTG CTTTTGATGA ACTGATACTT TTGAACTTTT AAAACTATTT TATGTCTTCT GTACTCACAG 000400
000401 CTCGTATACC CCCACAAGAG TGGAGAAGCC ACAGGGAAAT ACAACTGGAT CTTACTGTGA AAGGCAGGAT TATGAGGTCT 000480
000481 CGATATCAAG GCTGCCTAGA GGCTGGTATA AAAAAATGCT GATGTCCACA TAACTTGTAA CTCTTAAGTA AATCAAACCT 000560
000561 GTAATTCAGC TTGGCTGCTT CTAAGTCATT TTGATTCCTG CCAACAGAAC ATTAATGAAC TTAGCTTCTC CGCAAGTTTA 000640
000641 TTCATGTGTA ATACCCAATT CCTTCTTCCT CGTTCCAGCA GTTGCCCCTG GTACATCTGA TAAGGTGGAA AGCTAGACTA 000720
000721 GGTCATACAA ATCCAGGTCA TTACTTAAAC CCAGATTATT ATAGACTGTT CTCTAATTTT TCTGTCTGGT TTTAGAGTGG 000800
000801 CAAACAGAAG AAAGTTTCTA GGGCTTGGGT TTGGAAAAAA AATCCATACA CTAATAATTC TTGTCTTGAA GAAACATTTT 000880
000881 ATAAAATTTA GCTTATTCTA ATGCCTTTAT ATTTCCTACT TTTATTCTCC TTTTGACAAT GATAAATAAT TCAATTATAT 000960
000961 TGCAGTTTTG CTTTTGGATC TTGGTTTACC TCCATGGATT TACAGTGATG GGAATATTCT AGAACAAGTA GTCATCTAGA 001040
001041 TTTCCCCAAA TGCCCTGAGA CTGATATAAG AAGCATACTA AAATGGACAT GAACCTAATT TTTCGAAGTA TAAAATAATA 001120
001121 TAAACcagtt gtgaccaaac ttgcctacaa attagaattg cctggagatc tctaaaatgc caaagcccgg gacatatccc 001200
001201 aaaccaatta aatcactatt tctcgaggtg ggacacaggc gtcaggagcg ctttaactcc caggatgatt cctatttgta 001280
001281 gacaagctta ggaaccactg ACCTAGACAA CTGTGTTACT TGAGGATTCT AAGTCGGCAC AGCTGTGGCG TGTTCACTAC 001360
001361 CCAGCCCTGC CCTTGGCATG ATGTTCTCTC CTATCCAGCT TCTCCATCAT CTTCCTCCAC CTGATTCGTA GAGACCTCTG 001440
001441 GACATCTGAG CCTAAGCAGG GCCACTAAAT GAGAGAGAAT CCCATCACTG AAAAGCAACA ATGAATATAT TAAGGACCAC 001520
001521 AGGTGGGTGT AAAATAGATT TTTGCAAATT TTGTGAATTG TTATTTGTTA TATTTTTCCT GATAACAAAA GTTTTATGTC 001600
001601 AAAGATTTTT GTCTTCCAGT TTTAGTTTCT TATGCAGTAG TTTTAGGAAA CTCTGGTGTG CCAAGATGAT ATTTTTTTTT 001680
001681 CATGCTTGGG TGATTGTTTT AGAATTCCTT CTTTGAGTTC AAGAACATCT TTAAAATATC TAACTCAGGG GTATTATGTG 001760
001761 TGTTGAGTGA GAATTGTCTA TTTGCCTTCT TTAAAGAAAC ACAATTTGAC ACAAAGATAG GTATTAAAAT TTTTATTTTC 001840
001841 TGTGTATACA CAGCACATTA GTTATTTATT TAAGCAACCA CTGGCTGTCC AAGAGAGATA GAAGAAATAT TGCAACTGAA 001920
001921 GCTGCTAGAT GGTTATGATT ATCTGAATAG GTGTGATCTT GACTTTAGCG AATTAACAGG TAAGCTTCTT ACTAATTTTA 002000
002001 TTCTATAAGT GATTATTGGG GGCAGAGGGT AGAAAAAAAG TGAGAATACA TGTCTGTCTC ATGATAAACA AAAGCCTGAG 002080
002081 ATGTTACAGT TACTTACTAA AATCACTTTT ATCATATGAA GATTTTGGAT AAATCATTAT TAAGGAAACT AAGGACAGCT 002160
002161 GTGGCATGGA ATCAGCGGGC TTTCAGTTTG AATCCCAGTT ACTCACAAAT GTAAAGTGGT ATTTTCATTC CTCTTTTGTT 002240
002241 AAAATGCTTT TTTTAAATGT AGGAAGTTTA CATTTTGTCC TAGATCACAT TGGCTTCTGT TCCCACAGGA AATGTTATAT 002320
002321 GCTATATGTC AGTTTTCATT TCATTAAAAG TGAAAGTAAA CATGGTATGC ATCAGCCAAA CAGTAACTAA TCCCGACATA 002400
002401 ATTTCAGTAT TAAATGAGGC TTGGAATATG TATTTGAATG GGCATGATAG TTTGGGAAAC TCTAATTCAG AGATGTCCCA 002480
002481 CTTATAACAG TGATGTTGGA GAATAGGGTT ACCAGTTATG GCCTATATTT GCTCTGTCAT ATCACGTTCT TACAGTATGT 002560
002561 AATTATATTT GGAGAAATAT CCTTTGAGCC ATAGTATTTT GGAGTCAAGT ATTAGGCTAT AATTTAGACA TTGTTTTAGT 002640
002641 AGAATAAAGT AACTTTATAA GATTGTTCCA GATCTTTGTT CTATATCTCT GTCTTTTATC CTTCCCCCTT GCAGAAGTCA 002720
002721 TTCCTATTTA CCTTATTTAA AAAACGAGAG AAAGTATAAC TCAGGCTGAA ATAAAGCACC AGACTAAAAA GAGATGCCCA 002800
002801 TTTTCATTCA ATATTTATAT TTGCCAGAAT AAAGGGAGAC AAATAATTAT TTTGCCTTTA AGGCCTTGAT TCTGGATGAT 002880
002881 AGATTAAGTA GACAGACTGT TGACCACATG TGACTTAAGG AATATTGTTA ATTATTTTAC CAATCTTTGG ATCAAGGTTT 002960
002961 GCTTGAAGTA AGTAAGCTGT GCTGTGGATA AACCTTTGGA TGATAAAGAG TGAGCTGTAT TGCCTTCAAT ATTGTTTGCA 003040
003041 TCAGAACATT ACTAAGTGGT ATCGATTTTG AAAAATAAAA TTCAACTTTC TGAGAACATA GACCAGATTC AAGTATTTTT 003120
003121 ATCTTTTTTT CCCCCCtttt ttcagagaat ggaatctctc cacgttgccc aagcaggtct cgaactcctg tacttgagct 003200
003201 attctcccac ctctgcctcc ctaagtgctg ggattacaag cgtgagccac tgttcatggc cTTAGGGACT TTTAAATGTT 003280
003281 AAATTGATTT TGACCTACCC AGGAGCTAGA GAATACCAAT GATTACAGAG GGGAGAATAA AATTGCCGTA TAGATTTTCT 003360
003361 GTGCTCCTTT AGAAAATCCA TTCTGGAGTA AAGAGATGGG CAATTGGTCA ACTGAGTTTC TTTCTTGTAT AATATGTCAG 003440
003441 AGTTTGATAT TGTCAGTCTG TTAACCAAGA TTTTTATTGG GATATTTGCT GAAAGAGGCT CTAAGAATAT CCCATCACCA 003520
003521 GTAATTAGAA TTTGTTACAG GTTTCAGTCA GGGTAGTTGA AAGGGAGCTA ATTTGGTAAC ACAATAACCT AGTACCCAAC 003600
003601 TAATTCAGGA GCATGAAGAA AAATTGGTTG ATTAAGGCTT AAGTCAGATA AATAATACTA TAGGTTGACA TGGCTGGGTT 003680
003681 AGCTCAGGTT TGTATATTTG TTTTTTCAAA GAGGCAGACT TTTCCTAAGA TTAGAATCCA CAGTAATGAT GGAAGCAGGA 003760
003761 AACTTAGGGG CAGGCCATAG GTGTAAAATC AAAGTTGAGC TGCTTCTGTG TTACAGCTGT AAGCACAGGA AGTTGCCCAT 003840
003841 TTTTTTCAAG CACTATTTCT GCATAGTCAT ATATCAGCTA GTGTTGTCAA CATAGACACA AGAGACAATA GGAATACGTA 003920
003921 CATGCAATTC CACCATCAGC TAAAAAACCC ACATAAGTCT TTGAAGCTCT CAAATAACAC GCATGACACT GAATGGTACA 004000
004001 TTTTGATGCA CTTTGAAACT CTTATCTCTT TAGTTGGTAA GTACATTTGA ACACTTAATA TGTGCTCAGC AGAAGAGAAG 004080
004081 GAGAACAAGG GAAGTCTAAT ATAAAATCAG GATTTTTTCT TTTCTCTTTG AAGCCAATTA GAAtgtgtgt atgtgtgtgt 004160
004161 gtgtgtgtgt gtgtgtgtgc gtgtgtgtgt gtgtgtgtgt gtATTATTTA GTGTAGTAAT CCTTGGGTTT TTTTTAAAAA 004240
004241 AACGTGTTTT ACtatgaagt gtaacataca agaaagtgat aacatatgac atatacagct tacaaatggt tgtaagaatt 004320
004321 aatacacttt gtaagtgcca cccatggtaa aaataagtag gtggagaagg aggagaagaa aagaagaatt taactagcat 004400
004401 gtcagaagtc tcatggatct ctttccagtc acaaccctct caccaagctc gattcctatt ccctgagtaa gttctaatct 004480
004481 tacttttctt tttttttttt tttttgagag agagtctgac tctgtcaccc aagctagagt ggtgcaatct cggttcactg 004560
004561 caatctctgc ct |
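The Length, Genomic location, and Exons rows can be cross-checked against each other. The following is a minimal sketch, assuming the exon coordinates are 1-based with inclusive ends (this page does not state the convention). The helper names `parse_exons`, `exon_lengths`, and `intron_lengths` are illustrative and not part of any database tool.

```python
# Exon coordinates copied from the "Exons" row of the table.
# Assumption: 1-based positions with inclusive start and end.
EXONS = "60474133..60476024,60476179..60476280,60476586..60479163"


def parse_exons(text):
    """Turn 'start..end,start..end' into a list of (start, end) integer pairs."""
    exons = []
    for part in text.split(","):
        start, end = part.split("..")
        exons.append((int(start), int(end)))
    return exons


def exon_lengths(exons):
    """Length of each exon under the inclusive-coordinate assumption."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Gap between each pair of consecutive exons."""
    return [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


exons = parse_exons(EXONS)
lengths = exon_lengths(exons)
introns = intron_lengths(exons)
total = sum(lengths)
span = exons[-1][1] - exons[0][0] + 1

print("exon lengths:", lengths)
print("transcript length:", total)
print("intron lengths:", introns)
print("genomic span:", span)
```

Under that assumption the three exon lengths sum to the 4572 nt given in the Length row, and the exons plus introns account for the full span of the Genomic location row.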
Predicted Small Protein
Name | NONHSAT101661_smProtein_2165:2389 |
Length | 75 |
Molecular weight | 8731.9771 |
Aromaticity | 0.175675675676 |
Instability index | 41.7783783784 |
Isoelectric point | 8.69122314453 |
Runs | 8 |
Runs residual | 0.0345418589321 |
Runs probability | 0.0464140611199 |
Amino acid sequence | MESAGFQFESQLLTNVKWYFHSSFVKMLFLNVGSLHFVLDHIGFCSHRKCYMLYVSFHFI KSESKHGMHQPNSN |
Secondary structure | LLLLLLEEHHHHHHHHHHHHHHHHHHHHHHLLLHHHHHHHHLLLLLLLEEEEEEEEEEEE EELLLLLLLLLLLL |
PRMN | LLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLHHHHHHHHHHHHHHHHH HLLLLLLLLLLLLL |
PiMo | iiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooTTTTTTTTTTTTTTTTT Tiiiiiiiiiiiii |
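The amino acid sequence and the aromaticity value above can be reproduced from the transcript sequence. The sketch below makes two assumptions that this page does not state: the `2165:2389` suffix in the name is read as 0-based inclusive transcript coordinates (1-based positions 2166 to 2390), and aromaticity is taken to be the fraction of F, W, and Y residues. The ORF string is copied from those positions of the Sequence row; `translate` and `aromaticity` are illustrative helpers using the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: ORF spans 1-based transcript positions 2166..2390 (225 nt),
# copied from the Sequence row of this entry.
ORF = (
    "ATGGAATCAGCGGGCTTTCAGTTTGAATCCCAGTTACTCACAAATGTAAAGTGGTATTTTCATTCCTCTTTTGTT"
    "AAAATGCTTTTTTTAAATGTAGGAAGTTTACATTTTGTCCTAGATCACATTGGCTTCTGTTCCCACAGGAAATGTTATAT"
    "GCTATATGTCAGTTTTCATTTCATTAAAAGTGAAAGTAAACATGGTATGCATCAGCCAAACAGTAACTAA"
)


def translate(orf):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y) in the protein."""
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


protein = translate(ORF)
print(protein)
print("codons including stop:", len(ORF) // 3)
print("residues:", len(protein))
print("aromaticity:", aromaticity(protein))
```

Read this way, the ORF has 75 codons including the stop codon and yields 74 residues, so the Length value of 75 appears to count the stop codon, while the aromaticity value is consistent with 13 aromatic residues out of 74.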