NONHSAT122633
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT122633 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
4810 nt |
Genomic location |
chr7+:105548099..105564604 |
Exon number |
15 |
Exons |
105548099..105548507,105550673..105550920,105550955..105550971,105551485..105551687,105552929..105553086,105553098..105553124,105553144..105553163,105555279..105555650,105559225..105560263,105560470..105560489,105560497..105561187,105561943..105562740,105563408..105563655,105563669..105563692,105564069..105564604 |
Genome context |
|
Sequence |
000001 GGTCACTGAG CTCATTATCC CTAAGCAGCG TTCAAACCCA GGCAGTCAGG CAGGCAGGCA GGCAGGCTGG CTCTGGAGCC 000080
000081 TGTGCCACTG CCCTCTCTGG TAACTGATTA GGACCAAATA TCCCTGTGAT AGTCCTTCAT AAAGGCTGTC CTTGACAGCT 000160 000161 GAGCCTTTGA CAAATCTTGG GCAGCAGTGA GTTTAAGGAA ATGTGTGAAT CCTGATGAGT TCCTGAGGGG CCAACATTCT 000240 000241 GGAGGGCTAG TTACTGTGGA GCCGTATGCT TTACCCGTGG GCTGAGATGG AATAGGACCA ACAGATTTCC GCTCAGGGTC 000320 000321 CCAGTATGGG CTCTGGCTTG AACTGAAGGT GATCCATGAG CTGGGTTCAG AAAAGGGGGT AGGCAAATAG GGAAGCAAGG 000400 000401 CCAAAGTGCA GAGCTGTGAT CACTTGAGGG TGTCATGTCA ACCTTGTCTC ATGCACCAGC TGAATTCAGT GACACTAACA 000480 000481 TCCTGGATCC TGCAGATGAG AGTATTATGA CTGCACTACC AGGGAGGCTG CCCAGGGCTG GAGATTGCTC TGCTCCAGAG 000560 000561 GTGCTTGGTT TGCAGTGTGG GCGGACTCAA CCTGAGCGAA AGCCACACAA AAGCGCCTTG AGCTGGAGCC CCATGCTAGT 000640 000641 GTTGGTTCCA CCAGTGGCTC TTTTTGCCCA GGCTTCCTGA GGAACCTCAG ATTTGAAAGA AGAGTGAATG GAAGTCAAAT 000720 000721 GTGGAAACCA CTCAAATGTC CATCAACTGA TGAATGAATA AACAAATCAT GGTATATCCA TACAAGGAAT ATTATTCACC 000800 000801 CATAAAAAGG AATGAAGTCT TAATACACCT ACAACTGGAT GAACATTGAA AACATTATGC TATCCATTCT GCTTCAGACC 000880 000881 GGAAAGGAGA AACAGTCTAA ACTGCATGAA AGTCCTGGTA CTGCTGTATC TCGTACTATA AGTTCAGTTT TCTCCCTTCA 000960 000961 CCTGAGGGTA AGTTGAAGAA TAACTTCATA AGCCAGGGAG AGATACGTTT GGATCAGGAG TTAGTGATAA CAGGTTAATC 001040 001041 CTTTTTGCGC TCATGCTCAG ATTTATAATG ATGTAGATAT ACGTTTTAAA GAAAATTTAT GCTGGTGGTT TGCAGAAAGT 001120 001121 TGACATATAT CCTAGGGTTT CCTTGTCAGA GGGCAGCAGG GTCCCCATGG GTTCACATCG TTGCCTGGAC AAGGAGTCTT 001200 001201 TGTACCCTAG TGGTGCCTGG GTTGTCGCAG GGATGGGGTA ACTCATTTAT TCACACTCTT TCTCCTTTCC TGTTTCCTTC 001280 001281 AAAGATACCC TGGTCCTAGT TGACTCTACT TTACTGTTCA AAATTCAGCT CAACATATGT CAAGCTGTAC GTTCTTAAAC 001360 001361 TCAAGGAAAC AGAAAGGTTA TAACCCCGTG GTTTCCAAGC CTGGCTTTTC ATCAGAGTCT CTTGGTGAAC TTTTAGAAAA 001440 001441 TCCATATTCT TGCCTCCGAT GTTGTAAAGC TTTGCCCCGT GTTTTCTTTG AAGAGTTTTA TCATTTTTAG CTCTTGTGTT 001520 001521 TAAGTCTTTG ATGCATTTTC AGTTAATTTT TGCATATGTT GTTAGTTAAG GATCCAACTT CATTATTTTG CATGTGGATA 001600 001601 TTCAGATTTT CTAGCACTAT TTGTTTAAAA GACTGTCTTT TCCCCCATTG AATGGTCTTG CATTCTTGTC AAAAATAATT 001680 001681 TGACTCTTGT CATCTCACTC TTGAATCACA AGTCCTCTAG CCTCCAGATG TGGCTTCCCA TTCAGTCTGT GTATTGTAGT 001760 001761 TACAGCCCTT ATTTTGAAAT AAACTTGATC ATATCTCTCT CGGGCTTAAA AAGTCTTCAG TGGTTTTCCT GGCACCTTCA 001840 001841 GAATAAAATA AAGCCCTTCA GCATGGGATA CCTGCCCCTT CCTAGGGAGG CCTCACTTCT CACGCCCCCC TGCTTCATCT 001920 001921 TGCATGCCTC AGACTCTGGT GGTCCGTCTA CTGGTTGATC ATAGTGCTCT TTTATGCTAT TGGACATGTT ACTCCTGCTG 002000 002001 CCAGAAATAT CTGCCTCCCA CCTAACTCCC GTTCACCTAA CTCCCATTCA CCTACTTGCT GCTCTACTCA GTATGTCTCC 002080 002081 TCTGTCTACC ACAACCTGCT CCCCATTATG AGAAATGTGC CCTTCTTTGA ATGCCCTTAT CTTATTTCTT TCATAGAACA 002160 002161 TAATTGCTCT GTGTTGGAAT GATAGATTTA TCTGCTTGTC TCCTCTCTAG ATTGTGAGAT CCATGAGAGC CCAGAAAATA 002240 002241 TCTTAGGATC CCTAGTGCTT AGCACAGAGC TTGGCATATG GAGGTAAGCA AAAAGATTTT TTAAATCCAG TTGTCCATCC 002320 002321 ACTCATCCAT TCAATATGAA TGAGAGAACA GCCCTTTTTT AAAGAGGGAT GGCCTTTTCT TAAAAGACCT CTATCTTGAG 002400 002401 GCATTATTTT CCCTTATTCA AGTCCCAGTT TATTAGGTCC TGGTACTTAA TGGTGATAAA AGGGCATTAT TATTGAAAGG 002480 002481 TTGGGTATGT CAGGCCATGG GAAGTAAGAC TGGTTTTCAT CTTCTTAGAG TTTGGGATAT ATGGATGAAT TGAGTGAAAT 002560 002561 GAGGATTTTA GTGAGTATCT CTACTAGGTT CCTGACAGGG ATGGTTTCTC CAGGGATTCA ACTGAATCTT ATAGGTAGAT 002640 002641 CTGATCTGGT GAAAGATAAG GTATAAATTG GCCTGTCAAC AGGACATTCG TACTGGTAGT TATTACTACC ATGATGGCTT 002720 002721 TAAGAGATTT TGGGGGCTGG CAAGAAGTAG AGTAAATGTA GCAGAACAGA TGGATGTTAT AGGGAAAATA TAGCTGTAAG 002800 002801 GAACAGTGAG AGCTTAGTGA TGGTGGCAGT AGAGCATAAA AGACAAATTA TCTTTAACTT TGGCTTTGCA AAAATTGGGG 002880 002881 TATAAAGTGG GGCTATATTA CTGAAGCAGT CCGATTAGGA GGATTCATAA AAATGAAATG CATTTCATAT GCACATCTGG 002960 002961 GCAGCTATAG CCACTGTAGG TTTCTTGTAG GTCTGATAGA AAATGAGCCC TGTTGGTGAA TACCTTAGCC AATATGATGA 003040 003041 ATACCTTAGC CAATATGATG AATAGTAGTG AAAGGCTGTA CCTTGGGAGG CTCAGGATTA TACTGATGCA CCCATCACTC 003120 003121 ACAGTTACTA GACATGTGGC AGAGACTGAA TAAAGGGTTG ACCCAGGGAC ACTCCTTGAA TATGTCCCCT TTTTTTGATC 003200 003201 AGCATTAATG GGTGCAGCAC ACCAACATGG CACATGTATA TATATGTAAC AAACCTGCAC GTTGTGCACA TGTACCGTAA 003280 003281 AACTTAAAAT ATAGTAAAAA GAAAAAAAAA AAGAATGGAA TGCCTGATTC TGCATCCCTC TAGATGGGAT CAACAGGATC 003360 003361 AAAACATGAC CTTGATGGCC AAGCCAAGTT CTCTTTTTCC CAAAGGGTTT AAATCTGCCC TATTTAATAT GGTAGCCGTA 003440 003441 GCCATATGTA TTTATATTAA ATTCAAATTA ATCAAAATTT AAAAATTAGT TACTTAGTCA CACTAGCTAC ATTTTACATG 003520 003521 CTCAGTAGCC ACATAAAGCC AGTGGCTGCT GTATTGAACA GTGCTGATTA TGGAATATTT CCATCACTGC AAAAGGTTCT 003600 003601 GTTGGACAGC AGTAATCCAG ATCTCCTTTG TAGAAGAATT GGCACCTGGG GCTGCCTGAT CTTCAAGGGG CACATCACAG 003680 003681 AGCATGGTGT TGACCCATCT GGATACCATG TTTGAAGTTG ACATGGCCCA GAAATTTGAC TTCCACCAGT TAAAAATGTA 003760 003761 CTTCGACAGT TTTGTATTTA ACCTGTAAAT CCAGAAGCAT GTCAGGGCCT TCTGGATGGA GCACTGTGAA ATTAGCATAC 003840 003841 ACTTCAAACC CTCTTCCCTA AAAGATCATG CACTACTGTG CTGTACATGG TAGACCTATC TCATAGTGTA GGCTTCAGCA 003920 003921 AATACTCTTT GACAAATAAT TGGTTCTGTG CTGATGATAA GCCTAATAGG TGAAAGATGG TCTTTCCCTT AAAAAATTGA 004000 004001 GAGATCACAA ATTGGTGCAT TCTTAACTGT GGATAATTAT GTATGAAGCC AAATTGTAAT AAAACTGTTA TTTGCCAAAC 004080 004081 ATAGAAACCT TGCTTCGTGA CTAGGCTGAG CCCAGAGTGT TATTCTCTTC TTCTGCCTGT TATGTATCTG GGCAATTGTA 004160 004161 GGGCCTTTGC AGGTGAATCC AGTAGAGAAC ATGCTAAGAA TGAGCAGCAG TCATGCATGA CTGTATCATT GAAAGCCTTC 004240 004241 TAGCCATTTT CGTGGTTTTA TTTTTTCAGA GGGGCATCCA CCTGATTTCT GGAAGGCATT GCTTCACTTT CATAGACAGA 004320 004321 AGATGTAACG ACTGCCACGA AGATCCCTGG AGCTCTGCCT GGCGATTCTC ACTCCCACTG GGTAGCTGAC AAATTTTCCC 004400 004401 TGAGAAGACA TGGTTTCCTC CAGCAGCAGC AGAAGGAGTC ATGGGTTTTT CTGTTTCATG GCTCAAACTG TTGTGACTCG 004480 004481 GAATTATACT AGGACAGGGC CCTGACTTAT TCTTCTGGGC TGGGCTGAGT AAGGCCTGGC TGAGTGTAGC TGGGTACACA 004560 004561 GGTAGGGGTG GAATCAAAGC TGGAGGTACC CATAATGCTG GTTAGGAGGA TAACTAGCTA GAATAAGGTT GGCAAGAACT 004640 004641 GGGTGAGCAG GGATAGGAAT TAAGACTGAG GACTGGGCTG AGGGTAGGTC AGGGGTTCTG GCAGCAACAC AGCTGGAATG 004720 004721 AATGGAGAAG CACACAAACA AAACCAAGCT AAGAAGAGCC TAGATCAAGC TTGTCCAACC CATGGCCCAC AGGCTCCATG 004800 004801 TGGCCCAGGA |
Predicted Small Protein
Name | NONHSAT122633_smProtein_434:679 |
Length | 82 |
Molecular weight | 8581.7711 |
Aromaticity | 0.037037037037 |
Instability index | 56.1543209877 |
Isoelectric point | 5.47283935547 |
Runs | 10 |
Runs residual | 0.0187229171777 |
Runs probability | 0.028351881293 |
Amino acid sequence | MSTLSHAPAEFSDTNILDPADESIMTALPGRLPRAGDCSAPEVLGLQCGRTQPERKPHKS ALSWSPMLVLVPPVALFAQAS |
Secondary structure | LLLLLLLLLLLLLLLLLLLLHHHHHHHLLLLLLLLLLLLLHHHLLLLLLLLLLLLLLLLL LLLLLLLEEEELHHHEEEELL |
PRMN | - |
PiMo | - |