NONHSAT097492
Revision as of 01:03, 17 October 2014 by 124.16.129.48 (talk)
Please input one-sentence summary here.
Contents
Annotated Information
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Function
Please input function information here.
Regulation
Please input regulation information here.
Expression
Please input expression information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Please input cited references here.
Basic Information
Transcript ID |
NONHSAT097492 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
5648 nt |
Genomic location |
chr4+:99580055..99585702 |
Exon number |
1 |
Exons |
99580055..99585702 |
Genome context |
|
Sequence |
000001 AGGCGAGAGG GAGAAGGAAG GGCGCCCGGG GAGCAGGCTG GCAGAGTGAC CGGGATTGAT GAGCGGCGCA GCTCTGGAGG 000080
000081 GACGCGGGCG AGAGGGTCGC TCCTCGGGGA GAGGGCTGGG TGAGTGAGGA CCTGCGAGCC ACCCCCGACA CCCCTTCCAG 000160 000161 GGTAGCCACA CTGGCGCCCC CTCCTCCACA ACAGGCTTAT GTGGCTAGAG AAGATGCGAT GCCACTGGAC AGGGAAGGAA 000240 000241 AGTCGTCGTG GACTTCGCTG AATGGAAAAG GGGGTAGTTG GGGAAGGTGG AGAAAAGAGC CACGGGAAGT GGGAGGAAAG 000320 000321 GATGAAGGGA TGGGACGTTG AAAAGGAAAG CTTCCTGAAT CCCTGTGTGA CTCTAACCCA ACCTCCCCAC CTACCGGAGA 000400 000401 AATGGGTTTT ATCCTAGAAA TGGGGACGAG GAAAAAGAGA TGGCTCAAGG GAGTCCTGGC CCGGGATGCG ATCCTCGCCC 000480 000481 TAGGGTCTAT TGCAACCACA CAAACCCGGA CAGCGCACGC TCCCCCCTAC CCCGACAGAA TAGTGCCTAA GAAACCTGCT 000560 000561 TGCCTCCTAA CTCTAGGAAG GAATCCACAA CGGATTCCAG GGAGGTCTTC TGGGGAATCT GAATTGGGTT TTTGGATCAC 000640 000641 GAACCATTAA GCTTTTGAAC TCTGGGGAAG AATAGAACAG GTGGCTGGAA ATTGTTTTAG AACCCTACCT GAGCAGGTGT 000720 000721 GCTCATTAGT TGCTTATCCG CCCTCCTCCT GGAGCACTTT CATAGGTGTT GGCAAAGCAG GTACCAGTAT TTCACGGAAG 000800 000801 GAGAAGGAAG AACCGCCCAG GACTGTGTAT AAATGTCTAT GGGAATATTT TTAAGGATTC AAAAGGCTAG AAGGAGGTGA 000880 000881 GATCAAAAAA TTTTTCCAGC ACCACCCAGT GCAACCCGAT CCTGGACACA GAAATCTAAA AGATGATACT TAAATACTTG 000960 000961 AGTTTGAGAG CCTCTAGTTA TAAGGTTTCC TACGTAACAA CCTCTCACTT TTCTCAAAAA CTGGCTAGTT GACAACCGGC 001040 001041 AGTCAAGTTT TGGCTAGTCA AGAAGCTACT TTTTCCTGCA ATAAACCACA TTTAGAGATA GTACTGGGCT TCAGCAGGAT 001120 001121 TGAGGCTATA TTGAGGTCGC AAGTATGAAC GAGGTACCCA CAGAGATGGG AGGGCCAGGA GATCCTACTC ATTCAGTACC 001200 001201 TCAGAAGCTA TGACACGGGA TAACAGTAGC TTCAGCCCTT TGGATGAAAT AATCCTCACT CCCTCAAAAG TGCTCTCTAC 001280 001281 CCACTCAAAA CACAGGCCAT ACTACCACCA ATCTGGTAGA CTTCAAAAGT AAGGTCTTTC AGCAGCAGAT TTCCTTGTTA 001360 001361 ACAGGAAGGT ACTGACAAGC TTTACCTAAG CCTATTGGAG CTATGTTGTA ATTCTGTATA TAAATAGAAC AGTCGCTGCA 001440 001441 GGAATACTGG ATGAGTCAGA AGAAAGGTCT GTCCTAAGCT GATGTGACAC ATTTCAAAAG GACCAGAAAT TAGATGCAAT 001520 001521 TGATGACTGA GTTGATCTCT AAGTGGATTT ATCTTTGAAA TAAAAATCAG TGGATATCCC ATATACCCGA CCCCACACCC 001600 001601 TCCCCCATTG TCAACATCCC CAACCAAAGT AGCACATTTG CTACAAATAG TGAACCTACA TTGCCAGATC ATTATCACCC 001680 001681 AAAGTCCATA GTTTACATTA GGGTTCACTC TTGGGGCTGT ATATTCTATA GGGTTGGACA AATATAGAAT AACATGAACC 001760 001761 CACCCTCATA GTATCATAGT ATCATACAGA GTATTTTCAC TGCCCTAAAA GTCCTCTGTA CACCACCTAT TTATCCCTTC 001840 001841 CTCCCCCCAG TCTCTGGCAG CCACTGAAGT TTGACTGTAT CCCTGTCTCC TTAGTTTTGC CTTTTCCGGA ATGCCATATA 001920 001921 TAAGGAATCA TACAGTATGC AGTCTTTCTT TCAATTTTGC TGTGAAACTA AAACTGCTAA GAAAATAAAG TCTTAAAAAG 002000 002001 AAAAATCAAT GGAGAAGGGA AAGGTAAGAT AACTAATCAG TAATTTGGAG GGGGGGGTGG AAAAACTAAT TGAATTATCT 002080 002081 CATTTCGTAT ATCCAAAATA AGTTCTATGA GGATTAAGGA GTTAAATATA TTTTAAAGCA GACCAGAAGG AAAAATATTT 002160 002161 GTCAAACCTC TGTATAGAAG TATTTCTCAA CTTAGAGGCA ATAGAAGGAA ATAATCAATG GATATGATTG GGTAATATCT 002240 002241 TAAACACTTC TACGTGTCCA GCTTTTTTTA GTGTAAGCTA AACCAGAAAA AAAATTGGCA GCAAACATGA AAAATAAAAA 002320 002321 CCTGAATGTC TATATTATAA GTAGGGCTCA TAGAGCTATA GTAAAATATA GAATCGCAGT GGTTGAGTAT AAACAGAAAA 002400 002401 TAAATCATAT AAGAAATAAA ATATATGGGG AAATTAATCT AGTAATAAAT GAAGTATACA TTAAAACTAA TAAAGATACA 002480 002481 AAAATAATGA GCACTGGTTA GCACATACAT TGTTGGTGTC CTTATGAAAT AATATGTCCA TTTGGAAAGC ATTTTGGCTG 002560 002561 TATATATCAG GTCAGAGTGC TCCTACATTT TGACTCAGCA GTCCCTCTTC TGGGAATCTA CCCAAAGAAA TACCCCAGCA 002640 002641 GATGGAAAAA GTTATACACT GCAAGATGTA TATCATAGCA TTATTTATAT TGAAGACTTG GCCAGAAATT AAAGATCAAA 002720 002721 GAGTAGGGGT GATATCCAAT TCCTTGTAGT TACTTGGACC ACAATGTTCC TTCACATTCT GTGCCTCTGC ACATGTCATT 002800 002801 TAGAGTTTCT GCCTGGGGCA TTCTCCTTGT CTCTTGTCTG TGAGCTGATT TCTCTTATTC TTCACTACTT TTGTTCAATG 002880 002881 TCTCGGAGGC TCCTTCTTTA TTCCCTCAAC ACCCGCTGTG TTCTTCCATC CCGGCACTTA AAGGCTGCCT GTAAGAACTT 002960 002961 GTTTACCTAC CTGTCTCCTC AATCTGCTGT GACTCCTTCT AGGGTGTGAC CTAACCCTTG GTAATCTTAG AATCACCAGA 003040 003041 GCTCTGCAGA TATGGCAGGT GCTTGATAAA TATTTATTAA AGGAATGAAT GAGTCATTTG CTCACAGTTT TTACAGCTCT 003120 003121 TGAAGATACT GCTTATAGAG TATGTGGCAA TATATGGATA CATTATAATG TTAAATTTTA AAGGCACTAT ATATGGAATT 003200 003201 ATCTCCAAGT AATATTGTTA AATTTTAAAA ACAAGGAACA AAAACAGTGT ACACAGTAGG CTATCATTTT GTGAAAAGAT 003280 003281 AAGCTCTGTT GGCAAAAGAA TATCCATACA CATATTTCCT TGTTAATGGG TAGGCTATCT ATGGAAGAAA ACGAAATAAA 003360 003361 CTGGTAACAG CGAGTTAGGG TGGAATGGGG GGCAAACTGG GGGACTAAAG ATACTGATAG GAAGGAGACA TGTTTTATTT 003440 003441 TTAAATAACC CCTTTAGTCT GTTTTTGAAA TTTGTACCAT GTATATGATA CACGTTTTGA AAAATAAAGC AGTCTACAAA 003520 003521 ACGGTATGCA GTTGTATGAA AAAAAGTTAG GAGTTAATTT TTGCCAAATA AGAACAGTTA TAAGGCTGGG TGATTTTGTT 003600 003601 TTCATGTTTC TGGTTTGCAA TTTTTAAGTG TGGTTAAATT GCTTTTACAA TTAAGAAGAT AAAATTATAC ATAAAGATGT 003680 003681 AAGAAAGTAA CTAAGTAATC ATATTTATTT CACGCAACTC CTTCACAACT CAATAAATGC TGATCTATTT TAAGGAGAAA 003760 003761 AGGAGAAAAA TTTTCTGTCT GAATACTGCT AAGGCCACCT CCCTTATAAC TCTGAGGTTG AAAAATGGTT AATAAGTTTA 003840 003841 ACTACTGACA AGTTAATCAG CTCTTCAAGT GGCAGTGAAA ACAGGTAAAG CGTGTTCTTC TAATAGTTGC CTGGCTGAGG 003920 003921 AGCAAGATAT GACAACAGCG TTTTAACCTA AGACACCAAA CTTAACGAGC TTGTCTGTCT CCACCTGTGC TTTATTGTTC 004000 004001 CTGCCCACCT TCCTGCCTCC CTCCTCTTGG GCTCTGTCAT TGTCTGCCTC TCTGCTCCTC TCTGTTTTTC TTTTGCTTCT 004080 004081 CTGTAGCCGC CCTGCTGGTT GCCTCCACAC CCCTCCTGCC GCCTTCCTCT CAGAACCCTA GCAGCTGTGT TCAGCGTCCC 004160 004161 CCTCTTAAAG CAGCCACATG CTCTGGGAAG ACTGGCCCTC TGCGTCACAT CTGGCAGTCC CATTCCTCTT GCTAATGTGT 004240 004241 GGTTCAGGAA TGGGCAGGTG GCCCAATTCT GGTCAACAAG CCATGAGAAG AATCTGCTAG AGGTTTCCTT GCATCTAAGA 004320 004321 AATGGTCTCT TTCGCCCTTG GACTTGGTTG GGTTTGGATG TGATGCCTCG ATCCTCTACA GCCTGATGAC AAAGACCACA 004400 004401 CCAAGGATGG CAGAGTGGAA GGATAAAAAG AACCAGGGCC TTCCAGCCTT TGTTGTATGG CTGACTTAAC CAACCTTGAA 004480 004481 GGTGGCCTTA CTCACGACTT CCTGTCAGAT GAGACAAATT TCTTGGTTAA GATACTTTGA CAGGGGTTTT CCGTTTCATA 004560 004561 CAACTGAAAG TATCCTGACT TAAACACCTT TTGAGGTTCT GCCCCAAGCC TTTATTGATT TTCTCCTCTT GAATCACCAC 004640 004641 CAGCATTTCT GCATTTCTCT TAGTACCACG TTTTTTCTAT TCCCCACACC CACCACCCCC CCTCTTCCCT AGCTGTTTCT 004720 004721 GTCTCAAACA ATACAGGGAA ACATTTTAGG GTGTCTGACA GATCACACCC ACCCCTTTCT CAGACCTTCT CAACCACAAG 004800 004801 GTGCTCTCCC ACCTCCAGCG AGGTTTGTAC TTCACAGTTG ATGGGGTCCC GGTAGATTCA GGCTGGGCCA ACCTGAGAGT 004880 004881 CTGTCTCAGT AACAAAGGGA ACCCTGCGCC CAGCTGAAGT GCAGGCAATG TTTACCCGAA AGCTGGGGGT GGGTCCGCAT 004960 004961 TTCCAGGAGG AAGCCAAGAC TCTCCCTCAT CCTCTGCAGA AAGAGGAAGG AAGCCTAAAC AGAGAAAAAC AGTAGCAGAG 005040 005041 TCTGACAGCC TGGGTGCTGT GTATGGTTGT GCATTTTGTG CTCTGAAAGG CTCCCATCCC TGCTAAAGGC CTGCTTCTAG 005120 005121 ATTTAAAGAG GACTCTGTCT TTGAGGTCTA TGAGATTTTT AGATGGAATC ATTCATCACA TCCCATTTGG GCTTTAGATA 005200 005201 CTTTGAAGTG AGTATCTCAA ATTTTCAAAT CAAAGAACCC TGATTAAAAC AAAAGGTCTT CTCACCTCAA ATCCTTTATG 005280 005281 AAATGAGGTC AACTTAAAAT ACTTTTTAGT TAGCTATGAT ATAAAATTAA ACTATATTTT TGTTATAGTT AATGAATTAA 005360 005361 TATTGTATCT TCTGTTCCTT AGTCGAAAAG TGAGCAGTTA AATGGTCTAT TTCTCTAACA AAACAATTAA CTGATTCCTC 005440 005441 AGGATTTACC GCCCACTGTG ACCCTCTTGC CCTGTGCCCA CACATACCTA CTAAAGACCA GTTTTCAGAT CCCCAGGAAT 005520 005521 AACATAAAAT GGGGCATATC TGTCTCGTAA ATATGATTGA ATAAGCATAT GTAGAGCACA GTTGTTTCAC TTTTAAGAAT 005600 005601 CAATTATAAT CATTTGTACA ATAAAATGAA TTATGTCTTT CACACAAT |
Predicted Small Protein
Name | NONHSAT097492_smProtein_401:649 |
Length | 83 |
Molecular weight | 9065.5215 |
Aromaticity | 0.0609756097561 |
Instability index | 49.5975609756 |
Isoelectric point | 10.9357299805 |
Runs | 9 |
Runs residual | 0.0261456023651 |
Runs probability | 0.0183222438125 |
Amino acid sequence | MGFILEMGTRKKRWLKGVLARDAILALGSIATTQTRTAHAPPYPDRIVPKKPACLLTLGR NPQRIPGRSSGESELGFWITNH |
Secondary structure | LLEEEELLLLLHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLEEELLLL LLLLLLLLLLLLLLLLEEEELL |
PRMN | - |
PiMo | - |