NONHSAT077043

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT077043

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5069 nt

Genomic location

chr2-:220358356..220363424

Exon number

1

Exons

220358356..220363424

Genome context

Sequence
000001 TGGAACTTAA GACTGGGACC CGAGCTGTTG GGGAATCGGC CTCGGGGCAG CAGCCGAAGT GAGGAACTGG GTCCCACAGG 000080
000081 AAAACGGGAC TAAGGACAGC AGGGGTGTGT GGCCAGCCCT GGAAGGGGCG GGGCCAGGGG CAAACAGCCA CAAGGGGCTG 000160
000161 GGGCTGGAGG CGCTCAAAAG CGGAGAAGGC GGAGTCCCAG GCCCAGCCGG CGGTACCGCT GGGAAGTGAG GGCGCCTTGG 000240
000241 ACGGCGAGGG CAACTCTTAT CAGTTTGACT GTCTGAAATG CAGCACGGCG CTAGGCGCTC TCCAGACCTC CCTGTGACTC 000320
000321 CAACAGGCTG TGACCTTGGC GTGGGGGTCT GGAATACGTG TGGGCGCTGG TGCGGGGGAG AGGGAATGGC GACGCGGAAG 000400
000401 AGCAGGACAG ACGCGGAGGG TCAGAGAGCA GCTTTATTAA GCAAGCTGGT GGAGGGCTGT GGGTCCCACC ACCCCCGGGC 000480
000481 TCGGCGCCAG GGCGGAAGGT CCGGGGGTGC CGTTCAGGCC ACGTCGACGC GAGCGAAGCT CTGGTCCATG CCCAGGCTGT 000560
000561 TCTCCACGTA CACCTCGTAC TTGCCGCTGT CCTGAGGCGT GGCCCGGCGA ATGGTCAGCG TCGTGGTGGT GCTGCCGATC 000640
000641 TCGAAGAACA CCCTGCGGAA GGGCGGCGGG GAGGAGGGCG GAGCCGTGGT CAGTACACGG GCGCACTGGG CGGGGCTCCT 000720
000721 TAGCCAGCTC TGTTCCTCGC TGCCCAGTCC CCGCCCCCGT CACTCCAGCC AGATTTCTGG GCTCAGTCCC ACCCAGGTTC 000800
000801 CAGCCCCAGG CCCTGGCCGT GCCGCAGGTC CCGGCCCTCG CCTGTCATCC TCCTCGATGT CCTCCCCGTC CTTGGTCCAG 000880
000881 CCTACGTCGG GCGCAGGCTC TCCCAGGATC TCCGCAGTCA GCGTCACGGT GGTGCCTTTG CGCGCCTTAG TGTTGTCGGG 000960
000961 TCCCTTTTGA ATCTTCGTCG GGACTAGCGG AAGGAAAGGG GAGCGAAACC CCGTGAAGGA CCCGGAGATG AGGCAGGGCA 001040
001041 TCACCTCACC GGCTAATACC CGGCACTAAA TACAGATGca tttaaactcg cagtgtgacg ttgagcaaat cattagcctc 001120
001121 tctggtttca tctccttggc tgttaaaggg acttatccct gcctctccta cttctcgggg tcgaatagag gatcaagaga 001200
001201 gataaggcgg gcgaaagccc ctGAATGTTC ACACATCTCG CGGTGCCTCa gagaccgctc cgggtagtgg aaattgccag 001280
001281 agttgggagc cagaagaccc gggtccaacc ctggctctgc ctctggcttg ctgtgtgacc ttgggGACCC GGAGCCGCGT 001360
001361 GATGGAAGTC CACAGCTGAC TGCTCAGCCC GGGTGGAGCT GGATCCCGAG GTAGGGAAGG ACGCACTTGG CTGGCGCGGC 001440
001441 GATGCGCCCC AGGCCGTGCC CAGAGCTGGG CTGGCGGGGA CCCTGTAGCT AGGCCCAGAG GTAAACACGC GCAGCTGGAC 001520
001521 CGGGCTGCGG GCTCTGGCTG AGTGGTGCTG GTCTGCGCCC TGGCGCTGCG CCCGGGGCGC GCGTACCTTC CACGAGGATG 001600
001601 CGCGCCGAGT CGGAGCACTG GCCGAAGGGG TTGGTGACGT TGATGCTGTA CTGGCCCAGG TCTGCCAGCT CGCCGTCGCG 001680
001681 CACCACCAGT GCCACCACGT CAGGGTCCTC GAAGACGTAG CGGTACTTGG GACCGTCACG TAGCTCCTTG CCGTCCTTAC 001760
001761 TCCAGCGGAT GAATGGGTCC GGGAAAGCCG AAATGCGGCA AGTGAGCTTG GCCGCGCTGC CTTCGATCAG CACCACGTCC 001840
001841 TTCAGCGGCT CCAGCACCCG CGGCGGCCCC TTCTCGCGCA GCTCCTTCCG GGACCTACCC GCGGGCCGAG TCAGGGCCCA 001920
001921 GAGTTCCTAA GACAGGCCCG GCCACAGCCC TTACCGCGAG CGCACCATCT CCTACCTGCC CCCTGGACCG GCCAGACCCC 002000
002001 TAAACGTGTG GCTTGGTAGG GAAGAGGACT GCTCCTGAAC TGCTGCCCTG ATAGGATGAT GCATTCCTCC CGGCCACTCC 002080
002081 CCCATACCTC AGTAACGACC CTCCCCTGAG ACCACTCATT CTAAAGGCTC CTGACGCTGC TCAGCGCTTA CTCAAACCCT 002160
002161 ACTCAAGAGT GCGCTCTCTG CAGCTCCCTT CCTCCACAGG GCTCCGGGGT TCCCTCTCAC CTGTGGCCCC GGTAAGAGGC 002240
002241 CTGGATACGA ATGGCCGCAC TCTGGACCTG TGGGTCATTG ATGTCCAGGG TACATCCAGG AGCTGGGGCC ACAGCCACTG 002320
002321 GCTTTTTGGC AGGAGCTGCT TTAGACATCT GGGGAGTGGG CGACAAATAT GGGGGTCCAG GTTGAGACAG AGGCTCCCAC 002400
002401 CTTCAGCCCT TTTCCTCCCT GGTCAGAACA TTCAGGGAGA TGGAGGAATG GGGGTGTCTG CGGCTTCTCT GAGGATCAGA 002480
002481 CTCACCGTGG AGGGGTTGAC CTAGGGACTT GAGTCGACTG GCTGGATTGC AGGCAGCACA GAGCCCTTCT GGATGGCACA 002560
002561 GTAGGGGTGC AGGCAGGTGC TCAGCCCTGG GCCCGGTTTT ATACATCGCT CCCCTCCTTG CCTTGTTTGG CGTGGTTGCC 002640
002641 CCAAGCCCCC CGGGGCCCCA TCCGCCTCCC TCTGACTCAG GCAGGCTCTT TATTTAGCCT GTGGATGCCT GGGATGCTGG 002720
002721 CTCAGCCGCT GGGCTAGGCC CGTGGGGAGA AACCCGAGCA GCTCTGAGGT GCACCTTGCG TCAGCACCTG GACAGCTGGA 002800
002801 CAGCTGGATG CTGCCCTGTC CCCTCCCAGG GCAGGGGGTA TCCATCCCAC TCCCAGGAGG ACCAGATTCA GCCCCTAGCT 002880
002881 GACAAGAGAT CCAGCTTCCT GCTGGGGCAG CTTTTCAATC AGGTTGGGCT CTGGATAGTC AGACAACACA CCCTGTTGGT 002960
002961 TAAGGGGTTT GAGTGTTTCT GGGGTTTCCC TGAATACAGC CAGAGTAGCA TCGGACTTGG TCCTCAGGGC CCCTACAGCT 003040
003041 AACGGCAACA GGGCCACGTG GCTGTCAAGA CCAATCCAGA AGCTCCTTAC TCTTCCCCTA CCCCCAACCC CAGGAGTCCA 003120
003121 GGGTTGTCAC TTCCACACTG TGCCCCCTGG CGGTAAACTG AGATAAACCG GGAGGGGACG GGAAAGATTT TTGAGAATAG 003200
003201 CAGCCCCTCC ATGCCTTCCC CATCCCCACT CACTCCCCTC GATGTCACAT GGTGGGGGCT GTTTAGACTG CACTGTCCTT 003280
003281 GGAGATTAGC ACCACTACTG GCCCCAACCC TAGGTCTGCC CAATTCCTTT TTGGGGGCAT AAATTGGTGC ATAGGGGCAT 003360
003361 CAGACCACCT GGGGTGAAGG GGTTCAGTTA ACCCCTCTGG GGATTGCCTG GAGGCCAGTC TGCCCAGGCC GGGCACCGGG 003440
003441 CAGAAGCATT GCTTCCATTC ATTGGAACTA CCCGCCCTAG TGTGGGGGCA CTGTCACGTT TTGATCAGAG CAATTTTGAC 003520
003521 ATAGCTCTGC TGACATAAAT TACCTTTGCT CCTCCGGGTT TGGAACAAGC AGTTCACATG CCATTTCCAA TGGAGATGGG 003600
003601 AAGGGAGCTC AAGATCAGAG CGGGGACGGG AGAGGAATTT TACCATTTAT CAAGATATGA GTTCAGGCAC CTCATTTTTA 003680
003681 TACAAAACAC TTGCAGTGTG TAGGGGATTG GAGATTCTTA CAGGCAAACT GTATAtcttc atcgctaaaa tgacaggtag 003760
003761 gcaaggtgtc gttctggtcc cttccagctg gctttaacat cctgttattt tatgatGATG GGGCTTGCCT CTTCCCTGGG 003840
003841 GTCCTGAGAA GCTAGAGAGG CTGGGCTGGG CTGGGCAAAA GGGTTGTCTG GGCTGGAGCC AAGGGGCCCT CAACCTCCAT 003920
003921 CAGCAACACA GCAGTGACAA CTCTGTCTAC CCTGGTCAGA GGCTTTGTTC CTTTTACTTT GCCAAGAACA GAATTTTGAA 004000
004001 GGTGGGGATG AGGGAAACAT TTCCTCTGGA GACAAGGAGG TCTGGGTCCC TGCCAACCCC CCAATCCCTC GTCCCCAACC 004080
004081 TCAGGGCAGA GCAGCTCAGT CCTAGGTCCC AGTGACCATT CTATGGCACG GGTGTGGCTA GAATTGAGGT GCTTTAGCAC 004160
004161 CCAGCAGACC TCCTAACTTG CCATGTGGAC AAGAAGGTCC AGGCTGGTGT GTCCCTCAGC CCTAGGAAAA TGCTGGACAC 004240
004241 AGCAGACTAG AGCACTCCCA GCTGGTGCTC TCCATGTGCT CTGATATGTG CTGTGACATG GAGGGCAAGG GGTGTGAGCC 004320
004321 TGTCCTAGTG tttctttttg agatggagtc tcactctgtc atccaggctg gagcgcagtg gtgcaatctt ggctcattgc 004400
004401 aaccttcacc tcccaggttt aagcgattct cctgcctcag cctcctgagt agctgagatc acaggtgtgc accaccacgc 004480
004481 ctggctaatt tttgtatttt tagtagagac ggggtttcat catgttggct aggctggtct tgaactcctg acctcaggtg 004560
004561 atccagccac cttagcctcc cTGTCCTGGT TTCTTCTGCA GCTCCTCACA CTACCTACTC TCACCAACCA TCAGGATGAA 004640
004641 ACCCACTCTG CTCCAGGGGT CAGGACCAGA GCTCCTGCCT GTCTTCTGGG GAGAGTGAAC AAGGCTGAGC CAGGATTGGG 004720
004721 GTGAGGGCTG GCCTATTTCA GAGCCCAGCA TCCCCCACAT TGTGATGAGG GAACACGTGA CAAGAGGTGG GGGCTACTAG 004800
004801 AGCCTGGTTG CAGGGGTTCA GTGGATCTGC CAACATTCCG GAAACTGCAA GGGGAGAAGC AGACCACAGT CTCCCCTCCC 004880
004881 TCCAGGAATA AGGAGAGGTG AGGGGCCAGG AGGGTCACGG GAGGGGGTTG GAAGGACCCA AAGTGCTAGC TGGTATATTC 004960
004961 CTGAATGTGT GTGTTGGGGA ATACCCAGAG AGGGCTAAAT GCCCCTGTCC CGGACTCACC CCTCACCAGC CATCTTCCCC 005040
005041 TCCAGCCATC CCTCCCAATA GACACACAG
[back to top]

Predicted Small Protein

Name NONHSAT077043_smProtein_3656:3796
Length 47
Molecular weight 5037.0152
Aromaticity 0.108695652174
Instability index 46.7130434783
Isoelectric point 9.86248779297
Runs 10
Runs residual 0.0560300207039
Runs probability 0.0461402579051
Amino acid sequence MSSGTSFLYKTLAVCRGLEILTGKLYIFIAKMTGRQGVVLVPSSWL
Secondary structure LLLLLHHHHHHHHHHHLHHHHLLLEEEEEHHHLLLLEEEEEELLLL
PRMN LLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTToooooooooooo