ENST00000602433.1

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.


Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

ENST00000602433.1

Source

Gencode19

Same as

NONHSAT003019

Classification

intergenic

Length

2445 nt

Genomic location

chr1+:47004368..47035927

Exon number

8

Exons

47004368..47004466,47006602..47006681,47009366..47009440,47023897..47024014,47025342..47025483,47029184..47029248,47033878..47033969,47034154..47035927
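The exon list can be checked against the Length and Exon number fields with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive at both ends (the page does not state the convention); the helper names `parse_exons` and `spliced_length` are made up for illustration.

```python
# Exon coordinates copied from the "Exons" field of this entry.
# Assumption: 1-based, inclusive start and end positions on chr1 (+ strand).
EXONS = (
    "47004368..47004466,47006602..47006681,47009366..47009440,"
    "47023897..47024014,47025342..47025483,47029184..47029248,"
    "47033878..47033969,47034154..47035927"
)


def parse_exons(text):
    """Turn 'a..b,c..d' into a list of (start, end) integer pairs."""
    pairs = []
    for block in text.split(","):
        start, end = block.split("..")
        pairs.append((int(start), int(end)))
    return pairs


def spliced_length(exons):
    """Sum of exon lengths, counting both end positions of each exon."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_exons(EXONS)
print("exon count:", len(exons))
print("spliced length (nt):", spliced_length(exons))
print("genomic span:", exons[0][0], "..", exons[-1][1])
```

Under that inclusive-coordinate assumption, the summed exon lengths agree with the 2445 nt listed under Length, and the first and last coordinates match the Genomic location field.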

Genome context

Sequence
000001 ATTTCATTCC AAGTCCAGCC TTCAACCCCA GTGCCTGAGC AGGAGGGAGG GACATTGCTC TGACTCTTGA GTTGCGAGGA 000080
000081 CTGGAAGACC TTCACTTTAG CCCTGCAAGC TCTGGCTTCC AGCACAGCTG AGAGGCCCCA GTGGCACAAG ACTCCGAAGG 000160
000161 CTTCTTTGAG CGATCACAGG TTCTTGACCA CTTGTCTGGG CACTGCTCCT GTCTCCTGGA GCTCAGGCTG GATGGAGTCT 000240
000241 CTCCCCTAGT TCAGGCTGCA GCCCCAGCCG CCACAGCTAC AGCGGTGAGA GCAGGCTGGA TCCCAGATCT GCAGGCGGGT 000320
000321 GAGCAGGTGG AGCCCAGTGA AGAGATGTTC CTGCTGCAGG CAATTATGCA AAATGCTTGC TCTGTTGCCC AGGATGGAGT 000400
000401 GCAGTGGCGT GATCGCAGCT CACTGCAACC TCCGTCTGCT GGATTCAAGC GATTCTCCTG CCTCAGCCTC CCAAGTAGCT 000480
000481 GGCATTACAG CACCTGCCAC CATGCCTGGC CATGGCTCAC CAAGAAACAA CCACCTCCCG CCTCCACACA CCCTGCAAAA 000560
000561 CTAATTTGAC TGAAAGATGG TCTCATATAT TTTCCAAACT CTCAGGCTTA CATCAATCAC ACTGATGATC AAGAGCGAAG 000640
000641 AAAGGACGAG GATGGGCAGA AACATCACAC AATCACGATG AGCAATGCCT GACATGACAA AGAGCAAAAA AATGGTTAAC 000720
000721 ATATGCAGAT AATTATCAAA CATTCCACAC ACAATTAATT TTTTCCAAAT GCTGTAGTTC GCAGATTAAT TCCCAATTAT 000800
000801 GGTACATTAA AAAATCAACG AGCTGCAGGA GGCAGAAGTA ACCAGCGACA CAAACCAAAA GCACAGTTGA TTCTGCGGGT 000880
000881 TCTTTTGAAG AGGAGGCTTT AGAAGTCTGG CAGGGATAAG AAGGGCTAAA TGAATCCCAG CCAGCCTACC CCTCAGTACC 000960
000961 ACAGTGACCA CCAAGAAGAA GGCCTAATGC AGAGCAGAGC AGAGAGCTCT GCAAACGGCA ATGCTTGGAA CAGCAATGCA 001040
001041 GAAATAATTA GTGCCTTGTT TAAGAGCATC TCAGGGGGCT TAACTGCTCT ACCTCATTTC TTCTTGTGTG GCCGATTTAA 001120
001121 AAAAAAAAAT TCCTTCAGTG CGACTGTCTG AATGTTCAGA GGAGCAAGTG TTAGTTACCA AATCTCCCAA AGTGACCTAT 001200
001201 TCCCTTCTCA CCTGCTTTGC ATGTCTAAGG CAGCCCTGCC AGCAGCAGAG CAAAGACCGC TTTGGCACCA GGATGCTGTT 001280
001281 ATTTTTGTTT TTTAAAATAG AGATGGAGTC TCGCTATAGC CAGGCTGTTC TTGAAATCCT GGCCTTAAGC AATCCTCCTC 001360
001361 CCTTGGCCTC CCTAAGCATG GGGATTACAG GTGTGAGCCA CCTCGCTAGG CCCACTGAGG GTTGCTGTTT TTAAGAGAAA 001440
001441 ACCTCTGCAC TTTGGCCAGG CCTCCTGAAG GCGACTGGAG ATAGCAGCTT AGCCCATGAG GGGTTAACCT GCTGACCTGG 001520
001521 CCAAGAAGGC ACCTCGGGGA GAGCCCTCTT CTAACCAAAG AAAGGGACAA AGAACTAACT CTGGGTCATG GTGTTTTGGT 001600
001601 GGAGGACACA GAAACCTCCC TTTTGTCTTT CCTTCCACAG CACATGCAGC ACAGAGACAG GGCTGGAAGG CTGAGCAAGA 001680
001681 GTCAGAGCTA CAGTTGGAAG TATTGGAAAG CGGCAGCGGA TTCTCGGGGT GATGGGAAAA GCAAAATCTG CTTCACTAAA 001760
001761 ACTCCGTCGG TGATCAACTT TTTCAGCAGT CCTCCGATTC CCGACTAGAA GGCAAAAGCC TGGCCCAGGC CACGGGAGGC 001840
001841 ACTTCACTAA AGAAGAAAAA CCTGAGCTCC AGCAAGCTCA GTAACTTCAT CAGTCACCTA ATTGAACGAG AACCCAGAGC 001920
001921 AACTCAGTCC AGGCTGCACA AATTGAGTAG CACTGCTCTG CAGCTCGCTG CACTAAAAGT GGAAGATATT TTCCATTACA 002000
002001 GCAAGTGCTC CCACAGAAGG ATCACGAGCC ACACACGGAG CCTTACTCTC CAAATAGGCG CCCATGGCCC AGTCGTCCCT 002080
002081 TAGCCTAAGA ACTCAGAGAG CCAGGCTGTA ATAATGGGGC TGCCGTGTGA GCACAGCAGC CATCGAGCTG CTCCGGTTCA 002160
002161 TTTCCCAGAG GTTGAGCTGC AGTGTGTCCA GTATTTAAAA GTATGGAACT TGTTTCAGGG CTGACCCATT CTTGCCCAAT 002240
002241 TCATCTTCCC CGTGTGAAGT TATTTAGCAA ACAGCCAAGT TCAGCTGGAA ATCGTTTTTC ATCCCCTCCA ACTTCTAGTC 002320
002321 ACATGGGCAG CTGGCCAGCT ACATGTTAAC AAGCGCCTCA GAACTCCTAA GACACCACCG GCCTCTGTTC CCTTCAGTGC 002400
002401 ATTCTTGTAA ATAAATAAAT AATAAAACCT GCCTTTACCA CCCCC

Predicted Small Protein

Name ENST00000602433.1_smProtein_344:568
Length 75
Molecular weight 8361.5609
Aromaticity 0.108108108108
Instability index 88.2108108108
Isoelectric point 9.64849853516
Runs 9
Runs residual 0.0210283454186
Runs probability 0.016918879664
Amino acid sequence MFLLQAIMQNACSVAQDGVQWRDRSSLQPPSAGFKRFSCLSLPSSWHYSTCHHAWPWLTKKQPPPASTHPAKLI
Secondary structure LHHHHHHHHHHHHHHHLLLEEELLLLLLLLLLLLLEEEELLLLLLLLLLLLLLLLHHHLLLLLLLLLLLLLLLLL
PRMN -
PiMo -
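Some of the values above can be recomputed from the amino acid sequence itself. The sketch below counts residues and computes aromaticity as the fraction of Phe, Trp and Tyr residues, which is the usual definition but is assumed here, since the page does not say how its value was derived. It also reads the `344:568` suffix of the name as an inclusive coordinate pair on the transcript, which is likewise an assumption. The function name `aromaticity` is illustrative only.

```python
# Amino acid sequence copied from this entry (wrapped line rejoined).
PROTEIN = (
    "MFLLQAIMQNACSVAQDGVQWRDRSSLQPPSAGFKRFSCLSLPSSWHYSTCHHAWPWLTK"
    "KQPPPASTHPAKLI"
)


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


# Assumption: the name suffix 344:568 is an inclusive coordinate pair,
# so the open reading frame covers (568 - 344 + 1) nucleotides.
orf_start, orf_end = 344, 568
orf_codons = (orf_end - orf_start + 1) // 3

print("residues in listed sequence:", len(PROTEIN))
print("codons in ORF span:", orf_codons)
print("aromaticity:", aromaticity(PROTEIN))
```

The listed sequence has 74 residues, while the ORF span under the inclusive reading holds 75 codons. One plausible explanation is that the Length field of 75 counts the stop codon, but the page does not confirm this. The aromaticity computed over the 74 listed residues agrees with the tabulated value.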