ENST00000414769.1
MIR503HG, a lncRNA which is likely originated at least 370 Myr ago in the tetrapod ancestor, may functions like H19 by promoting miRNA transcription, preferentially in the placenta and in an imprinted manner.
Contents
Annotated Information
Transcriptomic Nomeclature
MIR503HG: MIR503 host gene(HGNC nomenclature)
"H19 X-linked co-expressed lncRNA", H19X, MGC16121[1]
Characteristics
MIG503HG could potentially promote the transcription of between 2 and 7 miRNAs in different species, such as Human,mouse,Opossum,platyous,xenopus,. This lncRNA (that we name H19X, for H19 X-linked co-expressed lncRNA) is ikely originated at least 370 Myr ago, in the tetrapod ancestor[1].
Fuction
H19X is a neighbour of Rsx, the lncRNA that drives imprinted X-inactivation in marsupials, suggesting that H19X may itself be imprinted. These results suggest that H19X may function like H19, by promoting miRNA transcription, preferentially in the placenta and in an imprinted manner[1].
Expression
Expression pattern appears to have dramatically shifted during evolution, from an ancestral testes-predominant pattern to preferential expression in the chorioallantoic placenta of eutherians[1].
Evolution
Expression pattern appears to have dramatically shifted during evolution, from an ancestral testes-predominant pattern to preferential expression in the chorioallantoic placenta of eutherians[1].
Labs working on this lncRNA
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland[1].
References
Basic Information
Transcript ID |
ENST00000414769.1 |
Source |
Gencode19 |
Same with |
lnc-PLAC1-1:2, MIG403HG, H19X, MGC16121 |
Classification |
intergenic |
Length |
720 nt |
Genomic location |
chrX-:133677473..133680660 |
Exon number |
3 |
Exons |
133677473..133678029,133678744..133678827,133680582..133680660 |
Genome context |
|
Sequence |
000001 GAAGGTAGAA GGTGGGGTCT GCCGGACGCG TGTTCCTGCC ACCAGGTGCC CGCTCCCCGC GAGGCCGGCT CAGGAGCAGA 000080
000081 AGGAAGCCCG GTGCCAGCCA GCCTTCCTGA AAGACCAAGC CCGCGCCATC CGGCTTCCTC CAGTGGACGC CTGCAGGACC 000160 000161 CAGGAATGTT TTTCTTGAAG GCATCCAGCA TCTCCAGTTA GCAGTACTGA TTTTTTTTCC CCCCAACAAA GGAACACTAC 000240 000241 ATCAACACTG TTGGCGGGGA CCTGGACACA GAAGACTCCT GTTTCAAGAA AATACAATCA TCTCTCAAAG GCTGTAATTT 000320 000321 ATATGCATTT TAAAACTCTA GGCATTGAAA ACCACCCAAG TGTCCCAAAT AGAAGGGTAA TATATAATCA ATCACTCAGT 000400 000401 GTAATATTAT ACATCCTTTA AAAATGTTAT TGGGAAATGT TTTATGATCT GTAAGTCCAA GGAATCCTCT CCCACCATTT 000480 000481 CTTTCCCCCC GCTGTTCCCC CATACCCACA CTTCTTTGTT CCAATTGGCA TGTAAATTTG GTTTTCCCGC CAAATGAGTC 000560 000561 AGTCATGATG GGAACCTCAA CTGATTTGAA CAGATGTGTG TCAATGTTAC TTGGAAAACT AGATGTCAAT AACCAGGGTC 000640 000641 ACAGAAAAAG GCAGTGGTCA CAACTCTGTA AAAATGTATG CATGCACACA GACAAGAACT AAAGTGGAAC CCCACACAGG 000720 |
Predicted Small Protein
Name | ENST00000414769.1_smProtein_443:556 |
Length | 38 |
Molecular weight | 4124.8464 |
Aromaticity | 0.135135135135 |
Instability index | 65.8432432432 |
Isoelectric point | 9.30364990234 |
Runs | 5 |
Runs residual | 0.0446317877511 |
Runs probability | 0.0455043278572 |
Amino acid sequence | MICKSKESSPTISFPPLFPHTHTSLFQLACKFGFPAK |
Secondary structure | LEELLLLLLLLLLLLLLLLLLLHHHHHHHHHLLLLLL |
PRMN | - |
PiMo | - |
Name | ENST00000414769.1_smProtein_443:556 |
Length | 37 |
Molecular weight | 4124.8464 |
Aromaticity | 0.135135135135 |
Instability index | 65.8432432432 |
Isoelectric point | 9.30364990234 |
Runs | 5 |
Runs residual | 0.0446317877511 |
Runs probability | 0.0455043278572 |
Amino acid sequence | MICKSKESSPTISFPPLFPHTHTSLFQLACKFGFPAK |
Secondary structure | LEELLLLLLLLLLLLLLLLLLLHHHHHHHHHLLLLLL |
PRMN | - |
PiMo | - |