NONHSAT087920
GHRLOS, which gives rise to endogenous ghrelin natural antisense transcripts. exhibits features which are common to many non-coding RNA genes, including extensive splicing, lack of significant and conserved open reading frames, differential expression and lack of conservation in vertebrates.
Contents
Annotated Information
Name
GHRLOS:ghrelin opposite strand/antisense RNA(HGNC nomenclature)
"GHRL antisense RNA 1 (non-protein coding)", GHRL-AS1, NCRNA00068, "non-protein coding RNA 68"[1]
Characteristics
GHRLOS transcripts that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene(GHRL) initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). The 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important inprotein transport. GHRLOS that is riddled with stop codons is little nucleotide and amino-acid sequence conservation between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons[1].
Function
GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames[1]. GHRLOS natural antisense transcripts, functioning as a non-coding RNA,may regulate one or both of these genes, because GHRLOS overlaps the terminal exons of both GHRL andSEC13-T[1]
Expression
We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. For non-quantitative RT-PCR analysis of GHRLOS splicing, RT-PCRs were performed with a forward primer in a region common to the 5' terminal exon Ia/b and a reverse primer in the 3' terminal exon 4 of GHRLOS[1].
Ito4-F | CATGGAAGTCTCCAAGCCTG[1] |
Ito4-R | CTGCTCTACTGCCTCAATGTC[1] |
To detect long, chimaeric transcripts, we employed RT-PCR with a forward primer in exon 2 of TATDN2(ChiOut-F) and a reverse primer in exon 1 of GHRLOS[1].
ChiOut-F | TGAAAGCCCAGAAGGAGGA[1] |
ChiOut-R | TCTAAGTTTAGAAGCGCTCATCTG[1] |
To allow strand-specific and RNA-specific amplification of GHRLOS transcripts, reverse transcription was performed using a gene-specific primer in exon 4 with a linker (LK) sequence attached to the 5' end of the primer[1].
GHRLOS-Real-F | CATTGAGGCAGTAGAGCAGTTGA[1] |
LK | CGACTGGAGCACGAGGACACTGA[1] |
To detect sense GHRL transcripts,a strand-specific RT-PCR approach was employed, with a reverse transcription primer spanning the 3' terminal exon 4 of the ghrelin gene (GHRLex4_RT_LK) followed by PCR with an exon 4 specific forward primer (GHRLex4_F) and a linker-specific reverse primer(LK)[1].
GHRL-Real-RT-LK | CGACTGGAGCACGAGGACACTGAGCCAGAGAGCGCTTCTAAACTTA[1] |
GHRL-Real-F | GCCCCAGCCGACAAGTG[1] |
LK | CGACTGGAGCACGAGGACACTGA[1] |
Labs working on this lncRNA
Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove, Queensland, Australia[1].
References
Basic Information
Transcript ID |
NONHSAT087920 |
Source |
NONCODE4.0 |
Same with |
GHRLOS, GHRL-AS1, NCRNA00068 |
Classification |
antisense |
Length |
1765 nt |
Genomic location |
chr3+:10322636..10335133 |
Exon number |
5 |
Exons |
10322636..10322686,10327432..10327537,10329098..10329165,10329629..10329786,10333752..10335133 |
Genome context |
|
Sequence |
000001 ACATGGAAGT CTCCAAGCCT GTGCCATCCA CCTGCCAAGG AAAAGCACAA GCATACAGTT TGAACATTTA TTCGCCTCCT 000080
000081 GAGCTTGTAC AACAGTCGTG GGAGTTGCTG CAGAAGCAAG CGAAAAGCCA GATGAGCGCT TCTAAACTTA GAGAGAGGGA 000160 000161 GAGCGCCTCA TCTCTTCCAT TTTCCAGGTA TTCCTGAGAT GATTTATTGG AGCTCAAAGC TTTGGGAGAG TTGGGGCCTT 000240 000241 CCATTCCCTC CAGTAAATAC TTGTTTTTCT TCCACCGCTG AGGCAAATGC GGGGTGGCTG ATCACCTGGC AGACATCTTA 000320 000321 GGAAACAGGA GCACCGGTCT GGGAAACTGC TGGCCTGGCC TAACACCTGG CGCTGTGGTG CAGGAGGGCA GCCACTGAGG 000400 000401 ATTCTTGAGG AGAGAGAGac catagaatga atgaaagagt tggaaagact ttaaagctct gggaggctga atcctttatt 000480 000481 tcgctggaag aaaaactgaa tgccaagagg gcctcacttt ccccaaagct atacagccat caggggcaat gctgggattc 000560 000561 ccccatggat ctcttgactc ctaatacagt gctctttctg atatgccatc tgGCTCCATA ATGAACATTG TGTTCCAGGA 000640 000641 AAATCAGTCT GCGCTGAACA GGATGGAATT GGGGCAGGAC GTCCAGTGAG GAGGACATTG AGGCAGTAGA GCAGTTGATT 000720 000721 GCCGAATGAC CACCTACCCT GACTTAAAAG ATCAACCTCA GGGAGGATTG GAGCTTTCTA GAGTCTCTTG GGACAGCAGA 000800 000801 AGCACAGGCC GGGTTGGACT GAATCTTTAA GTGGACATGA GGGACAAAGT ACCTCCTGTT GGTAAACATC TCACCACCAA 000880 000881 CCCACTGTGG TGGCTAAAAT CCCACCTTTA GTCCCAGCAA CATATGAGCA TGTCACCAGG AGGTCTTCAC AGGCCTGTCT 000960 000961 GCCACCTGAG TGTAGACATC TTTTGGCCCT GGAGCCCAGA GAGGCTGAAT GTGGAGAGGG TGGAGAGAGG TCTGTAGTCC 001040 001041 TCAGAGAAGA CTTGCAGCTT TTTCAGAGCC ACCAACCCAT AAAAAAAAAA ATCCCCAAAC AGAAAAATCC TAACTGTGGT 001120 001121 GACCAGGTAC CTCCTGAGAC ATGAAGCCTC CACTTACCTG GACCCTGGAG GCCTCTCCGG GCACAGCTGC CAATGTTGAT 001200 001201 CTTAGATAAG ACCACCAGCA AGTAAACATC CACTGTGCAA AGCTGTGTTA TCTTTGGGGA ACTGAAATGT CCCCTGGGAG 001280 001281 TTGGAAACTC CCCTAGCCAC ATACCACAGA GTTGAGGAAG GAAGAGCTGA TGGACGGAAG AACCATGGCG GGAGGAGTCA 001360 001361 TCCGGAAGCT ACCTCGCTGC CCTGTCAGTT ACGGAACAGA GGAGAGATGC CGGCTGGAGG ACACAGCAAA TTTGAACCAA 001440 001441 GAGGAGCTTG GAGGAAGCCC GAGCGACCTG GAGGGGACTG GCTGACCTTC CTCATTCTTT TCAAGTGTGA ATAATAACCA 001520 001521 AGCCCAGTTT GGCAACTCCT TGAGGGTGAG GACGAAGCCC CATTCTCCTT TTTGGAACTT GGTGGGGCTC AGGAAGCAGG 001600 001601 TTCTCTCCAG TCGGTGGCTT TCCTTTCTGT TGCGGGTCTC TTGAGGGCCT GCCTTCATGA AGGCACATGA GTGACTCATC 001680 001681 ATTTGTGAAT TAATTGCTAT ATGTGAAGGG CATCTGAGAA CAAATTATCT TCATAGACTT TTCATTATAA TTTTATTTGT 001760 001761 ACTGA |
Predicted Small Protein
Name | NONHSAT087920_smProtein_2:196 |
Length | 65 |
Molecular weight | 7193.0709 |
Aromaticity | 0.078125 |
Instability index | 78.734375 |
Isoelectric point | 9.30072021484 |
Runs | 7 |
Runs residual | 0.03921875 |
Runs probability | 0.0353042117748 |
Amino acid sequence | MEVSKPVPSTCQGKAQAYSLNIYSPPELVQQSWELLQKQAKSQMSASKLRERESASSLPF SRYS |
Secondary structure | LLLLLLLLLLLLLLLEEEELLLLLLHHHHHHHHHHHHHHHHHLLLHHHHHHLLLLLLLLL LLLL |
PRMN | - |
PiMo | - |