NONHSAT087920

From LncRNAWiki
Jump to: navigation, search

GHRLOS, which gives rise to endogenous ghrelin natural antisense transcripts. exhibits features which are common to many non-coding RNA genes, including extensive splicing, lack of significant and conserved open reading frames, differential expression and lack of conservation in vertebrates.

Annotated Information

Name

GHRLOS:ghrelin opposite strand/antisense RNA(HGNC nomenclature)

"GHRL antisense RNA 1 (non-protein coding)", GHRL-AS1, NCRNA00068, "non-protein coding RNA 68"[1]

Characteristics

Mapping of GHRLOS transcript 5' and 3' ends.[1]

GHRLOS transcripts that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene(GHRL) initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). The 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important inprotein transport. GHRLOS that is riddled with stop codons is little nucleotide and amino-acid sequence conservation between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons[1].

Function

GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames[1]. GHRLOS natural antisense transcripts, functioning as a non-coding RNA,may regulate one or both of these genes, because GHRLOS overlaps the terminal exons of both GHRL andSEC13-T[1]

Expression

We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. For non-quantitative RT-PCR analysis of GHRLOS splicing, RT-PCRs were performed with a forward primer in a region common to the 5' terminal exon Ia/b and a reverse primer in the 3' terminal exon 4 of GHRLOS[1].

Ito4-F CATGGAAGTCTCCAAGCCTG[1]
Ito4-R CTGCTCTACTGCCTCAATGTC[1]

To detect long, chimaeric transcripts, we employed RT-PCR with a forward primer in exon 2 of TATDN2(ChiOut-F) and a reverse primer in exon 1 of GHRLOS[1].

ChiOut-F TGAAAGCCCAGAAGGAGGA[1]
ChiOut-R TCTAAGTTTAGAAGCGCTCATCTG[1]

To allow strand-specific and RNA-specific amplification of GHRLOS transcripts, reverse transcription was performed using a gene-specific primer in exon 4 with a linker (LK) sequence attached to the 5' end of the primer[1].

GHRLOS-Real-F CATTGAGGCAGTAGAGCAGTTGA[1]
LK CGACTGGAGCACGAGGACACTGA[1]

To detect sense GHRL transcripts,a strand-specific RT-PCR approach was employed, with a reverse transcription primer spanning the 3' terminal exon 4 of the ghrelin gene (GHRLex4_RT_LK) followed by PCR with an exon 4 specific forward primer (GHRLex4_F) and a linker-specific reverse primer(LK)[1].

GHRL-Real-RT-LK CGACTGGAGCACGAGGACACTGAGCCAGAGAGCGCTTCTAAACTTA[1]
GHRL-Real-F GCCCCAGCCGACAAGTG[1]
LK CGACTGGAGCACGAGGACACTGA[1]

Labs working on this lncRNA

Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove, Queensland, Australia[1].

References

  1. 1.00 1.01 1.02 1.03 1.04 1.05 1.06 1.07 1.08 1.09 1.10 1.11 1.12 1.13 1.14 1.15 1.16 1.17 1.18 Seim I, Carter SL, Herington AC, Chopin LK. Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene[J]. BMC molecular biology. 2008,9:95.

Basic Information

Transcript ID

NONHSAT087920

Source

NONCODE4.0

Same with

GHRLOS, GHRL-AS1, NCRNA00068

Classification

antisense

Length

1765 nt

Genomic location

chr3+:10322636..10335133

Exon number

5

Exons

10322636..10322686,10327432..10327537,10329098..10329165,10329629..10329786,10333752..10335133

Genome context

Sequence
000001 ACATGGAAGT CTCCAAGCCT GTGCCATCCA CCTGCCAAGG AAAAGCACAA GCATACAGTT TGAACATTTA TTCGCCTCCT 000080
000081 GAGCTTGTAC AACAGTCGTG GGAGTTGCTG CAGAAGCAAG CGAAAAGCCA GATGAGCGCT TCTAAACTTA GAGAGAGGGA 000160
000161 GAGCGCCTCA TCTCTTCCAT TTTCCAGGTA TTCCTGAGAT GATTTATTGG AGCTCAAAGC TTTGGGAGAG TTGGGGCCTT 000240
000241 CCATTCCCTC CAGTAAATAC TTGTTTTTCT TCCACCGCTG AGGCAAATGC GGGGTGGCTG ATCACCTGGC AGACATCTTA 000320
000321 GGAAACAGGA GCACCGGTCT GGGAAACTGC TGGCCTGGCC TAACACCTGG CGCTGTGGTG CAGGAGGGCA GCCACTGAGG 000400
000401 ATTCTTGAGG AGAGAGAGac catagaatga atgaaagagt tggaaagact ttaaagctct gggaggctga atcctttatt 000480
000481 tcgctggaag aaaaactgaa tgccaagagg gcctcacttt ccccaaagct atacagccat caggggcaat gctgggattc 000560
000561 ccccatggat ctcttgactc ctaatacagt gctctttctg atatgccatc tgGCTCCATA ATGAACATTG TGTTCCAGGA 000640
000641 AAATCAGTCT GCGCTGAACA GGATGGAATT GGGGCAGGAC GTCCAGTGAG GAGGACATTG AGGCAGTAGA GCAGTTGATT 000720
000721 GCCGAATGAC CACCTACCCT GACTTAAAAG ATCAACCTCA GGGAGGATTG GAGCTTTCTA GAGTCTCTTG GGACAGCAGA 000800
000801 AGCACAGGCC GGGTTGGACT GAATCTTTAA GTGGACATGA GGGACAAAGT ACCTCCTGTT GGTAAACATC TCACCACCAA 000880
000881 CCCACTGTGG TGGCTAAAAT CCCACCTTTA GTCCCAGCAA CATATGAGCA TGTCACCAGG AGGTCTTCAC AGGCCTGTCT 000960
000961 GCCACCTGAG TGTAGACATC TTTTGGCCCT GGAGCCCAGA GAGGCTGAAT GTGGAGAGGG TGGAGAGAGG TCTGTAGTCC 001040
001041 TCAGAGAAGA CTTGCAGCTT TTTCAGAGCC ACCAACCCAT AAAAAAAAAA ATCCCCAAAC AGAAAAATCC TAACTGTGGT 001120
001121 GACCAGGTAC CTCCTGAGAC ATGAAGCCTC CACTTACCTG GACCCTGGAG GCCTCTCCGG GCACAGCTGC CAATGTTGAT 001200
001201 CTTAGATAAG ACCACCAGCA AGTAAACATC CACTGTGCAA AGCTGTGTTA TCTTTGGGGA ACTGAAATGT CCCCTGGGAG 001280
001281 TTGGAAACTC CCCTAGCCAC ATACCACAGA GTTGAGGAAG GAAGAGCTGA TGGACGGAAG AACCATGGCG GGAGGAGTCA 001360
001361 TCCGGAAGCT ACCTCGCTGC CCTGTCAGTT ACGGAACAGA GGAGAGATGC CGGCTGGAGG ACACAGCAAA TTTGAACCAA 001440
001441 GAGGAGCTTG GAGGAAGCCC GAGCGACCTG GAGGGGACTG GCTGACCTTC CTCATTCTTT TCAAGTGTGA ATAATAACCA 001520
001521 AGCCCAGTTT GGCAACTCCT TGAGGGTGAG GACGAAGCCC CATTCTCCTT TTTGGAACTT GGTGGGGCTC AGGAAGCAGG 001600
001601 TTCTCTCCAG TCGGTGGCTT TCCTTTCTGT TGCGGGTCTC TTGAGGGCCT GCCTTCATGA AGGCACATGA GTGACTCATC 001680
001681 ATTTGTGAAT TAATTGCTAT ATGTGAAGGG CATCTGAGAA CAAATTATCT TCATAGACTT TTCATTATAA TTTTATTTGT 001760
001761 ACTGA
[back to top]

Predicted Small Protein

Name NONHSAT087920_smProtein_2:196
Length 65
Molecular weight 7193.0709
Aromaticity 0.078125
Instability index 78.734375
Isoelectric point 9.30072021484
Runs 7
Runs residual 0.03921875
Runs probability 0.0353042117748
Amino acid sequence MEVSKPVPSTCQGKAQAYSLNIYSPPELVQQSWELLQKQAKSQMSASKLRERESASSLPF
SRYS
Secondary structure LLLLLLLLLLLLLLLEEEELLLLLLHHHHHHHHHHHHHHHHHLLLHHHHHHLLLLLLLLL
LLLL
PRMN -
PiMo -