LINC00470
Annotated Information
Name
Approved symbol: LINC00470
Approved name: long intergenic non-protein coding RNA 470
HGNC ID: HGNC:1225
Previous names: chromosome 18 open reading frame 2
Previous symbols: C18orf2
RefSeq ID: NR_023925
Ensembl ID: ENSG00000132204
LncBook ID: HSALNT0246856
Characteristics
LINC00470 (also known as C18orf2) is a long non-coding RNA located in chromosome band 18p11.32, between RP11-16P11 and RP11-732L14. Alternative splicing of its seven exons generates four transcripts[1].
Expression
LINC00470 expression in astrocytoma is significantly higher than in normal brain tissue, and the transcript is highly expressed in glioblastoma (GBM)[1].
Regulation
Please input regulation here.
Function
7SK snRNA functions in transcriptional regulation by interacting with the P-TEFb complex ((Nguyen 2001)) ((Yang 2001)), the BAF chromatin-remodeling complex ((Flynn 2016)), or hnRNP R ((Briese 2018)). Consistent with this, it is highly enriched in isolated chromatin fractions, which may be related to its role in transcriptional regulation ((Mondal 2010)). In addition to its critical role in controlling transcription, 7SK snRNA is involved in alternative splicing ((Barboric 2009)) and in the localization of protein to the nucleolus ((He 2007)). 7SK snRNA therefore has a variety of functions in the nucleus, playing important roles in cell growth and differentiation ((Nguyen 2001)) ((Yang 2001)), axon maintenance ((Briese 2018)) and vertebrate development ((Barboric 2009)).
7SK snRNA controls RNAP II activity by inhibiting P-TEFb elongation factor, which is a cdk-cyclin kinase that functions as both a general and an HIV-1 Tat-specific transcription factor ((Nguyen 2001)) ((Yang 2001)), with an impact on cell growth and differentiation. Specifically, 7SK snRNA functions as the central scaffold that coordinates protein-protein interactions and, by inhibiting P-TEFb kinase-mediated CTD phosphorylation, regulates RNAP II elongation ((Nguyen 2001)).
At an early stage of the HIV transcription cycle, elongation is prevented as P-TEFb is recruited to the HIV-1 promoter in a catalytically inactive state bound to the 7SK snRNP and also the Tat trans-activator of transcription protein. The inhibitory 7SK snRNP may be displaced by the nascent TAR HIV RNA that also binds Tat protein, activating P-TEFb kinase and transcriptional elongation ((D'Orso 2010)). Displacement of 7SK may also be performed by cellular RNAs, as indicated by the 3'-untranslated region (~300-nt) of HIC mRNA, which forms complexes with P-TEFb and is necessary and sufficient for stimulation of P-TEFb-dependent transcription of the HIV promoter ((Young 2007)).
7SK snRNA inhibits enhancer transcription by modulating nucleosome position. 7SK physically interacts with the BAF chromatin-remodeling complex, recruits BAF to enhancers and inhibits enhancer transcription by modulating chromatin structure ((Flynn 2016)).
In axons, 7SK snRNA interacts with hnRNP R to regulate its function in axon maintenance ((Briese 2018)).
7SK snRNP (composed of 7SK snRNA, Hexim1, Larp7/Pip7S, and the P-TEFb subunits CycT1 and Cdk9) is not only critical for controlling transcription, but also for regulating alternative splicing coupled to transcription elongation ((Barboric 2009)). 7SK snRNP disintegration promotes inclusion of an alternative exon via the increased occupancy of P-TEFb, Ser2-phosphorylated (Ser2-P) RNAPII, and the splicing factor SF2/ASF at the minigene ((Barboric 2009)).
7SK snRNA also inhibits APOBEC3C deaminase activity and sequesters it in the nucleolus, suggesting a broader role for 7SK RNA in regulating key nuclear functions ((He 2007)).
Disease
colon adenocarcinoma [1]
Evolution
Please input evolution information here.
Labs working on this lncRNA
- Hunan Provincial Tumor Hospital and the Affiliated Tumor Hospital of Xiangya Medical School, Central South University, Changsha, 410006, Hunan, China.
- Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, 410078, Hunan, China.
References
Sequence
>gi|193083199|ref|NR_023925.1| Homo sapiens long intergenic non-protein coding RNA 470 (LINC00470), transcript variant 1, long non-coding RNA
000081 TCAGCTGTTC TCTAGGTCTT AAGTCTGCTG CTCTAACCTA CAAATTTTGG ACATGCCAAT CTCCACAGTC ACATGGGCCA 000160
000161 ATTCCTTAGT TTCTCTGGAG AATCCTGACT AATGTATGCA ATGTCTTTAT TTCATCCTCA TTCTTAAAGG ATATTTTCAT 000240
000241 CAGCAACACC ACACCCTCCT GATCACCATA GCTTTATGAT CTGAGCTTTC ATCTGGTATC ATTTCCTTTC CGTCTGAAGA 000320
000321 ACTTGCTTTC CTTGCAACAA CAAGAAACAA ATTTATTAGC TAACCTAACC ACTAATGACG CAAGAGACAA TTCTAAGGAC 000400
000401 TTTCAAAACA GCAAAGTAGG AGCAGCTGCT ACCTCTAGGG ATGAGGGAGA AACTCAAAAA TGCATACAAA AACCATTGCA 000480
000481 TGAAAAGTGA CTGGATTTGT ACATAGCGTC AGGAGATGTG CGTAGTGTCA AAGTATCTCA TCACACATTA CTTGATAATT 000560
000561 ACAAATGAGA AAAATGAACC TTCACAGTGG CAAGACTTGA CTTCTACCTC TTTCAAAAAG ATGCAATTGT CCAATTATTG 000640
000641 GTGAAATTGT CATTTCATGC TATTGGCTAT TTGAAATTCC TCCTCTAATT TCAGAATAAA TCACTGAAAT TGACATCGGC 000720
000721 CAGTCTGAAT TTCAAGAAAT TACCTGCTGA AGACAAGAGG GATCTCTTCT TCAGATTTGC AGTCTGGGGA AGACACAGCC 000800
000801 TCTACTGTAC TTTAGAACCT GAGATATGGT GGTGGAGGGA GCCCTGGGTC GAGTGGTAAG ATTCACCCTT AGGTTAGTAT 000880
000881 TGACGTAAGG TGACGAGGAG CTGTAGACAA AAGATTGTAA CCATAAGAAC TTCATAGTTT TTGTATTTTC ACCGAGCTTA 000960
000961 TATTTGGTGT GTTTTTTGTC TTTTCTTTAT GATTATCAAT AAAATGCTTG AAAGGAGATG AGGTTGGGGA ATAATTTTTG 001040
001041 GGAATACCAC AAAAGACACT TTTGTGATGG AAATCCTTAA AAAGACACAA TCCATTACCT CATTGGGTTC AAAAGGCAAT 001120
001121 TGTGAACTAC TGTGGAGTTT GGAAAGAAGC AATGAGGTAA TCAAGGATAC TGTTGACAAT CTAGCTTATC CTATGGATGG 001200
001201 AAGGAAATTG AAACTAATGG AGGCGAGGCT GGTAAAACAA TAGGGTTTGA GACAATTCTG TGGCATTAGA AATGAAAGAG 001280
001281 GAAGGTGAAT CCGGGAGACG GAGCTTGCAG TGAGCTGAGA TCGCGCCACT GCACTCCAGC CTGGGCGACA GAGCGAGACT 001360
001361 CCATCTCAAA AAAAAAAAAA AAGAAAGAAA GAAAAGAAAG AGGAAGTAAT AATCTGTGAA ATTTTTCCTT AGGAACTTAT 001440
001441 TGGCAATTTA AAAATGAATT TGTTAAGCCA TGCTGGTTCT GACCCAAAAG CCATTCCCCA GCCTTCCTCA CTCCCCTCTT 001520
001521 TCACTACTGG CAGAGATTGT CTCTCATTTT ACAAGCTGAA AATGCCAGAT GCTTGCTTTT ACAGTCTTCC TTACACCCAG 001600
001601 AGCATGTGCA TATGTTTAAA CGGTCAAGAA GAAGTCATAA CATGGGTGTC TGGGAGAGCT TTTATCCCAC AAAAACAACA 001680
001681 CTTCACTCAA AAAACAAACC AAACAAAGAA AAATTCTCCT TCCTGCCATT GGATGTGAGG CTCAGACCTC TAGTAACCAT 001760
001761 TTTGTGACCA CAAAGCAACA AGCCTGAGGA AAAGTCCTAC ACGCTGAGCA ACAGGCAGAA ATATTGCCAT CGCTGAGTTG 001840
001841 CAGAAACAAA TCTAGAGATG TTTTGCTTCT GTAATTATTT TTTATGGGAG ATTACAAGTG GGTTTACTGT TCACTTTTCA 001920
001921 AATCTTATTT CTCTATGATG TTTAGCTTGG GTAAATTTTA CCTTAAATCC ACTTTTTTAT GTAAGGTAAC ATATTTGTCG 002000
002001 GTTTCAAGGA TTAAGATGTG GGCATACTTG GAGGCCATTA TTTTGCCCAC CACAGGTGAA AAAGGAAGTG TTATTCTTAA 002080
002081 ATCATTTGGA AGGATCTCTG TGTAAATGCA AGAGCGAGAC AAGAAAATGC TGTCATTCTT TTGATATGGA CTCGAATTTC 002160
002161 CACTTCATGG TTGTCTGCTT CCTTTTTAGA GTATTATTTA TCCTCCTAAT AAAAAGAAAG TGAAATTTCC C
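The sequence above is laid out in numbered rows of eight 10-base blocks, so it cannot be pasted directly into most tools. The following is a minimal sketch of how the layout could be parsed back into positions. It assumes that the leading number on each row is the 1-based position of the first base in that row, and that the coordinates in the predicted small protein name (1883:2104) are 0-based and inclusive; both are assumptions, not documented conventions of this page. The helper names `parse_numbered_block` and `subseq` are hypothetical.

```python
def parse_numbered_block(lines):
    """Map 1-based position -> base for rows like '000081 ACGT... ACGT 000160'.

    Assumption: the first token of each row is the 1-based position of the
    first base; purely numeric tokens are coordinates, not sequence.
    """
    bases = {}
    for line in lines:
        tokens = line.split()
        if not tokens or not tokens[0].isdigit():
            continue  # skip headers such as the '>gi|...' line
        start = int(tokens[0])
        seq = "".join(t for t in tokens[1:] if not t.isdigit()).upper()
        for offset, base in enumerate(seq):
            bases[start + offset] = base
    return bases


def subseq(bases, start, end):
    """Return the bases from start to end (1-based, inclusive)."""
    return "".join(bases[pos] for pos in range(start, end + 1))


# Two rows copied from the sequence block above.
rows = [
    "001841 CAGAAACAAA TCTAGAGATG TTTTGCTTCT GTAATTATTT TTTATGGGAG "
    "ATTACAAGTG GGTTTACTGT TCACTTTTCA 001920",
    "002081 ATCATTTGGA AGGATCTCTG TGTAAATGCA AGAGCGAGAC AAGAAAATGC "
    "TGTCATTCTT TTGATATGGA CTCGAATTTC 002160",
]
bases = parse_numbered_block(rows)

# Under the 0-based inclusive assumption, 1883:2104 is 1884..2105 in 1-based terms.
start_codon = subseq(bases, 1884, 1886)
stop_codon = subseq(bases, 2103, 2105)
print(start_codon, stop_codon)
```

Under these assumptions the interval begins with a start codon and ends with a stop codon, and its length (222 nt) corresponds to 73 codons plus the stop, which agrees with the listed protein length.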
Predicted Small Protein
Name | LINC00470_smProtein_1883:2104 |
Length | 73 |
Molecular weight | 8461.072 |
Aromaticity | 0.191780821918 |
Instability index | 13.9932876712 |
Isoelectric point | 9.65484619141 |
Runs | 13 |
Runs residual | 0.0339372514361 |
Runs probability | 0.0446713682008 |
Amino acid sequence | MGDYKWVYCSLFKSYFSMMFSLGKFYLKSTFLCKVTYLSVSRIKMWAYLEAIILPTTGEKGSVILKSFGRISV |
Secondary structure | LLLEEEEEHHHHHHHHHHHHHHLHHHEEEEEEEEEEEEEHHHHHHHHHHHHHEELLLLLLLEEEEELLLEEEL |
PRMN | LLLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLL |
PiMo | iiiiiiiiiiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooooooooooooooooooooooooo |
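Two of the values in the table above, Length and Aromaticity, can be checked directly from the amino acid sequence. The sketch below assumes that aromaticity is defined as the relative frequency of Phe (F), Trp (W) and Tyr (Y) residues, which is a common definition; the page does not state which tool or definition produced the listed value, so this is an assumption. The function name `aromaticity` is hypothetical.

```python
AROMATIC_RESIDUES = set("FWY")  # assumption: Phe, Trp and Tyr count as aromatic


def aromaticity(protein):
    """Fraction of residues that are F, W or Y."""
    aromatic_count = sum(1 for residue in protein if residue in AROMATIC_RESIDUES)
    return aromatic_count / len(protein)


# Sequence from the table, with the wrap after residue 60 removed.
protein = (
    "MGDYKWVYCSLFKSYFSMMFSLGKFYLKSTFLCKVTYLSVSRIKMWAYLEAIILPTTGEK"
    "GSVILKSFGRISV"
)

print(len(protein))                    # 73
print(round(aromaticity(protein), 6))  # 0.191781
```

The sequence contains 14 aromatic residues out of 73, which agrees with the listed aromaticity of about 0.1918. The other properties (molecular weight, instability index, isoelectric point) depend on residue-specific parameter tables and are not reproduced here.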