Difference between revisions of "NONHSAT130421"
Chunlei Yu (talk | contribs) |
|||
Line 38: | Line 38: | ||
==References== | ==References== | ||
− | + | [http://www.lncrnadb.org/ANRIL/ Annotation original sourced from lncRNAdb.] | |
{{basic| | {{basic| |
Revision as of 05:41, 26 August 2015
Please input one-sentence summary here.
Contents
Annotated Information
Name
ANRIL: Antisense noncoding RNA in the INK4 locus;. CDKN2B-AS1: CDKN2B antisense RNA 1;. P15AS: P15 antisense RNA
Characteristics
Present in multiple splicing isoforms (Folkersen (2009)) (such as the CDK2BAS RefSeq sequence NR_003529.3 of ~3.9kb). Also detected as an unspliced transcript of 34.8 kb (Yu (2008)), specified as p15AS, but which seems to correspond to an isoform with the first intron of CDK2BAS retained. Antisense to the tumor suppressor gene p15/CDKN2A. (Yu (2008)). Traverses a noncoding region centromeric to CDKN2A, a region containing the INK4b/ARF/INK4a locus and implicated in a range of complex diseases including cancer (such as melanoma-neural system tumor syndrome), type 2 diabetes, periodontitis and coronary heart disease (see extensive reference list in Literature below, and Popov (2010) for a review). Transcript found to be quite unstable with a half-life >4 hr in human Hela cells (Tani (2012)).
Function
In vitro over-expression of p15as/ANRIL RNA was originally demonstrated to induce epigenetic silencing of p15 gene expression (Yu (2008)). Recently, ANRIL RNA was found to interact and regulate chromobox 7 (CBX7), a component of the Polycomb Repressor Complex 1 (PRC1), which methylates histone H3 lysine 27 to promote epigenetic silencing, and is also up-regulated in prostate cancer. Interaction occurs with a conserved chromodomain of CBX7 and is essential for the ability of PRC1 to repress the INK4b/ARF/INK4a locus and control cell senescence. (Yap (2010))
Expression
Expressed in tissues and cell types affected by atherosclerosis. Increased levels of ANRIL are found in patients carrying the atherosclerosis risk haplotype expression, including in peripheral blood mononuclear cells, whole blood and atherosclerotic plaque tissue, and the levels were directly correlated with the severity of atherosclerosis (Holdt (2010)). Up-regulated in prostate cancer (Yap (2010)).
Conservation
Please input conservation information here.
Misc
Please input misc information here.
Transcriptomic Nomeclature
Please input transcriptomic nomeclature information here.
Regulation
Please input regulation information here.
Allelic Information and Variation
Please input allelic information and variation information here.
Evolution
Please input evolution information here.
You can also add sub-section(s) at will.
Labs working on this lncRNA
Please input related labs here.
References
Annotation original sourced from lncRNAdb.
Basic Information
Transcript ID |
NONHSAT130421 |
Source |
NONCODE4.0 |
Same with |
, |
Classification |
intergenic |
Length |
3857 nt |
Genomic location |
chr9+:21994790..22121096 |
Exon number |
19 |
Exons |
21994790..21995160,22029432..22029593,22032673..22032985,22046316..22046448,22046750..22046899,22049105..22049227,22056251..22056386,22058358..22059053,22061952..22062025,22063943..22064017,22065661..22065756,22066234..22066352,22096371..22096513,22097257..22097363,22112319..22112394,22113665..22113798,22118643..22118766,22120199..22120409,22120503..22121096 |
Genome context |
|
Sequence |
000001 AGCTACATCC GTCACCTGAC ACGGCCCTAC CAGGAACAGC CGCGCTCCCG CGGATTCTGG TGCTGCTCGC GTCCCCGCTC 000080
000081 CCCTATTCCC CTTATTTTAT TCCTGGCTCC CCTCGTCGAA AGTCTTCCAT TCTTCAAACT AGATTATTTA AAAATGAAAA 000160 000161 AGGAAGAAAG GAAAGCGAGG TCATCTCATT GCTCTATCCG CCAATCAGGA GGCTGAATGT CAGTTTTGAA CTAAAAGCCG 000240 000241 CTCCGCTCCT CTTCTAGATT TGGAAAACAA GCGAAATTAA ACTAAACCGC TGCACGCCTC TGACGCGACA TCTGGACACG 000320 000321 GCGCGGCGCT GGCGCTGCCG GAGCTGTCGA CCCGGCCTGG CGCCGGACTA GGACTATTTG CCACGACATT TCAAAGGATT 000400 000401 CCAAGAGAGA ATATTGGTGT CCATGCTGTG ATGATTCCTC AGCTCCTCTC ATCTGATCTC CGTCCTGGCC CCCATGACTT 000480 000481 TCTTTGTGGT AGTTAGGGTG TGGTATGTGC CACTGAGGCC CACACCTATT GCTGCAATTT ATAGCACTGA TCTGTCATCA 000560 000561 ATACCACTTG CTGTCTTGGA TGTGAAGATG ATTTTTCCTG CAGGGATTCC CTCTACAAAA TTAAAAACAC TGGGCATGTG 000640 000641 GAAATAATAT TCATGCTTTA AATTGTCTTT TCTCTTCACT ACACCAGGGG TCCCCAACCC CTAGGCCACA GACTGTGGCC 000720 000721 CTAGTGTAGT GAATAGAAAA GACAATTTAA AGCGTGAATA TTATTTCCTC ATGCCCAGTG TTTTTAATTT TGTACTGGTC 000800 000801 TGTGGCTTGT TAGAAACCAG GCTGCACAGC AGAAGGTGGG CAGCAGATAT TGAGAAACCA CAGAAAGAGA GAAGTTAAAT 000880 000881 AATTTTCCTG TCAAAGACCA CATCAATGAT GAAGCCAGAA TTTGAGCTCA TGTACTTAAC CACTGGACTA CCTGCCTGCC 000960 000961 CTGTCGAGGA ACAGCTAAGT GTCCCTTTTG ATGAGAAGAA TAAGCCTCAT TCTGATTCAA CAGCAGAGAT CAAAGAAAAG 001040 001041 ACTTCTGTTT TCTGGCCACC AGATATATGT TATCTGTGCT TAAAGAATTG AAAAACACAC ATCAAAGGAG AATTTTCTTG 001120 001121 GAAAGAGAGG GTTCAAGCAT CACTGTTAGG TGTGCTGGAA TCCTTTCCCG AGTCAGTACT GCTTTCTAGA AGAAAACCGG 001200 001201 GGAGATCTAT TTGGAATGTA TCTAACTCCA AAGAAACCAT CAGAGGTAAC AGTAGAGACG GGGTTTCACC ATGTTGGCCA 001280 001281 GACTGGTCTT GAACTCTCGA CCTCGTGATT CGCCCGCCTC GGCCTCCCAA AGTGCTGGGA TTACAGGTGT GAGACACCAC 001360 001361 ACCCGGCGGA TAGAGAGAAT TTTGACAGTC TCTCCAATGA ACGCCTTCAC TGATATCCAA AGCATGAAGG ACACACCAGG 001440 001441 GAAAAACATA GACCTAACAC AGGACAAATG GAATTATTAG AAACATTTTC TAGCAGAAGA ACACTATTCT GTTGCCATTT 001520 001521 GAATCTTTGC TTCTTTCTAG GTTTGACAAT GAGCCTATCA TATAAGCCCA AATGTAAACA GAAAGAGGTT GAATCAGTCA 001600 001601 CGATAAGCCC AATTATGCTG TGGTAACAAA CAACCTCAAA ATCTCATTGG CTTAAAATAT ACAGAATTAT TCTTACTCAT 001680 001681 GGCACATATC CATCTATCAT CTGCAGGGGA TCTGCTCACT GAAGTCACTT AGGAACTTGG ACTGATGGAA CGGCCACTTT 001760 001761 TTGGTCACTA TATGTATTAA TCTGTTTTAA TCCTGCTGAT AAAGACCCAA AATTGGGAAC AAAAAGAAGT TTAACTAGAC 001840 001841 TTACAGTTCC GCATGGCTGA GGAGGCCTCA GAATCATGGT GGGAGGCGAA AGGCACTTCT TACATGGTGG CAGCAAGAGA 001920 001921 AAAATGAGGA AGAAGCAAAA GCGGAAACCT CTGATAAACC CATCAGATCT TATGAGACTT ATTCCACTAT CAAGAGAATA 002000 002001 GCATGGGAAA GACTGGCTCC CATAATTTAC CTCCCTCTGG GTCCCTCCCT CAACATGTGG GAATTCTGGG AGAAACAATT 002080 002081 CAAGATATGA CACATTCATA ATTTAAACAG AAGCCTACGA AGAACTCATA AATTAAAAGA AGATAATCTT TTCACAAGGT 002160 002161 GATGGAGGCT TTTTATTTTG CCACAAAACC ACTGGTGACG TTGCCTGTGG CCACCTTGGA GAAGACACTG GAGGCCTGGG 002240 002241 ACATGGAGAC TGCTTTTCTG CAGAAACCAC ATCCCTTGGA GTAATGAGCT ACACCTACCT CAATTATTCA GTGCAGTACA 002320 002321 ACACTCCAGA CAGGGTCTCA CTCTGTCACC CAGGCTGGAG TGTATTGGCA TGATTACAGC TCACTGCAAC CTTGAACTCC 002400 002401 CAGGCTCAAA CCTGAGCAGC TGGGACTACA GATGCACCAC CATGCATGGT ACCAGAGATA TAATAATGAG AAACAGACAT 002480 002481 GCTCCCTCCC CTCATTGAGG TTACAGCTTA GTGTGGAGAC ACACAGATGC CTAACGCACT ATGGTATGGA AGGTGCTATG 002560 002561 GACACAGTGC TCAAATCCAT GATCTACATA GGTGGAGAAC TTCAGTAGAG GAAGTGGCAG GAATTTGGGA ATGAGGAGCA 002640 002641 CAGTGATTAA ACTGGGGCCA TTCATATGAG AGTTTAAGAA CTCAGACCAG TGACTTAGAT TGGCTTCTCT CACATGGCAA 002720 002721 GAAACATTGC TGCTAGCACT TCCCGAGTTC TACGTTCTAC AACATCCACC ACTGGATCTT AACATAGACG TAAGATCAAA 002800 002801 TGCAATAGCA TGTCAAACAA TGTGTAACTC CAGTTATACA AACATTACTG TATCTCATTG GGGATACGAA GCTCTACACA 002880 002881 CTTGAAGATG GTGAAGGAAT ATAAAAATCT ATGTCTCACA GTCCAGACTT GGAGTACAAG TAATAAGAAG AATAAAACTT 002960 002961 AATCCCTTAA GTAGATTCAC CATAAGTTAG CTCAGAGCAA TTCCAGTGCA AGTATGGTCT GTGATCCAGT AGTATCTTAC 003040 003041 AGACAGCAAG TTGAACATTG TGGGATGCAT GAGCTATTGA GGCCTTTGCA GCTTTCTGCT ACATGGAGGC TAGGGCCAGA 003120 003121 GTCAAGATTT ATGCTTTGCA GCACACTGGT CAGCTGTTTT TGCAAATCAG ATTAAATGAT TTTTAAATGA GGCTGAGAGC 003200 003201 ATGGGAGATA CTAATGTGTG TTTCCTTGTG AGCTACTGCA TAAGTTAGGA AATTGAAATA CAGAAAGATG AAAAGTGATT 003280 003281 TGCCCAAGCA TATAGATCAA AGCTGTGGCA GAACCAGGAC TGGAACCTAT ATCTCTCTAC TAATGGTTTT TTTAAAAAAA 003360 003361 TAACCTTGTT TCAAAAATAT TAAAAAGTCA CAAGAAAGGT AAACATGTGG ATAAACAAAA TGAAGAAAAT AAAAATTATC 003440 003441 CAGTAATAAC ATATTGGCAT ATGTCTTTCT GGTATATTTT CCTGTGTTGT CATCATTATC ATCTCCATCA TCATTATATC 003520 003521 CATCATTATC ATCATCATCA TCATCATCAT CATCATTATC ATCACCATAG TGAACATGTA ATGCTTACCT AGTGCCAGAT 003600 003601 GCTGTCTAGG CATTTTACAT GTGTTACTGG TAACTCATGT AATCCTCATA ACAACCTTAT AAGGTGGTTG CTATTATCCC 003680 003681 CATGTTACAT ATGAAGAGAC AGAAGCATAA AGAAGTTGCA CCGCTGGTAA TTGGCTGGGA TTTGAACTTA AGCAGTCTAA 003760 003761 CCTTAGAGTA ATGATTTTAA CAACTATGCT ATATACATAC AAATTTACAA AATAAAACTG GGCTCAGACA ATAAAAAAAA 003840 003841 AAAAAAAAAA AAAAAAA |
Predicted Small Protein
Name | NONHSAT130421_smProtein_905:1090 |
Length | 62 |
Molecular weight | 7021.0695 |
Aromaticity | 0.0983606557377 |
Instability index | 61.3950819672 |
Isoelectric point | 4.71600341797 |
Runs | 9 |
Runs residual | 0.00527605049934 |
Runs probability | 0.0399026477459 |
Amino acid sequence | MMKPEFELMYLTTGLPACPVEEQLSVPFDEKNKPHSDSTAEIKEKTSVFWPPDICYLCLK N |
Secondary structure | LLLLLEEEEELLLLLLLLLLLLLLLLLLLLLLLLLLLLHHHHHHLLEEELLLLEEEEEEL L |
PRMN | - |
PiMo | - |