ENST00000357401.3

From LncRNAWiki
Jump to: navigation, search

CYYR1-AS1 is a long transcript overlapping with CYYR1 as antisense RNA.

Annotated Information

Name

CYYR1-AS1:cysteine/tyrosine-rich 1 antisense RNA 1(HGNC nomenclature)

Characteristics

CYYR1-AS1 is on the opposite strand compared to CYYR1 and its four exons sequences are in correspondence of CYYR1 intron sequences. It was classified in GenBank as moderately similar to a transposase and was obtained by the NEDO human cDNA sequencing project from the NT2 neuronal precursor cell line. [1].

Function

The CYYR1-AS1 expression in normal human tissues studied seems to suggested that a fine regulation of the locus may occur in different kind of tissues.[1]

CYYR1-AS shows correlation with CYYR1-1,2,3,4[1].

Regulation

CYYR1-AS1 could regulate the CYYR1 locus as a putative non-coding RNA (ncRNA), classifiable as a long non-coding RNA (lncRNA, greater than 200 nucleotides in length)[1].

Expression

qRT-PCR analysis of CYYR1 and CYYR1-AS1 in different human normal tissues.[1]

CYYR1-AS1 is absent or significantly less expressed according to ANOVA analysis, heart, colon, placenta,except for brain[1].

Labs working on this lncRNA

Department for Life Quality Studies, University of Bologna, C.so d’Augusto 237, 47921 Rimini, RN, Italy[1]

References

  1. 1.0 1.1 1.2 1.3 1.4 1.5 1.6 Casadei R, Pelleri MC, Vitale L, Facchin F, Canaider S, Strippoli P, et al. Characterization of human gene locus CYYR1: a complex multi-transcript system[J]. Molecular biology reports. 2014,41(9):6025-38.

Basic Information

Transcript ID

ENST00000357401.3

Source

Gencode19

Same with

lnc-GABPA-4:2,NONHSAT081541,CYYR1-AS1

Classification

antisense

Length

3412 nt

Genomic location

chr21+:27778262..27941571

Exon number

4

Exons

27778262..27778447,27875901..27876021,27922638..27924446,27940276..27941571

Genome context

Sequence
000001 AGGTGTCGAA CCCAAGGGCT CTTCTCAGCA GAGTGTCTGC ATGCCACACT CTGCTTTAGA GTCTGTTCCC TGGGGAACAC 000080
000081 AACCAGAGAC TGAAATAATT GGTGGGCAGT GGGGAGCACA AAGGGCTGTT TTTGTTTCAT GTGAAGACGA TGAGCATGCT 000160
000161 ATGAACCCCG AAGAGCTGTT GTATAAAGTA TCTGCCTGGA AACCCTGTTG CCTGGAGACC ACTCTTGAAG AACATCCACT 000240
000241 GCTGTGGCCT CTTAAAGATT TAAGACATCT AATGAACAGC TGACAGAAAT CATCAATTTG CTCCTCAGCC TCCCTCCTCC 000320
000321 CTGAAACACA AAAACATTGA AATTAGGCCA ATTAATAACT CTACAATGGC CTCTAAGAGT TAAAATGAAA GGAAGTGTAA 000400
000401 AATGGTTTCT CACTTGAAAT CAAAAGCTAG AAATGATTAA GCTTCGTGAG GAAGGCAAGT TGAAAGCTGA GACAGCCAAA 000480
000481 AGCTGACTCC CGCACCAGGG AGCCAAGTTG TGCAAGCAAA GGAAAAGTTC TTGAAGGAAA TTAAAAGTGC TGCTCCAGTG 000560
000561 AACACATGAA TGATAAGAAA GCAAAAGTGA TCTGGGTAGA AGATCAAACC AGCCACAACA TGCCCTTAAG CCAAAGCCTA 000640
000641 ATCCAGAGCA AGGCTTTAAC TCTCTTCAAG TTTGTGAAGA CTGAGAGAGG TAAGGAAAGT GCAGAAGGAA AGTTAAATGC 000720
000721 TAGAAGAGTC GGTTCAGGAG GTTTCAGGAA AAAGCCATCT CCACAAATAA ACATACAACG CGAAGCAACA AATGCTGATG 000800
000801 TAGAAGCTGT GGCAAGTTTT CCAGAAGAGC TAGCTCAGAT CATTGATAAA CGTGGCTACA TTAAACAACA TATTTTCAGT 000880
000881 GTGGATGAAA CAGAGTCCTA TTGGAAGAAG ACACCATCTA GGACTTTCAT AGCTACAGGG AAAAGTCAAT GACTGGCTTC 000960
000961 AAAGCTTCAA AGAACAGGGT GACTCTCTTG TCAGACACTA ATACAGCTGC TGACTTGAAG TTGAAGCCAG TGCTCACTGC 001040
001041 ACATTCTGAA AACTTAAGGA TCCTTAAGAA CTGTGCTAAA TCTACTGTGC CTATGTGCAA CAAAGCCCTT ATGGCAGCAT 001120
001121 GTCTGTTTAC AGTATCATTT ACTGAATATT TGAAGCCTAC TACTGAGAAC TACTGCTCAG GAAAAAAGAT ACTTTTCAAA 001200
001201 ATAGTACTGC TCTTTGACAA TGGACCTGGT CACCCAAGAG CTCTGATGGA GGTGTGCAAG GAGATGAACG CTGTGTTCAT 001280
001281 GCCTGCTAAC ACAACACCCG TTCTGTACTC CATGGATCAC AGAGTAATTT TGACTTTCAA GTCTTATTAT TTGAGAAATA 001360
001361 AATTTTGTAA GGCTATAGCT GCCACACATA GTTATTCCTG TGATGGATCC GGGCAAAGTA AATTGAAAAC CTAGAAAAGA 001440
001441 GTCACCATTC TAGATGTCAC TAAGAACATT TGTGATTCAT GGGAGGAGGT TAAATTATCA ACATTAATAG AAGTTTGAAA 001520
001521 GAAATTTATT CCAGCCCTCA TGGATGACTT TGATGGGTTC AAGACTTTGG TAGAGGAAGT AACTGCAGGT ATAGTGATAA 001600
001601 TATCAAGGAA ATTAGAACTA AAAGTGGAGC CTGAAGATGT GACTGAATTG CTTCAGTCTC ACGATAAAAC TTGAACATAA 001680
001681 CAGCAGTTAA TTCTTACGGA TTAGCAAAAA AGTGGTTTTG TGAGATGGAA TCTATTCCTG GTGAAAATGC TGTGAACACT 001760
001761 GTGGAAATGA CAACAAAGGA TTTAGAATAT TACATAAACT TAGTTGATAA AGCAGCAGTA GGGTTTAGAA GGATTGACTC 001840
001841 CAATTTTGAA AGAAGTTCTA CGGTGGGTCA AACGCTACCG AAGAGCGTTG CACGCTATAG AGAAATCTTT CATGAAAGGA 001920
001921 GGAGTCAACT GATGTAGCAG ACTTCACTGT TGTCTTATTT TTAAAAATTT CCACAGCCAC CCCAATTTTC AGCAACCACC 002000
002001 ACCTTGATCA GTCAGCAGCC ATCAGCATCA AAGCAAGACC CTCTTCCAGC AAAAAAATTA CAACAACTTG CTGAAGGCTC 002080
002081 GGATGATTTT TAGCAACAAA CTATTTTAAA ATTAAGATTG TCAAACATGA CCCTTGAGGC TTACCTCTGG ATTGTGGTAT 002160
002161 GAAGGAATGA AAGCGAAAAA TAATTACCTT TGTGAGATTC AGTAAGTACT TAAGTCCACT TTTAAAATTT GAAAACAGAA 002240
002241 ACAAAATCTA ACGATTTAGA CACAAGGGAG AAGCCAATAT ATTGACAATA GATGCTTTTT GCAGAGTACA ACAGACTTTT 002320
002321 AAAGGCTATT TATTTTACAG TTTTCTTGGT GAATTTCCAT AGCTCTCATT TTTAGTGCTG TTTAATTTAT TCAAATATTT 002400
002401 AGACTGGTCA GTTATCCCAA GGGCTTAGTG GGGATGTTTT GCTTCATGTT CTTAAAAGCC ATTCAATGTA CGCCTACAGC 002480
002481 CATCTGATCT TTGACAAAGT CAGCAAAAAT AAGCAATGGG GAAAGGACTC CCTACTCAAT AAATGGTGTT GGATAACCAG 002560
002561 TTGGCCATAC ACAGAAGAAT GAAACTGGAC TCCTATCTTT TACCACATAC AAAAATTAAC TGAAAATGGA TTAAAGATTT 002640
002641 AAATGAAAGA CCTCAAACTA TAAGACTCCT AGAAGAAAAG CTAGGAAGCA CCGTTCCTGA CATCAGCCTT GGGAAGGAAT 002720
002721 TTATAACTAA GTCCTCAATA GCAATTGCAA CAAAAGCAAT TGACAAGCGG GATTTAATTA AACTAAAGAG CTTCTGCACA 002800
002801 GCAAAATAAA CTATCAACAG AGTAAACAAT CTACAGAATG GGAGAAAATA TGTGTAAGCT ATGCATCTGA CAAAAGCCTA 002880
002881 ATATCCAGAA TCTATAAGGA GGTTAAATAA TTGAACAAAC AAAAACCAAA TAATCTCATT AAAAAATGGG CAAAGGACAT 002960
002961 CAACCAGACA CTTCTCAAAA GAAGACATAC AAGCAGCCAA CAAACACAAC AAAAAAATGT TCAACAAGTC ACCAATCATC 003040
003041 AGAGAAATGC AAATCAAAAC AGAGGGCTAT TATTGAAAAG TCAAAAAAGC AACAGATGCT GGTGAGCCTG TGGAGAAAAG 003120
003121 GGAATACTTA TACACTGCTA TTGGAAATGT AAATTAGTTC AACCACTGTG GAAAGCAGTT GGGAGATTTC TCAAAGAACT 003200
003201 TAAATCAAAA CTACCATTTG CCTCAGTGAT CCCATTGCTG GGGTATCTAT CTAAAGGGAA ATAAATCATT CTATCAAAAA 003280
003281 GACAAATGCA GTTGTACATT CGTCACAGCA CTATTCAAAA TAGTAAAGAG ACTGATTCAT CCCAAGTGTT CATTAATAGT 003360
003361 GGACTCAGTA AAGAAAATGT GGTACATACA CACCGTGGAA TACTATGCAG CC
[back to top]

Predicted Small Protein

Name ENST00000357401.3_smProtein_1724:1936
Length 71
Molecular weight 8097.0554
Aromaticity 0.0857142857143
Instability index 42.51
Isoelectric point 6.56390380859
Runs 10
Runs residual 0.0019801980198
Runs probability 0.0251633986928
Amino acid sequence MESIPGENAVNTVEMTTKDLEYYINLVDKAAVGFRRIDSNFERSSTVGQTLPKSVARYRE
IFHERRSQLM
Secondary structure LLLLLLLLLLLEEEELHHHHHHHHHHHHHHEELEEEELLLLLLLLLLLLLLLHHHHHHHH
HHHHHHHLLL
PRMN -
PiMo -