MIR4435-2HG

From LncRNAWiki
Annotated Information

Approved Symbol

MIR4435-2HG

Approved Name

MIR4435-2 host gene

Previous Symbols

MIR4435-1HG

Synonyms

LINC00978, AGD2, AK001796

Chromosome

2q13

RefSeq ID

NR_015395

OMIM ID

-

Ensembl ID

ENSG00000172965

PubMed IDs

19531736, 25888808

Sequence

>gi|1015254011|ref|NR_015395.2| Homo sapiens MIR4435-2 host gene (MIR4435-2HG), transcript variant 1, long non-coding RNA

000001 ACAAGTATGA AGAGAATGTC GGGAGAGGAA GTGGTGGCTA TGAGTCAGCA TGAGTCATCT CGTTCCAATG AGAATGAAGG 000080
000081 CTGAGGTGTG CGCCTTTTTT TTTTTTCCTT CTTAGTCGTG TGTACATCAT TGGGAATGGA GGGAAATAAA TGACTGGATG 000160
000161 GTCGCTGCTT TTTAAGTTTC AAATTGACAT TCCAGACAAG CGGTGCCTGA GCCCGTGCCT GTCTTCAGAT CTTCACAGCA 000240
000241 CAGTTCCTGG GAAGGTGGAG CCACCAGCCT CTCCCTGACA AGCAAAGTGG ATCAGCAAAG GCTGCAGTCA CCAGCATCTT 000320
000321 TTCCAACCTT AATGAACTGT ATCCTCAAAA GAACACTATC AGACTGGTAA GACAAGCATT ATTGTCCCAC TTCACAGATG 000400
000401 AAGAGCTTGG GGGCCCTGTG GATGTGTAGG AGAGTCGGCC TTCCTTTTTA TAGGCCTTTT GCTGAGCTCA GAGGACAGAG 000480
000481 CAGAAGACAA AGCCGAATGC AGCCCTAAGC AGACACTATC CCAGCCCAAG GAGCTATGCA GACAGCTCCT GCCTAGAGGA 000560
000561 GAAACAATTC TCTCCTTCAG CCAGGCATGT GGCACAGAGG TGAACAGGAA ATGCTTAGGG AGACAGTTCA CAGATATATA 000640
000641 TGTACTTCAC AGGGCACAAT TTAATCCATA AATCAATCAT GGATCACCGC TAAAGAAAAC AAAATATGCT GCCACTAATT 000720
000721 TGCCACCACC CTGTGAGCAT GATTGGATGA TGTTTATTTT GATTAGGAGA TTGCTCCATT TATAAATCTT CAATACATCC 000800
000801 TGTCCCCCTA AAACGGCATC TGGGTCTTTT GAGGGTTAAA AAAAAAAAAA AGTGAAATTG GATGAGGGAT GAGAGCTGGA 000880
000881 CTTCTGTGTG TCCCGGGGTC CCTGGATCCA CAGCCTGGTG CCTAGCATCA GTTAAGCCCA CGCTGGGGTC GCATCACATC 000960
000961 CCGCAAAGCC ACGTGCTCTG TGAGGCCAAC AGGAGCCCCG CCGGGTGGAC CAGCTGGAAT CTCAGAGAGC GCCGACTCTG 001040
001041 AAACTACCCG GCTCTGCAGA AGCACGCTGG GCCCAGGGGC TTCTAGACTG ACAGCTCCAT TTATCAACTA CCTATTGGTT 001120
001121 TTAAAAATTG GAGTGTCTTT TCCGCGTTGA CTGATTTTGG CTCTAAGAGA TGTCGCTGGT CATTTCAGAG TGACTGAACC 001200
001201 TCCCCTCTAA CAGATCCCGG GAATTGTTTT CAGGAAAGGT AAAAGGCAGC CTTTTCTGTC ACAACACAAC GCTGAGCCGG 001280
001281 CAGCCTGGCT CTGTCAGGAT CTGGGGCTCC CGCGCCCGAG AAGCCCAGCC TCGCCGGCGG CCAAGTTCAC CGCGAGGCCC 001360
001361 CGCGCTGCCT GCGCTGCGCT CCCGGACCCG CACCGACCGC AGCGCGCGCC GCCGGTGCTT CTCCCACCCC AGCCTGGAAG 001440
001441 CTGCCTCCCT CCGCCTATGC CTGCAGGATA AGAAGCCCGA GGAGGCGGAG CATGGAACTC GACAGTTAAA ACATTTAAGA 001520
001521 GAGAAAACCT AGTGTCTTGC TGGCCTGAAA TCGAGTACGC AGCCCGGGGT GATCAGGGTC TCCCGGCCCG GATGTGTGAG 001600
001601 ACTTGCTTCC CCTGGGCAAT AGGCGATACG ATGCTTTAGG AGGAAGGTGT CTCTCCCTCC TAAGCCCCGG AGGGGAGAAC 001680
001681 TTCCAAAGAC AGAAAACCAC AGGCTTCCTG GCACAGAGCT TTCCCTTTAT CAGCTAAAGC AGAATCTTTT CTGGCCTTAA 001760
001761 CCTGGCCCCT TCCTCTAACT GCAGGCAGAG AGGCAGACAG AAAAGCACTT GCTGAAACAC AAAGTTTTGT TCTGTCCTCA 001840
001841 ACGAACTGTC TAGAGCTGAT TGCTGATAGT CGTGGTGCAT TATGCCTTCC TGGTTTTCAT TTAATTGGGC ACCACGCTGC 001920
001921 CTTTCAAGAC GCCTTAAAGG AACCAACAAC CAAATCCAAG AGAGCTGGAC AGACCATTGA ACACACAGTA GGCTGTGTCT 002000
002001 CGTGGCTTTC GTTGTCTGGT GCCTCAAAGA AAACACCAGA AAGATTGTTT CTAAGCTAGA GCCACCCCAG ATTGCTTAAA 002080
002081 GTGCAAAGCT CACTGCTGTT GGGGGTACCC TTGTGAGACA CTGGAAAGCT GGTTTTACCG TGGCCCTATG AAGAGGAAGA 002160
002161 CTGAAATTTA GACAGTAATA CCTTTACTAG GATTGGAAAA GATTTGGTTA ATGACAGCCC TGTCATTTCT AAAACCCATT 002240
002241 ATCACTGTAT GAGAGATTCC TTTGCGCTGC ATCCTCGACA GTGCTTCCTA AGGCTCTGCC GACTTCCAGT TCTGGAACAA 002320
002321 GATGGTTAAA CTCATTTTTC CCTGCTCTGC TCCTCTAAAT ACAACTAAGT ACCTTGGAAA CTATTCAGCA GACAATGATA 002400
002401 AAGGGCTCTG AAAGCTAGAA GAAAAGGTGT ACTTGCAAGA AACCTCAGGA CTTGAGTAAC AGCAACATGA ACAATCCATT 002480
002481 TCACAGATGA AGAAACAGAC TCAGTGAGCA CGTGATCACT TCTCACAACT AATGGAGCCA AGATTCTGTC CTTATGGCTC 002560
002561 CAGAGACCTC TTTTTTTTCC CACTGTACCA CAACACTCAC CAGGACTGGA GTGCCACCTA TGACCTCATT GCATAATGGA 002640
002641 TGGCTTGTCC TCAAATGGGC CCGGTCTTGG ACGAGCCTGA GGATGTCTAC AAGTGGAAGA AAAGAAGCCA CTGGAGCAGA 002720
002721 AGGTGGGGAG GATAAAATTT GGAGCAAGAT TCTCAAGGAA GCAACAAGAT CCTAAGATCT TGTTCTCACT AGAGAATAAT 002800
002801 TTCTACATTA TGCCCAGGTT CTTCTGAGCT ATGAAGGGGC CCAGATTTAA GGGCTATTTT TGACACCCTA AATGTGCTGA 002880
002881 GACAAGTCAT TAAGGTGGTC CTGCCAGGAC ACAGCCATCT AAAGCAGCAA TCTGCTTCTT GCCAGAAAAT CTCGTGCCTC 002960
002961 TGCAGAGCCT TTTCCAGAAT GAACCACACC ATGCTGAGGA AAGGAGAAAG AGACTACCTA CTGCATTTCT GTCACTCGCT 003040
003041 GAAAAGGACA CTCTGTCAGA AAATCTTCTA GCAAACTTCA AAGGGCAAAA TCACCCCTTG TTACTGATAA AGCCCAGAGA 003120
003121 GCTTCAGCAG CTAACATTCC CTGGACAGGG CACAGCAAGG ATTTGAACCT AGGTCAGTCT GGCCAGAACA CCCACAAGCT 003200
003201 TTCCTTAACT CAGTGTGCTA TCTCCCCACG ACTAGGTCAC TACTGCTTTA TAATCACCTT TGTAGCCACC AGTGGATTTT 003280
003281 GCTCATCAGT ATTTTTCAGG CAATTGATAC TTTAGATATT CAGCTGCAAG ACGTATGCAG TTTTCATTGA CATCTTTTGG 003360
003361 AGAAACTGAC AAACCTGGAC TTGACTTAAT GCCTTTGGAA CCTTCCAAGA TGTTATATAA CTCTAGATAG AAGGCTGGGC 003440
003441 CTCCATGATG TCAGGAATGT TGCATTCTTA TTTCCCCATA GATAAACCCA TTTGTCCACA AAGTCAAGGA GTCAGGCAGA 003520
003521 GGCCCTTGCC ATGGGGCTTT TTAGGATAAA GCAACAAGCC TGGACTTTGC TCTACAACAG GGTTTTGCAT AGGGAGTGGT 003600
003601 ATGACCAGAT CCCTCAAGAA AGAAAGCTTA GAGACCAGGC CAGAGTCCAC TGCAGTAGCC CAGTCAAGAG AGGATGGTGA 003680
003681 CTTGGACTTG TAGTAGAGCC AGTTAGAATG AAAGAAATTG ACACATTCAG AAATGGTTTT AGAGATAGAG TCAAACTGGA 003760
003761 CCTGATAAAG AACTAGAGAA GCGGAGTGAG GATAAAGAGA AGAGCCATGA CTGACTCGGA AGATTTTGTC TTGAAAAACT 003840
003841 TGAGAACTCA AGACAGAGTG AAATAAAATC ACATGTGGGA AAAATCT
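
The listing above is a display format: each line carries a zero-padded start position, groups of ten bases, and (except on the final, shorter line) an end position. As a minimal sketch, the Python below strips those markers from one line and checks the coordinates against the base count. The helper name parse_numbered_line is hypothetical and not part of LncRNAWiki; the two sample lines are copied from the listing.

```python
import re


def parse_numbered_line(line):
    """Split one display line into (start, bases, end).

    `end` is None when the line carries no trailing position marker,
    as on the last line of the listing.
    """
    tokens = line.split()
    start = int(tokens[0])
    end = None
    # A trailing all-digit token is the end-position marker, not sequence.
    if len(tokens) > 1 and tokens[-1].isdigit():
        end = int(tokens.pop())
    bases = "".join(tokens[1:])
    if not re.fullmatch(r"[ACGTN]+", bases):
        raise ValueError(f"unexpected characters on the line starting at {start}")
    return start, bases, end


# First and last lines of the listing, copied verbatim.
first_line = (
    "000001 ACAAGTATGA AGAGAATGTC GGGAGAGGAA GTGGTGGCTA "
    "TGAGTCAGCA TGAGTCATCT CGTTCCAATG AGAATGAAGG 000080"
)
last_line = "003841 TGAGAACTCA AGACAGAGTG AAATAAAATC ACATGTGGGA AAAATCT"

start, bases, end = parse_numbered_line(first_line)
# The end marker should equal start + number of bases - 1.
print(start, len(bases), end, start + len(bases) - 1 == end)

start, bases, end = parse_numbered_line(last_line)
# No end marker here, so the last position is derived from the base count.
print(start, len(bases), start + len(bases) - 1)
```

Applied to the final line, the derived last position is 3887, which is the transcript length implied by the listing.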

Predicted Small Protein

Name MIR4435-2HG_smProtein_3530:3679
Length 49 aa
Molecular weight 6094.9944 Da
Aromaticity 0.122448979592
Instability index 48.6673469388
Isoelectric point 10.506652832
Runs 7
Runs residual 0.019194704909
Runs probability 0.048123754006
Amino acid sequence MGLFRIKQQAWTLLYNRVLHREWYDQIPQERKLRDQARVHCSSPVKRGW
Secondary structure LLLHHHHHHHHHHHHHHHHHHHHHHHLHHHHHHHHHLEEELLLLLLLLL
PRMN -
PiMo -
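
Several of the fields above can be re-derived from the transcript sequence. The Python sketch below translates the open reading frame and recomputes Length and Aromaticity. It rests on assumptions that the record does not state: that the 3530:3679 in the name is a 0-based start with an inclusive end (positions 3531 to 3680 in the 1-based numbering of the sequence listing, stop codon included), that the standard genetic code applies, and that Aromaticity is the fraction of F, W and Y residues. The function names are hypothetical; region is copied from the two sequence lines that start at positions 003521 and 003601.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon (not returned)."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def aromaticity(protein):
    """Fraction of aromatic residues (F, W, Y); an assumed definition."""
    return sum(protein.count(aa) for aa in "FWY") / len(protein)


# Transcript positions 3521..3680 (1-based), copied from the listing;
# the spaces between ten-base groups are removed by split/join.
REGION_START = 3521
region = "".join((
    "GGCCCTTGCC ATGGGGCTTT TTAGGATAAA GCAACAAGCC "
    "TGGACTTTGC TCTACAACAG GGTTTTGCAT AGGGAGTGGT "
    "ATGACCAGAT CCCTCAAGAA AGAAAGCTTA GAGACCAGGC "
    "CAGAGTCCAC TGCAGTAGCC CAGTCAAGAG AGGATGGTGA"
).split())

# Assumed reading of "3530:3679": 0-based start, inclusive end.
orf_start_0, orf_end_0 = 3530, 3679
offset = orf_start_0 - (REGION_START - 1)
orf = region[offset:offset + (orf_end_0 - orf_start_0 + 1)]

protein = translate(orf)
print(protein)
print(len(protein), round(aromaticity(protein), 12))
```

Under these assumptions the translation reproduces the Amino acid sequence field, and the recomputed Length and Aromaticity agree with the values listed in the record.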