NONHSAT079654

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT079654

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3748 nt

Genomic location

chr20-:37049230..37079565

Exon number

9

Exons

37049230..37049549,37049653..37049747,37050632..37051016,37055062..37055148,37056323..37058254,37059684..37059757,37063147..37063271,37063670..37064009,37079176..37079565

Genome context

Sequence
000001 GGCATGTTTC AAGATTTATT GAGTGGGTGA GAGGACAATG GTCCATAGCA CAGAAAAAGT CAACAGGAAC AAATTCTTGG 000080
000081 TTTTCAGAAG TCACACCAGG AGGAGGCGGG AGGACACGGA GGATGGCCAC TGGATGGCCT GGCCTAGATG GAAGAAAACC 000160
000161 TGTCTTCCTG GGCCATGGCA CTCTGTGGCA CTGGCAGGGC AGAGCCAGCT GAAGGGTCTT CAACTCTGGA TCCCAAGTCC 000240
000241 CCAGAGCCAC AGGGCTTTTC CTCTGCCCAC CCACGCTGAG GGAAAAGTCG AATAATGTTG TGCTGAGACA ATAGGGGCAA 000320
000321 GAAGGTCCAG CCATGAGTCA CAGGGTGACA TCACTCGTCA CTCTTGGTCT GTGATCAACC CCCAAACAAT GGCGCGAAAC 000400
000401 GAGCGTAGCT TCCTTGTCGT GTGGCCTCAG TCCTTCGCCG TCCCTCGCCG TCCTTCGCCA TCGCACGCCA CCGCACCCCA 000480
000481 TCTCTCGAAA TCTGCAGACA TCTTGATTTT TCCCACGCTG TCTGTCAGGT CTCCGCCGCC ACTCGACGCC AGGGCGCCGG 000560
000561 GTGAGTCCAC AGCAGGGCGC ATGGTCGGCT CCACTCTCAC CATCGGGGGA AACTGAGGCC CCGGGGAAGG CGGCCAGAGA 000640
000641 GGTCAGGCCC TGTCCTTGGT GCAGTGTCTC GTCCTCTTTG CCGGAGTGAG CCCCCCAGGA TTCGCGTGTC CTCGGGAAAG 000720
000721 ACACTGGCGG AGGCCTTGTG GGCTGTGCTG CACCTCGGAC GGCTTCGCAC CAGCCAGCGC CCTCTCTCTC CTGCAGCACT 000800
000801 CTGATCTGCA CCCCCTGAGG GGCTTCCACT GTCCGCGGGG TGAGAATGCC CCTGGAGGAG TGTAACATGA CTGCCGCCCC 000880
000881 ATGTGTGTGA GAGGCGTCCT CTGGGAGAGC ATGGATCCTG AGGTCCCAGT GTGTTGCTCT TCTCCCTGGC CCAGGGCAGT 000960
000961 TATTGTGAGA CATGAGGGAA GGGTGTGGGG TGATGGGTTT GGCCTCATGG AGCAGAAGGG CTGGGTGTGG AGCTGAGATA 001040
001041 GGGCTGCCCT GCTCCTTCCT CCCGGAAGCT TTCTCTGCCT CCACACCTCT CCTTCCACGT CCTTATCCAG CCCCTGCACC 001120
001121 AAGCTGAACC TGTGGAAATT GTTGCCTGAC TTGTTTCCTG GCTCTCTTTG CTGTGGTCTC AAGGGCAGTG TCTGAATTTT 001200
001201 GTTCCCTACT TTGTTATGTC TGCCAGATGG TCAGTATTCT GACAGTGTGG GACAAAGGAA AAACGGCAGC CTGGGCAACC 001280
001281 CCGCCTCTAT AAGAACTAAA AAATTAGCCA GGCATGGCGG TGCATGCCCC TGTAGTCCTA GCTTCTCAGG AGGCTGAGGC 001360
001361 AGGAGGCTCT CTTGAGCCCG GGAGGACAAG GCTGCAGTGA GCCATGACCA TGCCACTGCA CTCAGCCTGG GCAACAGAGT 001440
001441 GAGACCCTGT CTCAAGAAAG AAAAACGAGA AAGGGAGAGT CCCTCCACTG TAAGGAGATC GGGTTCATTA CATTTTGGGG 001520
001521 TGTTGGAGAA AAATACTGAG TCAGCACCTG TGTGGGATTG GTGGGAGCAG ATTTGGTGTT TTCCACCCCT TCACAGGATT 001600
001601 CTGAGGTAAC TCATTTCTGT TGGCCTTGGC CTTGTATGGG GAGGATTTCC CTCCAGCCTT GTATGGGGAG GATTTCCCTT 001680
001681 GTATGGGGAG GATTTTCCCT CCAGCTTGTG GGAAAGGAAT CAAGGACCAG AGACAGGCAG GGGAGAAGAT CACTGAGGGA 001760
001761 TTTACGGCAG CAGCCTCTGC ACGGCTTCCC CACGACCTCC CCAGCTGCTT GCTGGACGCT GCTGGAGAAA CAGCACATCC 001840
001841 CAAGATCATC ATGGCCCCAG CATCCTCTTG AACTTAGTAA CAGTTGGCCT GACAGATGAA GCGGATATCA TCCCTGCCTT 001920
001921 AGAAGTTCTC TAGCCCCTTT CTTAGAGATA AGCTGCTATT TCATTCCTAA AAGAAGTCAT GGCTGTTCAG GTGAGAAGAA 002000
002001 CGGTTTAGTC AGCATCTATG CAACACTGTA GAGATGCTTA GTGAGAGGTG TTGCAGGAGA GTGGCAGACC GCAGGGCCCG 002080
002081 AGGTAAGGTT TCTTGTCCAT CTCTGGAATT TTGGCTTTTT GCACCCTGTA AAGTCACGTC CCCTTTTTTT CTGGTGCTTT 002160
002161 GGTGGTGACG TAGGGAGAGG TGAGGATGAG AAGAGGAGCC ATGTCCTATG GATTCCCAGA TCCAGACTCC TCTTCCCTGG 002240
002241 GAAAGCTGGA TTGGAGCTGG CCCCTCCTTC TGTGCCTTCA GGGGCTGTGG AGGAGGAGGC CCCTGCAGGG AGCTGCCTGG 002320
002321 GGCGAGCGCT GGGGGCTTTT CCTGCAGTGC ATTAGCAGCA TGTCATAATG GAAAGTGGCT GGATTTCTTT CCAGTTGTAA 002400
002401 TCAATGGTTA AGATTTAACC TAGCAACATT GTAGGGGTAC ACAGCCAAAT GATTCAGGAA AAAAGATGCA GGTTGAGTCT 002480
002481 GGAGGAATCC CTGTGCCAGC TTCCTCATGC TCCCTCCCTC TCTCAGAAAT CACATGGAAC CCATTCTCCC CTCAGCAGTG 002560
002561 AAAATGCATT CAGCCTGTGT GCAGTGTTTC TGCCCAGGGA AGCCCAGTAG GGCCTCCATG CCCGGGGTGT TTACTGGGAT 002640
002641 CTGGTTATGT AGACACCACC TGCCTAGCAC GCACCAAAAT TCCAGACTTT CAGGAAAACA GGTGCCTGTG CCACATTATT 002720
002721 TGCACGGAGA GACTGTGTCC AGTGAACGGT TCTGATCAGT AAACTGTTGA CTAGGAACAT CCCGAGAGCC AAGTTCCCAG 002800
002801 ATGCCAGCCA AGGGCCATCT GACCTGTTTT GAATCTCAAT TTCCTGATAC AGAAAACAAA GAGGATTGTC AGCTGACCTC 002880
002881 TGTCCTGTGT GCCCAGTGGC CCCAGGTGAC GTGTCTTCAA GAAGAGGCTG AGCTGCGGTG CTTGTAAGAG ATGGGTCCGC 002960
002961 TGTGTGCAGA GGCTCCATGG GATGTGTTGT CTTCATAGTA GCAGGTCGCT GGACCATGGG TCCGCAAGGT GGGATTTTGG 003040
003041 ACTGTTGTCT GGAATATGTT TCTTTGCATC CTGCAGAGTC ATGTGCTGTT CCTGGGGCTT GGATGATGTA GGGGAAGCAA 003120
003121 GGTGAAAGTG GTATCCCGTG GTTCCACATT CTGTCCTGCT GTGGGGCTGT CCCTCACATG CTTTCAGTGG CTGTTGCAGA 003200
003201 GGAAGAGCCT CTGGAAATCT GCGTGGGTAA AGTGAGCTCC CCGGGGGAAC CGTCCTCCTC CCTTGAGCGC TGGTGTGTCC 003280
003281 ATAGTGTGCT GTGATCTGGA TTAGCAGTGA GTTTTTCTTT CAGGGTGTGT GTGAGGTCTC AGCCTTAGAT CCAAGGACAG 003360
003361 TCCAAGGAAG TCCTAAGACC ATGGAGTTGG TGATCTGGGA TCTGGGTTTG CTGATATTTC TCACCGTGAG AATCTCTTGG 003440
003441 TGGTGTTTGT GGGCACGAGA GGGGCAGAGA ATGGAGAGTG AGGCTACCAC ATGAAGCGTC ACCAGAGCTG CTCCCTGCTG 003520
003521 CCTGCTCAGA GCACCCCGGA TCCACTGTTC AATCTGCACA AGATTCGGGG TCCAGACATG GGAGACTTCA GCTGCCTCAG 003600
003601 AGGACCGTGG ACAGGGAAGG CCAGCCTCGC ATCCCTCTGT CCATGCCTGG AATGACTTTA ATAACCAAGA GTTTTATTTT 003680
003681 TGAATTTGTA GCTGTCGTTC ACTTTTTACC CACCCATTCA ATAAACCTTA CAGAATTGCT CCATGTGC
[back to top]

Predicted Small Protein

Name NONHSAT079654_smProtein_2207:2410
Length 68
Molecular weight 6888.7589
Aromaticity 0.119402985075
Instability index 49.6447761194
Isoelectric point 5.46343994141
Runs 11
Runs residual 0.0208642650621
Runs probability 0.0297891474362
Amino acid sequence MDSQIQTPLPWESWIGAGPSFCAFRGCGGGGPCRELPGASAGGFSCSALAACHNGKWLDF
FPVVING
Secondary structure LLLLLLLLLLLLLEELLLLLEEEELLLLLLLLLLLLLLLLLLLLLHHHHHHHHLLLEELE
EEEEELL
PRMN -
PiMo -