NONHSAT099079

From LncRNAWiki

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomenclature

Please input transcriptomic nomenclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT099079

Source

NONCODE4.0

Same as

,

Classification

intergenic

Length

3068 nt

Genomic location

chr4+:165798150..165820752

Exon number

6

Exons

165798150..165798559,165799371..165799424,165800072..165800163,165816568..165818722,165819878..165820134,165820653..165820752
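
The exon list above can be cross-checked against the Length and Exon number fields. The following is a minimal sketch, assuming the coordinates are 1-based with inclusive ends (the page does not state the convention); the function names are illustrative and not part of any LncRNAWiki or NONCODE tool.

```python
# Exon coordinates copied from the "Exons" field of this entry.
# Assumption: 1-based positions with inclusive end coordinates.
EXONS = (
    "165798150..165798559,165799371..165799424,165800072..165800163,"
    "165816568..165818722,165819878..165820134,165820653..165820752"
)


def parse_exons(field):
    """Turn a 'start..end,start..end' string into a list of (start, end) ints."""
    pairs = []
    for block in field.split(","):
        start, end = block.split("..")
        pairs.append((int(start), int(end)))
    return pairs


def exon_lengths(exons):
    """Length of each exon under the inclusive-end assumption."""
    return [end - start + 1 for start, end in exons]


exons = parse_exons(EXONS)
lengths = exon_lengths(exons)
total = sum(lengths)

print("exon count:", len(exons))
print("exon lengths:", lengths)
print("spliced length:", total)
```

Under that assumption the six exon lengths sum to 3068, which agrees with the Length field, and the first start and last end match the Genomic location span.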

Genome context

Sequence
000001 CTTGGAGCTT TTGGAGTACA CTTCCACTAA AGTTATAGCA TGCTTGAATG GTTTATTTCA CCAATATTTG CTTATGGAAA 000080
000081 TAAAGGGGAG TGGCCGAGGA GAAAGGAGAA GAGGAGTGGA GGAGGGGTTT GAGGCTGAGG GAGGCTCTGA CCACAGCACA 000160
000161 GAGCACCGGC AACTTTGTCT AATGTGATCA TTAACCTTCC TGCAAAACAC AGCTGGCAGT TCTCTGAGGT TTGTCACTAG 000240
000241 AATGTGAAGA CAGCCACACA GATATTGCAC AGACTATTTA CAGATCGTTT GGTTTACATT GAGAGTCATT GCTCTACTTT 000320
000321 TGTGCGGTAG GAAAATGAGA TTTCAGCAAT TCCTTTTTGC ATTTTTTATT TTTATTATGA GTCTTCTCCT TATCAGCGGA 000400
000401 CAGAGACCAG AGAAATTAGA AAGTTAAAGT TTTCTGAACC ACAAACCCAA CTTTAAATAA AAAGTTAATT TGACCATGAG 000480
000481 AAGAAAACTG CGCAAACACA ATTGCCTTCA GAGGAGATGT ATGCCTCTCC ATTCACGAGT ACCCTTTCCC TGAGGTATCT 000560
000561 CTCTAGCTAA CTTTACTGGA TCTATCAGAA GAAGAAGAGG AGTGAAGGAA AGACACCCAG CCACACAAAA GAACTTCATG 000640
000641 ATGCCAACAG CGTGATTGCT TAGAAGTTCC TACACAAAAA AAGGATCATT TGAAAGCACC TGGAATGGTT TATTAGCTTC 000720
000721 ACAGGATTTT ATTCTTCTTG GCTTCTATTT GGAGGGAAAA TAACATAAAT TCAAAAGGAT TCCAATCTGA AGCCCAAATC 000800
000801 GTTTGCCTAC ATAACAAAAA TATCTCATCT TTTCCTGCAC ATTATTATTC TTTTATGGGT TAAAAAGAAA AATACCTTTT 000880
000881 AGTGTTTTAG AACTCTCTCA TGGTAAAAAG TGCAAGAATT TAAAATGTTG CTTTCATATT CCTATAATTC TCCAAAAGTA 000960
000961 TTAAATTCGT ATATGTTTGA GTGATTTTCT AAAAACTGCT CAACCTGAAA TCAATTGCAT TGACCATTTG GCTTCGCACA 001040
001041 ATAGGGAGAA AATAATTGGT TCATTGATTA TATAGAGAGA AAGACTAAGA AAAGCTATTA ATTGCTACCA ATTTTATGAT 001120
001121 AAGCTTTAAG GTTTATGAAA GTATGTTTTT TTATTTAATG AGTAATGTCC ATTTGAAGTT GAAAGAAAAC ATGAAATCCT 001200
001201 AATTGTAGTT CATTTTATGT TCAAATGAAA CCATTGTTTT TGTTTTTGTT TTGAAACAGA GTCTCACTCT GTTGCCCAAG 001280
001281 GTGGAGAGAA GTGGCACGCT TTTGTCTCAC TGCAACCTCC ACCTCCCGAG TTCAAGTGAT TCTCGTGCCT CAACCTCCCA 001360
001361 ATTATAGGCT GGGATTACAG GTGTGCACCA CTACACCCAG CTAATTCTTG TATTTTTTGT AGAGATGAGG TTTTACCCTG 001440
001441 TTGCCCAGGC TGGTCTTGAA CTCAGGCTGG AACCATTCAT TTTTTAACCT TTCTCATCAT GTAATTATAG GAACCCAACG 001520
001521 TTTGATTTCC TTTGAAGTTT TGTTATGTCC TTTATTATTT TGTATGGATA ATTTCTTTAA AAGTCTTACT TAAAGTTGAC 001600
001601 ATCTAAAATA CAGTTATGCC AATGAAGTCC CACTCAGGGT GATATCTGTA TCTAAAAGAT GAGTGCTCAT CATCCTATTA 001680
001681 GGCTTTGTCT TGGTGGTGTT CATCCTGAGA TGCTGAGACA TGGAAATAAA AAATCAGAAG GAATTTAGGG ATATGATTAC 001760
001761 TCAAAAAAGA AACTATCCTG TCTAAATTTG AATTGTGTTG ATAACTAGGT GTTCCCCAGA TGCTAAGATG TTCTTAATTT 001840
001841 GTATTTATTG AAGGATTGTT AGCTTAGTGC CACAAAATTT TTCTTACTTT ATGTTAATTC CAGATAAGAA ATTTACAAGT 001920
001921 TTATATCTTT TTTTTTCTTT TTTTTAAGAT GAGATCTGGC TCTATCACCC AGGCTAAAGT GCAGTGGCAT GATCTAGGCT 002000
002001 AACTCCCTGG CTCAAGCGAT CCTTCCACCT CAGCCTCCCA AGTACCTGGG ACTACAGGCA CTCACGGCCA CACCTGACTA 002080
002081 ATTTTTGTAT TTTTTTGTAG AGATGGAGTA TCGCCATGTT GCCCAGGTTG GTCTCAAACT CAGGCTGGTG AGCTCAAGTG 002160
002161 ATCCGCCTCC TTGGCCTCCC AAAATACTGG GATTACAGGC ATGTGCCACC ATGCTGGGCC ACAAGTTCAT ATCTGGAGTA 002240
002241 GAAGTTTTAC TTTGTAAATA TTATAAAGTA GAAGAAACCA TAAACCATTT TGCTAAAATG AAAGGTTGGG GTTAATATAA 002320
002321 ATGTAATTTT AAATAGAAAA TCTGACAACA CTGTCGAGTT TGTCTTCCTG TCAAAGCTTA TTAAAAGTGT CTTTGCGGAT 002400
002401 GAATGGTACT TTCCACAAGT GCATTTGAGT AGAAGCATAA CCTATTCTCA GTTATATTTA TGTTTAAAAC ATGTACTGGT 002480
002481 TTGTATATTT TGTACTGAAA AAGAAAACAC TTTATAGTCA AGATACATCT CATTCAATAC AAGTCTAAAC TCTTTCAAAT 002560
002561 ACAAATTCGC ATATTCACAG AAAAAGTTAC AAATCAGTTT TACTATTGTA AAGTAATGAA ATGGTTATAC ATTTCTTAAT 002640
002641 TGTTCAATAA AACACTCAAT GATTTGCATG TCTGGCTGTC CTTACTTTGT ATAAGAAATG TTCAGTGTTG ACTTCCTTTG 002720
002721 AGGAATGAAA ATCATTGTTT GCTGTACATT TGATCAGAAG AAAAAAGAAA AACTGAAATT AGTGAAGTCA GTGAAGTCTT 002800
002801 CGGTCCCGTC CTGAATCATT TTTACCCTCT GTTTAGGGAC AGGTCTAGAA CGAGTGAGCA CAAAACTGAA GAGTGTGCCG 002880
002881 AAAATCTCAG CAATCAAGAT AACATTTGAA CTCCATTTTT GAAAAAAATA AAAACTAACA CCCACGAAAA ATACATGATG 002960
002961 AATAAAACAA TCTCTTGAAC CCAGGAGGAT GAGGTTGCAG TGAGCCAAGA TTGCACTACT GCACTCTGGC CTGGGCAACA 003040
003041 AAGCAAGACT CTGTCAAGAA AGAGAAAG
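
The listing above interleaves position counters with 10-nt blocks. A minimal sketch of how such a listing could be flattened into a plain sequence string is shown here, using only the first and last lines of the listing as sample input; the function name is hypothetical.

```python
import re

# First and last lines of the sequence listing, copied verbatim from this entry.
FIRST_LINE = (
    "000001 CTTGGAGCTT TTGGAGTACA CTTCCACTAA AGTTATAGCA "
    "TGCTTGAATG GTTTATTTCA CCAATATTTG CTTATGGAAA 000080"
)
LAST_LINE = "003041 AAGCAAGACT CTGTCAAGAA AGAGAAAG"


def parse_numbered_sequence(text):
    """Drop the numeric position counters and spaces, keep only nucleotide letters."""
    chunks = []
    for line in text.splitlines():
        # Runs of A/C/G/T/N are sequence; the 6-digit counters never match.
        chunks.extend(re.findall(r"[ACGTNacgtn]+", line))
    return "".join(chunks).upper()


first = parse_numbered_sequence(FIRST_LINE)
last = parse_numbered_sequence(LAST_LINE)

print(len(first), first[:10])
# The last line starts at position 3041, so the listing ends at 3040 + len(last).
print("final position:", 3040 + len(last))
```

The final line carries 28 nt after position 3040, so the listing ends at position 3068, consistent with the Length field.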

Predicted Small Protein

Name NONHSAT099079_smProtein_2102:2314
Length 71
Molecular weight 7899.3079
Aromaticity 0.1
Instability index 44.8814285714
Isoelectric point 9.30157470703
Runs 12
Runs residual 0.0265912305516
Runs probability 0.0476211505623
Amino acid sequence MEYRHVAQVGLKLRLVSSSDPPPWPPKILGLQACATMLGHKFISGVEVLLCKYYKVEETINHFAKMKGWG
Secondary structure LLEEEEEEELEEEEEEELLLLLLLLHHHHLHHHHHHHHLLLEEEEEEEEEEEEEEHHHHHHHHHHHHLLL
PRMN -
PiMo -
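
Two of the values above can be sanity-checked from the amino acid sequence alone. This is a minimal sketch, not the pipeline that produced the table: it assumes aromaticity means the fraction of F, W and Y residues, and that the suffix `2102:2314` in the protein name gives the ORF span in the transcript with both ends counted. Neither assumption is stated on the page.

```python
# Predicted small protein sequence, copied from the table above.
PROTEIN = (
    "MEYRHVAQVGLKLRLVSSSDPPPWPPKILGLQACATMLGHKFISGVEVLLCKYYKVEETI"
    "NHFAKMKGWG"
)


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y); assumed definition."""
    aromatic = sum(seq.count(aa) for aa in "FWY")
    return aromatic / len(seq)


# Assumption: the name suffix is an ORF span with inclusive ends.
ORF_START, ORF_END = 2102, 2314
orf_nt = ORF_END - ORF_START + 1
codons = orf_nt // 3

print("residues listed:", len(PROTEIN))
print("aromaticity:", round(aromaticity(PROTEIN), 4))
print("ORF nucleotides:", orf_nt, "codons:", codons)
```

Under these assumptions the span covers 213 nt, or 71 codons, which would match the listed Length of 71 if the stop codon is counted alongside the 70 residues shown; the aromatic fraction of 7 in 70 residues reproduces the listed Aromaticity of 0.1.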