NONHSAT003940

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT003940

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3870 nt

Genomic location

chr1+:73771853..73804560

Exon number

4

Exons

73771853..73771957,73772851..73773046,73776536..73776673,73801129..73804560

Genome context

Sequence
000001 TGGTCACCTG CACTAACAGC CTGAGAGAAA AGGACACTGC ATATCACAGA GAGATGCATA GATGTTACAC TCAGAAAGAG 000080
000081 AGTGAACAAC TAGAAGATGT GGAAGAACCA GTTTGGCTCT GGAGTCAAGA ATTCTCAAAA CAGCTTTCTG AACCTGAATC 000160
000161 GCACGCAGTA AAGCTGGTGT TCTCCTGAAG CCAACATTTA TGGTATCACC TTGCTGATCC AGGCAAAATG GCAAAAGTGT 000240
000241 AGTGAACATA AAGCCAGGGG CTGAGATTGT GGATGTCCTC TGTACCAAGT TTCTTATTAT GAAAATTCCA AGGGCCAATT 000320
000321 TTGAAACAAA CCAGGCAGAG ACCCAGATGC AGAATCCTGC AGGCTGAGGG ATGTTAGGAA GAGTTAGCCC ACCACTGCTG 000400
000401 CGCTGAAACC AGGATGACGC AAACTGGGCC TCAGGCCAGG ATCAACATCT TGTCTAGACA GAGAAATATT GATGGAGATC 000480
000481 AGAAAATTTA TTAGAGTATT TGAAGGTCTT GTGGGACTGG AGAGACAACG TTGGACACTT AAGTGGAAAA TCTGACACAT 000560
000561 AACAGAACTG AGACTATCAT TGTATTTTTT AAAAATAGTG GTTTTTGACA AACTGCAAAG GTTGCACAGT GGAGAAAAAA 000640
000641 GTTTTTTCAA CAAATGGTTC TGAATCGATG AAATATCTAT ATATGTTTTT AAAGTGAGCT TGAATCCATA ACATTGTACC 000720
000721 ACATAAAACA TTAAAGCAAG ATGTGTCATA AACATAAAAG TAACTCATAA TTATACATTT TCTAAGAGAA AATATGGTAA 000800
000801 AAATGTATTT TTGTTAGGTA AAGAAATAGA TAGTATGATA AAAGCATGAG CGTACAAGTG AATAAGTTAG ACTTTATCAA 000880
000881 AGTAAAAACT TCTGCTCTTT CAAAGACATG CATGAGGAGA ATAATAAGAA AAGCCACACC TTAGAAAAAA AATTTCAAAA 000960
000961 TATATCTTAT TACAAACTGA TATTCAAAAT TTAAAAACAG CTTTATAAAC TCAATAAAAA GAAAACAAAT TATTTTAAAA 001040
001041 TTGAGTAAAA TATTTGAATA AACTCTTTAC CATAAAGACA GGTGGATGGC AAATAAGCCA TTAATAGATG CTGAGTCATT 001120
001121 ACAGAAATAC ACATTAAACA CAAAGTGAGA TAGTACTGCA CACATATTAG AGAAGCTAAA AGTTAAAAGA CTGTTGATAT 001200
001201 TAGGTTGTAA TAAGGTGAAG AAAATTGGAA CTCATACACT TCATGTGGGA ATGTGAAGTG GTAAAAGCAC TTTTGAAAAG 001280
001281 CAATGTTTTG TGTTTGACAC AAAAAGTGAT AATAATTTTA AAAACAGACT ATATTAAGAT TATAATTTCT TTTCACCTAA 001360
001361 AGATACACCA GTTAGACAAT GAAATGGAAA ACCACAAAAT AGTAGAAGAT ATTTGCAATA AATATACCTG ATAAAGATAT 001440
001441 TATATTGAAA AAATATAGAG GTCCTACAAA TCAATAAGAA AAAGCCAAAT ACAAAAATGT GCAAGTGACT TACACAGAAA 001520
001521 TGTTATACCA GACATCAAAA AGGAAAAAAA AATTCACTAC TTCGTCATTA TACAAATTCA TAACATAATG CCAACATATA 001600
001601 CTCCCAGAAT GGCAAAAATG AAAAAGAAGG CAGTTACTAT AATTTTTCAG TGGGAATGTA GAACAACTAG AGCTAACATA 001680
001681 ATGTGTTGAC TGAAAAGCAA AGTGTTTTAC AACCTTTGGG AAAATTCATT TTTTTAAAAT TTTAAAGCCA AACATAACTC 001760
001761 TTAGGCATAT ATCCATATGT ATATATGTTT GGCTGGGCGT GGTTGCTCAC GCTTGTAATA CCAGCACTTT GGGAGGCTGA 001840
001841 GGTGGGCGGA TCACGAGGTC AGGAGTTCGA GACCATCCTG GCTAACACGG TTCCTGGCTA ACACGGTGAA ACCCCGTCTC 001920
001921 TAGTAAAAAT ACAAAAAAAT TAGCCAGCCT TGGTGGCGGG CGCCTGTAGT TCCAGCTACT CAGGAGGCTG AGGCCCGAGA 002000
002001 ATGGTGTGGA CACGGGAGGC AAAGCTTGCA GTGAACCAAA TTCATCAAAA AGGAGATCAT ACTGGGTAAC CTTAATCAGA 002080
002081 TGAGCTCTTT AAAAGAGAAT CCAGACATTC CCTGTAGAGA AAGATTTGAA AGTAGAGTTT CTCCCACTGC CCTAGAAGAA 002160
002161 ACCAGTAACA ATACTGTGAA CTCCCTATGA AGGGGGCCAT GGTACAAGCA ACTGAAGGAG GCCTCTAGAA AGTACGAGTG 002240
002241 ATCTCCAGTG GACAGCCAAT CAATAAACAA TAACTTTAGT CCTACAACCA CGTGAAAGAG GACTCTGAGC TGCAAATGGA 002320
002321 ATGCAGCCAT GACTAAAGCA TTGCTTGTAA CCTGGTGAGA TTCTGAGTAG TGGATCAAAC TAACTATACC CAAACTCCCA 002400
002401 ACACATAGTG ATTATGAGAT AATGAATTTG TCTTATTTAA CATCAATTAG TTTATTGTAA TTTGTTACAC AGAAATAAAA 002480
002481 AATGAATACA GAAAGTTAGG ATAATGCTTC CATTTGGTGA GTGGGAGGGG ATTACTGGGA AAGAAATCTA GTGGAGCTTC 002560
002561 TGAAATGCTA GTAATGTTCT ATTTCTTGAG TTTGGTATTA TATAAATAAA TATTGACTTT GTAGAAATCC ATCAACTAGA 002640
002641 TACTTTTAAT ATATACAGTC TTAGGTATGC TCTATATTAA TCTCTAAAAA AGAGATATAT AAAGAAAAAA TGAAAACAGA 002720
002721 ATGAACCATC AATATATATT GCATTGTTTT ATCTGCAGAA TAATTTTAAT ACACATATGA ATAAAACTAC ACATTCAATA 002800
002801 ATTTCATAAA TGTAGTTTTA ATAGTTCTTA AAATGGCTAT ATTTCTCAAT TAGTCTTTTC CTAAAGGTTT TCAGTCCTGC 002880
002881 TATAATTTTT CTCTTTTTTC AGTCCTGTTT CAATTTGTAT TCCATTTTTT CTTCCAAATA TGGCAAAAAA TAAGACAAAT 002960
002961 AAGTAATAAA ACTACAGCTA ATTTTTTAAT AAAATATCAC CACTCCTGTG ATTAAAATAC ACTGTATATT TATATATTTT 003040
003041 AAGTACCAAT TAACACTATT GCCTTTCCAT TTCAACTTTG CCAACAATAT TGATGATTTT AGAAATGGCT CAAGGAAACT 003120
003121 TTATGTGAAA AAGAGTTGCT CTATGTGGTT GACTCTGAAT AAATTTTTCC CACTCCCTTT TGAAAGCTTC ACTAACAAGA 003200
003201 TATAAATTCT CCATTTTCGA TATCTTACTG GCTAGTCTTG TGTTTTTGTT ACTGAATGTA CTATATTTTA TACGTGATAT 003280
003281 AACAGATTCA GTACTAGTGT TTAATTTTCT AATATTGAGT GTTTTTCCTT TCTGCTTTCA TGTGTTTTAA ATAAATGTGA 003360
003361 TGTATTCTGG GAGTCAATAT TTTTATACTA TCTTACAATC TATGATGATT ACAAATCTGA AATTTGAATA GACACTTTTA 003440
003441 GCTCTTGGCC ATTTTAAGAC ATTAAAGCTC ATTAGCAATC ATTTGTCTAC ATGAACTACT TTGTCTATAG CAAAACTGAG 003520
003521 ATCAAATAAA TGGTAGGAAA CCTGCCTAAA AATTTCCTGC TCTTCACACT AATTTGAATA TATCATCAAT TTTTACTAAC 003600
003601 AGATTTGATA TGATGGTAAA GAAAGATATT AAAAGAAAGG CTAAGTTGAA ATATGTACAT TAGCAGATGT TTGTTTATTC 003680
003681 AGTTGCCAGT AATTTCTAAA ATTAAATAAC TTTATTTTTA AATAATTCCA AGTGATGCAT ATTATTTCTT TTAAATAATT 003760
003761 AGAAATAATT GTTTTTGATT GACAAATTAT GATTATATAC ATCTATAGGA TACAATGTGA TGTTTTGCTA TATGTATACA 003840
003841 ATGTGGAATG AATAAATCAA GCTAACACAT
[back to top]

Predicted Small Protein

Name NONHSAT003940_smProtein_53:187
Length 45
Molecular weight 5295.8226
Aromaticity 0.113636363636
Instability index 89.6318181818
Isoelectric point 4.70477294922
Runs 9
Runs residual 0.0414438502674
Runs probability 0.0431608078667
Amino acid sequence MHRCYTQKESEQLEDVEEPVWLWSQEFSKQLSEPESHAVKLVFS
Secondary structure LLEELLLLHHHHHHHLLLLHHHHHHHHHHLLLLLLLLEEEEEEL
PRMN -
PiMo -