NONHSAT104646

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT104646

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3591 nt

Genomic location

chr5+:151329132..151650009

Exon number

7

Exons

151329132..151329224,151330535..151330597,151331199..151331241,151574285..151574365,151575002..151575091,151610824..151610860,151646826..151650009

Genome context

Sequence
000001 GCTAGAATGC ACTGTCTTTG AAGCATTGAG CAGGCCCTCA CCAGACACCA AATCTGCTGG TTCCTGGATC TTGGACTTCC 000080
000081 CAGCCTCATA ACTAGGCATC ACAAATCTCC GAGAGGAACG CATCCACTTC CTGGCCTGGT AGAGCGGAGA GCAGAGGTGG 000160
000161 GCCCAGGAGT CCAAGAACTT CCCCTAAGGC TGCTAGTCGC ACCCATCTCT TGAGCAGAGA CCCAAGAGGG CAGGACAGGA 000240
000241 AGAATGGAAG TGAGATTTGG CCCCGGTCAT GTTTTCCAAA GTGCTGACAT CAGGAAAATA ATAGAAGTTA ACGCAGATAT 000320
000321 GGGGCTGCCA CTGAGGAGAG AGAAGACTGA AAGAAGCATA TAGATAAAAG TTTCAGTTAC CTGAGGTCAA CCTTGGTCTG 000400
000401 AAATTAGATT ATCTGAGTAG GCTGAACCAT CGGTGAAAAG GAGGCCAAGC ATCTCAAAAA TAGGACACGT GAGAGTATGG 000480
000481 AGCCAATGTG ACACTCAGGT GAGTTAGCCT TATGATCCCT AAGCATGCAG TCCTTGGACC AGATGCATCA GGATCTTCTG 000560
000561 GTAAGTGTGC TAAAATATAG GCTACTGAGA CCCACCCTAT GTCTGTCAAA TCAGAATATT CTAAGGAAGC AGGTCCCAGG 000640
000641 GAATATATAT TTTAAAGCTC CCTAAATACA CTTCATGGGC TGCCAGATTT GGGAATCGCA GGATTAGGTA GCTACCTAAA 000720
000721 TGAAGACACT TAAATGTGCC AAAAGCTTCT CTTTCACCCA TCCATTTATC CATCTATTCA CCCATCCATC AACCAAACAT 000800
000801 GTGCCAGTAT CTGGTATTTG CTTCATGACC TGCTACTGCT GTGGTGGGTG CAAAGATAAA TGATCAATGT TCTCAAAGAG 000880
000881 TCAATAGTCT AAGAGAATGG ATGAATGAAG TATAAATAAT AAAAGATGTG CTATGAAGTA CGATGGGGGT CCAGAAAAGT 000960
000961 ATATACTTAT TCCAGATTGT GGCAGAAGAC TCTGGAGAGG TTTCGTTTTA TATAATGGCA TGAATTCCCC TTCAGATCCA 001040
001041 TCTCTAAATC AAGCTTCACC TACTTCTGAC TGTTTGTGTT AGATACCAAT CCTATGGATT TCTATAGTTT TCTGGAATTA 001120
001121 CTTTTTACAC ACTACAGTAG AGTTATGTGG CTGCTTCCAT GTTTTACTTA GAAGCTGAGA GCTTGATAAA AGTGAAGGCT 001200
001201 GCATCTTAAT CATCACCTTT TCCCCCACCT CCTAGCTAGG GTCTGGCTCT GCCTAAGCAG CTGCCATTGG TGCTCAATAA 001280
001281 ACATTGAAGA AGTTGGGTTG TTGCACTGGT TATGCAAAGA AAGGGACACA GTGGGTGTAA GCGTGGGGTG GGGAGCAAAG 001360
001361 CTGGGACCAA TCATGGAACA CCTAGAATTT GAATCAAATT TGGGCTTTAG TTTTTAGGCA ATGAGCCTAA GACAATTTTG 001440
001441 GAAGCATCCT TTGTTCACAA AAAAGTTAGA AATGGTGAGA ACTAGGGCAG TCTATCACAA AGTGTGGGAG AGGTAATTTT 001520
001521 ATATGTAAAC TATTATAAAT CACATTTGAG TTATGAAGTG AGACAATTAT TCCCTTTGCA GTTTCATTTC ATTCTGTCTA 001600
001601 ATTATTTCAG GGAGACATTC TGAATTGGGT GCTAGTATTT TAACACTTCT CTAACACTTT CTCTTTTTAA ATTTTGTTTA 001680
001681 TTTTTAATTG ACAAATAATA ATTGTGTATA TTTATGGGGT ACATTGGGCA TGGAGGGTGG GTTATTCTGT GATGTCTTGA 001760
001761 CCTGTGTGTA CATTGTAACA CTTCTCTAAT ACTTTCTAAT CCAGCCATCC CCAACCTTTT TGGCACCAGG AACCAGTTTT 001840
001841 GTGGAAGACG ATTTTTCCAC AGATGGTTGG GGGATGATTT TGGGATGGAA CTATTCCACC TCAGATCATC AGGCATTAGA 001920
001921 TTCTCATAAG AAGCATGCAA CGTAGATCCC TCACATATAC AGTTCACAAT AGGGTTCATG GTTCTATGAG AATCTAATGC 002000
002001 TGCCACTGAT CTAACAGGAG GCAGAGCTCA GCTGATAATG CTTTTTCACC TGCTACTGAC TTCCTGCTGT GTTGCCCAGT 002080
002081 TCCTAACGGG CCATAGACTG GTACCAGTCC ATAGCCTGGG GCTTGAGGAC CCCTGCTCTA TGATGTCTAT ACAACAATAA 002160
002161 AATTGCCTAA TGATACATTT CTCAGAATGT ATACCCATCA TTAAGTGACA CATGGCTGCT TGTGTATTTT TATATATAGC 002240
002241 TAAATGTGGC TAGATTAGAT GTGATATCTC TTTGGATGCA GGGATAGGTG GGCAAGTTTC TAAAACACGA AACTAACTGT 002320
002321 GTCTTTTCTC TGCTTAAGAA CCTTCCATGA CTCCCTGTTG TTCATAACCG GATAAATTCT AAGCTGCCTA ACTCAGAATT 002400
002401 CTAGCTTAGA GATGTCTACG CCTGGCTGCT CCCAACAGTT AGTCTCATGT CCTGTCTCAA ACATGGGGCT TCAGTAAGAA 002480
002481 TGAAATTTCC AAGGAGAATT TTATATTCTG CATGCCTTTA AAGGCTCAAG CCTGGAGACC CCCCTTCTTG CCTCTCTTAC 002560
002561 CCTTTTCACC AGGGCACCCT CTATTCATCC TTCCAGAGCC TTCTCTGGGA AGTCCTTTCC CAAAGCCCAC TTCCCAGCCT 002640
002641 AAGCTTAGTG CTATTTCTGT AGATACCCAG AAACACATCA ACTATATGTA CAGTTTACCC TAGTATGTCA AAGTTATAGT 002720
002721 TACTGCCTTA GGAGGGATTT TGAGCTTTTC ATCTTTATTA CTCGGGCTTC TGCTACTGTG TCAAGCAGGT AGTTGACACT 002800
002801 CAACAAATGT TTGTTTAGCC TAACTCAGCA GAACACATGG AGTGTGTTCA CATGGAGTAA CCTGATTTTG CTTAGTCTGC 002880
002881 AACATTTCTT GCATGCATCC CCTGCTTGCA GTCTCAATTA CAGTCATCCT GTTTCAGTTT TCTTAACTTG TCACTCCCTT 002960
002961 CCTCTGATAA CTTTTGCCAT ACCTCTGTAT AACATGGCCT ACTTTTTGAA AAAGATTTTT TTACATTCAC CTACATTTGT 003040
003041 CTCAAATTTT TTTTTAAAAG AAACTTTGTA TTACTACCGT CATTGAAAAC CTTCATCACT TGGCACAAAA GAAAGTTTAA 003120
003121 AAAATACAAC AAAAGACATT CTACTAAATT CTGGCTAGAC AACCTTGCCT GCTCTAGTAG ATGCACTGAG ACCATTAATG 003200
003201 AATCTTTCAT CCCACCTTGT GTCATATTCT TTGCCACATG ACTTTGGGTC TCCCTCACTA CTAGGCAAGT TCCTGTCTCC 003280
003281 ATCTCTTGAT TCTGAATTCA GCCATAATAC TTGCTCTCCC CAATGTTGTT TTAGCAGATT TGACACATAC AGAAGCGTGA 003360
003361 AAAACTCTTG TGTGACTGGG ATAGCTCCTC TGCAATGAGA ACATGCCTGG GATAATGTAC TGAAAAGATG TGAGAAATAC 003440
003441 ATGAAGCAGA GTCCAGTCAT CCCAGCCAAG AACCAGCTGT GAATGTAACC GCATGACTTT GAGGCAAGTC CAGACCAAGA 003520
003521 ACAAAGAACC ACTGGTCTTA AAGACAGACC TGCAGATTCC TGAGCTAAAT AAAAGTTTAT TGTTGAAAGG C
[back to top]

Predicted Small Protein

Name NONHSAT104646_smProtein_3197:3397
Length 67
Molecular weight 7243.0877
Aromaticity 0.106060606061
Instability index 62.9439393939
Isoelectric point 6.79315185547
Runs 11
Runs residual 0.0160587915079
Runs probability 0.0491561079798
Amino acid sequence MNLSSHLVSYSLPHDFGSPSLLGKFLSPSLDSEFSHNTCSPQCCFSRFDTYRSVKNSCVT
GIAPLQ
Secondary structure LLLLLEEEEELLLLLLLLHHHHLLLLLLLLLLLLLLLLLLLLEEEEELLEEEEEELLEEE
EEELLL
PRMN -
PiMo -