NONHSAT122124

From LncRNAWiki
Revision as of 06:40, 17 October 2014 by 124.16.129.48 (talk)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT122124

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5106 nt

Genomic location

chr7-:96594575..96608908

Exon number

2

Exons

96594575..96598430,96607659..96608908

Genome context

Sequence
000001 TTCAGACAGA GTCTTGCTCT ATTGCCCAGG CTGCAACTGG TGTGATCTCG GCTCACTACA ACCTCTGCCT CCTGGGTTCA 000080
000081 AGCGATTCTC CTGCCTCAAG CTCCCAAGTA GCTGGGATTA CAAGCATGCA CCACCATGCC TGGCTAATTT TTGTATTTTT 000160
000161 AGTGGAGACG GGGTTTCGCC ACATTGGCCA GGGTGGTCTT GAACTCCTGA CCTCAAGTGA TCCACCTGCC TTGGCCTCCC 000240
000241 AAAGTGCTGG GATTATAAGC ATGAGCCACT GCACCCAGCC TTATACTGAA CTTTCAATGG GTTCAATTCC ACTAGGAGCA 000320
000321 TAAAGGCCAC TGCATATGAG TTGTGGAAAG AAGAGATTAG AAGAAGGAAG AACTTGAGAT GAGTTCCTCC CTTCAACATT 000400
000401 CTGTCTCCTC CTACCTAGCA TCTTCTTTCT TTTAGTCTTT CTAGAATGTC CATCTGTTTT TGGCCATTGC GGAGAGAGAA 000480
000481 GCTGAGCTTT AAAGGAGTAG GAGCTTCAAA GGCGTAGGAG CTTCAAAATT CTTGTTTCTT CATGTTTGAT CACCCTTCTA 000560
000561 AACCTGTCTT CTGTTCCTTC TGCTATTCTT TTTTCTTAGA GCATAGGAAA GGGGAGCTTT TAAATTAATA CTTAAAGCAT 000640
000641 GGAAAAAAAG AACTTGAGAA GAAAGTAAAA CAAGGGAGAT GAGGCTAGTA AAGTAAGGAA AATGAAGAGG AAGAGGAGGA 000720
000721 AGGGTTAGCT TCTAAATTCC AAGTCAAATT GATATGGAAC AGGCAAGCCG CTTGTCTTAC TTAAACTTCA GAAAAGGATC 000800
000801 TGCTGAAACT TGATAGAAAT GGAAAGGGAA ATCCTTGGGG TGGGGAACCT CCAAACATTA GTAATGATAT TGAACAACTC 000880
000881 AAAGTATTGA GGAAATCTGC AGGCTACATG CCTGAAGATT ACCCATGCAG ATAGACCAAA AGGATTAGAA TTATCTGTTG 000960
000961 ATATTAGTAA TATTTATTGA CATCTAGCTA GTATTGGTAA TTTTAAGTTT TAGATTAATT TCTTTGGTAA TAGCTATGAT 001040
001041 ATATTTTATA GACAAGAATT ATATCTATAG GCTTGCTATC ATAGGCTCTT TTAATCAGCA TTAATTTAGT CTACTGATTT 001120
001121 TTAGCACATT TGAATCATTC ACTTATGCTA GGTAACTCAT TGCAAAATAA AAAGATGATT CCTGTATGTA TGGCAGCTAT 001200
001201 ACATTAAGGA GGAGTCTACC AGAATATGAA AAAGTCAGCT GACCTAAATA TTGCTGAGAC AAAGGAAAAC CCACTCCCTT 001280
001281 GGAGGAGCAT GACCTTTTCC TGTAATTCTT CCCACTGCTG TTGTTGAGCT CCTTGGATCC TGGCTCCTGG ACACCATCAT 001360
001361 CAAGAAGACT TTATGGATGG GCTGTCCACC CACTGAGAGA AGAGGAGCAT CAGCTACAGT TTCTCTCTAG ATTGCCTTCT 001440
001441 TCATTTTGAG TAATGACTGT CAGCAGGGTC AGATTAAACA CAAAACAACT GGACAATTGC TTGGAGGACT AAACTATAAG 001520
001521 GGCACTAACA TGTCAATAGT AGGCTAACAC ATCCATGGAA AATATATTTA CCAGCTCTTC TCTCAGGGAG GATTCTGTGT 001600
001601 GGGGTTGGAA GTAATGATTT GTTAAATTCC TTAGGGGTAG AAAGTAGGGC ATAATCAGAA TATAGAGGAA TATGCTGTTT 001680
001681 GACTTCAGGG TTTCTGTTTT TCTTACTAGG ATATATAAAA CAGGGACTCT AGCTAGATTG TTTATGACCA CAGAGGGTAG 001760
001761 GCTGAGTGCT CCCATGATCT TCCTGCTTGG TTCTTGCCCA TACAGAGGTC AGCCTTTCCT CTAATAAAGA TTGAACAAGT 001840
001841 AGTGGTCTGA GGGAGACACC AATTCATTAC CCTACATGTC TCTTCTCTGC ACTCCAGGGC TTTGATAATA AAGACACTGG 001920
001921 CAGACTATCT ATCTTCCATT TCTATAATGT GAGCCCTTAG GGAGTCTTCG TTCACTTGGG GGTGAGGGTC ATTGCTCACA 002000
002001 GAGTAGTTCA AGTCAAATGG AACTTGAACT CTTTGCCTAT GGGCCTGGTG GTCAGACTCT GTGTTGAGTT CATTAGATTA 002080
002081 TTGGAGACAC AAGGTAGAGC TGGATGCTTC AAAAATATTT GGCTAAAGGA TGACATTGCT GGTTATTTGT AGATAAAGCC 002160
002161 ATGATGGAAC CTGCTTGGAA TCATGAAATA TGGAACTGGT GGTCATGTTT AAAAATACAA CTAATAGTTA AGTACCTACT 002240
002241 GGACACTGTA GAGACTTAGG GGCTAGACAG ACATGGTTCC TGCCCCCTTG GAGCTTACAC TGTAGCTTCC CCTTAGGTAT 002320
002321 GAAGAACAGT GGCTACAACT AACAAATGGC CACAAAGATA TAATTGAGCC AGTGTTCCAA TTATTAGGGT AATTCCTATT 002400
002401 TCCTTAATCA TTCCTATTGA CCATGTTCTA TAAGCCTGCA TTCTATAAAT GGCGTATGAC CATGGGCTGT TTCCCCCCAG 002480
002481 CAAGTTGTAC AAAGTTCTGT GGTACCAGGG AAAGGGCTTA AGGTTAGCAG GGCCTCTGCG GAAGGACATA TGAAGTGACT 002560
002561 TGGGTTAGGA AACAGGAAGG GATAGGATTC AAGAACAGCT ATTGCTTCTG TTCTATATAG GAAACTGCAG CGTGAAAAAT 002640
002641 GCTGGGCTGG GAATTTTGAG ACCTGGGTTT TAGTTTGTGT TCTAATACTA ACAAGCTATG TGACTGTGGG TAAGTCATTT 002720
002721 CACATTCCAT TTGGATGCCT CTTGAGTGAC TCCAGGCCTC TCCAGCTCTA AAACATTAAG ATCAGGCCCT ACGCTACAGC 002800
002801 TGGCCAGTGT GTAATTCTTC TGTTTCTATG CTGTTAGGTC AAATAGATCT TCAATAGTTA CTTGATTGTT ATTACTTTTT 002880
002881 TCTGAAGTGG GTGTTTTATC AATGTTTTAG GATACAGTGA GTCTGCTTCT CCCCTTTGGA GTTAGGAAGG TTGTAGGAAT 002960
002961 ATACACTGTA GAGCATATGG GAGCTTTATC CCCTCCTTTT TTCCCCGCTA CCTTTCTCCC TCTCCTTCCC TTCATCACAT 003040
003041 TTCTATTGAG CATATGCCAT TGCCGTGTAC CAGGTGTGTG CTAGGTTTGG AGATACAAGG TAAACCTGAG ACTTTCCCAG 003120
003121 TCTCCAGGAG ACAAAACCCT AATTTCTTTC ATCTGCTGTC TTTCTCTTTG GAAAGAATCA ACGATATCCC AGGGGAATGT 003200
003201 GCCCATGTCC CAGGGTAACC AACTACAGAC AGATGCCCCA TTCTACTCAA GCAACTTTTA GAGTGCCTTG AGATACACAT 003280
003281 CAGATAATTA TGCAGGGCCA GGCATGGTGC TGCACACTTG TAATCACAGC ACTTTGGGAG GCCGAGGCAG CAGATTGCTT 003360
003361 GAGCCCAGGA ATTCAAGTGT AGCCTAGGGA ACATGGCAAA ACCCCAGCTC TACAAAAAAA TACACAAATT AGCTGCGTGT 003440
003441 GGTGGCCTAT GACAGGAGGC TGAGATGGGA GGATTGCTTG AGCCTGTGAG GTCGAGGCTG CAGTGAGCCG AGATCATGCC 003520
003521 ATTACTCCAG CCTGGGTGAC ATAGGGAGAC CCTGTCTTGA AAAAAAAAAA AAAAAGATAA TTATTCAGCC CTAGAGTCAT 003600
003601 TGTGAAAAGA TCTATCTTCA GATATAAGGA AGAAACAATC TTTTATTTCT TAGGATAAAT CTGTAGAAGG ACCTCCAGAC 003680
003681 AGTGAAGGCC ACTGACTACT TTATACTCTG TAAGCCATCC CTCCCTGGTA GGAAGGACTA TTTCCAATCT TACAGAGTAC 003760
003761 CTCTCAGCAA ATAGACGTTT TCACATATAC TGTGATTCAT ACATCCCTAT GGCTGGTGAC CTCTTTAAAA AGGAAAGGAA 003840
003841 AAAGCCTAAT CAAACAAAAA GATGCTGCTA GTAATTCTTA CCCTATTGTG AATCCTATAT AAGCAAATTT GTATCTTTGT 003920
003921 TTTTTCCTAC ATTAGCAGAT CTATTTGATG TATATCTCTG AGTGCAGAAA ATATTTTATG GAAAAATCAA TATATGGAAT 004000
004001 TTCAAATTCA GAATTGCTGA TACACACTAT TTGGTTTCAC AATTTTATCC TAGGAAATAG TATTAGAGAT TTCAATTTCT 004080
004081 GGCTTAAATG GTAGAATTAA TTACTCTTAA CTCTTAATTT TACTTCTGAG TTGAGGTCAA GGAACAGGCA GACACCTGCA 004160
004161 GTTAACGTCT ATACCTCTCC ATGGCCAAGA GTTTTAATTT TCTCGTCTTC AATTTTGTAG ATGTTCATCA TTACTAAATG 004240
004241 GATTGATTAG TATTTTATCT CCTCTCCTTG TCCTTACTTT CCCTCTGGTA AATATGTTAT AAACAGTGTA AGGCTCCTAA 004320
004321 GATAGAGTAG CTGGTAGGAC TTAGAAGAGA AACAAAGGGC ACTGATAACT CACATAAATG GAAAATTGGC TCTGGAATAA 004400
004401 CTGACAACAT ATTCAAGTAT TTTAGTGCAG TGTCACTCTC ATTAAGAAGA AGAGAATCAG TAAATCTATG TGACTCTAAA 004480
004481 CATTCTAATG AAAAAAGGAA TATTCTGCCA ATTATCTCAC ATTTCTAAAT ATCTGGATAT TGGCCATTGT AAAGACAAAA 004560
004561 CATACAGATG ATGGACTTGT CTTTCCACCT CTCATTTGCA TGGTTTGGAG CATTGTACCT CCAGCCATAG ACTCTAAGGC 004640
004641 AATTTATATT TGCTTCCTCT TCCCTCTTGA GAGAAAACGA AAATCTTATT TTTCCAAGCA ATTAAAACTC TTCTGCTTCA 004720
004721 GCTAGGATGA AAGAATTAGG AGTTCTGTCT CCTTGTATCT AATTGCATGT TTCATCTTTC TTGTTTTAAT GATTGACAGA 004800
004801 AAACTAATAA ACTGAGACAT CTTTGAATCC AGGTTGAATG TACTCTCTTG GTGGCCCCAT TGCTAATTTG TTTGACTATT 004880
004881 TTGCAGGATT TCTTACTCTG TAATGGAAAG GTTTATTAAA TATGAGGGGT GCAAAGCTTT CTGAATACTA ATGAACTTAT 004960
004961 TTGCCAAAAT TTAAATGTTC TTCTTGTCAG TGAATGTCTG TCTCTCTTAA CAGGATCCAA ATTGAATAAT GAAGAAAATT 005040
005041 AGACTCTATT GTACCCTCAG GAGAAATCGT GGTGAATCGT AATAGAATAC AGAGGGGGAA AGGGGT
[back to top]

Predicted Small Protein

Name NONHSAT122124_smProtein_125:322
Length 66
Molecular weight 6947.9041
Aromaticity 0.0769230769231
Instability index 61.6646153846
Isoelectric point 5.83160400391
Runs 13
Runs residual 0.0497115384615
Runs probability 0.042796189855
Amino acid sequence MHHHAWLIFVFLVETGFRHIGQGGLELLTSSDPPALASQSAGIISMSHCTQPYTELSMGS
IPLGA
Secondary structure LLLEEEEEEEEEELLLLEEELLLLEEEELLLLLHHHLLLLLLEEEELLLLLLLEEEELLL
LLLLL
PRMN -
PiMo -