NONHSAT097492

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT097492

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

5648 nt

Genomic location

chr4+:99580055..99585702

Exon number

1

Exons

99580055..99585702

Genome context

Sequence
000001 AGGCGAGAGG GAGAAGGAAG GGCGCCCGGG GAGCAGGCTG GCAGAGTGAC CGGGATTGAT GAGCGGCGCA GCTCTGGAGG 000080
000081 GACGCGGGCG AGAGGGTCGC TCCTCGGGGA GAGGGCTGGG TGAGTGAGGA CCTGCGAGCC ACCCCCGACA CCCCTTCCAG 000160
000161 GGTAGCCACA CTGGCGCCCC CTCCTCCACA ACAGGCTTAT GTGGCTAGAG AAGATGCGAT GCCACTGGAC AGGGAAGGAA 000240
000241 AGTCGTCGTG GACTTCGCTG AATGGAAAAG GGGGTAGTTG GGGAAGGTGG AGAAAAGAGC CACGGGAAGT GGGAGGAAAG 000320
000321 GATGAAGGGA TGGGACGTTG AAAAGGAAAG CTTCCTGAAT CCCTGTGTGA CTCTAACCCA ACCTCCCCAC CTACCGGAGA 000400
000401 AATGGGTTTT ATCCTAGAAA TGGGGACGAG GAAAAAGAGA TGGCTCAAGG GAGTCCTGGC CCGGGATGCG ATCCTCGCCC 000480
000481 TAGGGTCTAT TGCAACCACA CAAACCCGGA CAGCGCACGC TCCCCCCTAC CCCGACAGAA TAGTGCCTAA GAAACCTGCT 000560
000561 TGCCTCCTAA CTCTAGGAAG GAATCCACAA CGGATTCCAG GGAGGTCTTC TGGGGAATCT GAATTGGGTT TTTGGATCAC 000640
000641 GAACCATTAA GCTTTTGAAC TCTGGGGAAG AATAGAACAG GTGGCTGGAA ATTGTTTTAG AACCCTACCT GAGCAGGTGT 000720
000721 GCTCATTAGT TGCTTATCCG CCCTCCTCCT GGAGCACTTT CATAGGTGTT GGCAAAGCAG GTACCAGTAT TTCACGGAAG 000800
000801 GAGAAGGAAG AACCGCCCAG GACTGTGTAT AAATGTCTAT GGGAATATTT TTAAGGATTC AAAAGGCTAG AAGGAGGTGA 000880
000881 GATCAAAAAA TTTTTCCAGC ACCACCCAGT GCAACCCGAT CCTGGACACA GAAATCTAAA AGATGATACT TAAATACTTG 000960
000961 AGTTTGAGAG CCTCTAGTTA TAAGGTTTCC TACGTAACAA CCTCTCACTT TTCTCAAAAA CTGGCTAGTT GACAACCGGC 001040
001041 AGTCAAGTTT TGGCTAGTCA AGAAGCTACT TTTTCCTGCA ATAAACCACA TTTAGAGATA GTACTGGGCT TCAGCAGGAT 001120
001121 TGAGGCTATA TTGAGGTCGC AAGTATGAAC GAGGTACCCA CAGAGATGGG AGGGCCAGGA GATCCTACTC ATTCAGTACC 001200
001201 TCAGAAGCTA TGACACGGGA TAACAGTAGC TTCAGCCCTT TGGATGAAAT AATCCTCACT CCCTCAAAAG TGCTCTCTAC 001280
001281 CCACTCAAAA CACAGGCCAT ACTACCACCA ATCTGGTAGA CTTCAAAAGT AAGGTCTTTC AGCAGCAGAT TTCCTTGTTA 001360
001361 ACAGGAAGGT ACTGACAAGC TTTACCTAAG CCTATTGGAG CTATGTTGTA ATTCTGTATA TAAATAGAAC AGTCGCTGCA 001440
001441 GGAATACTGG ATGAGTCAGA AGAAAGGTCT GTCCTAAGCT GATGTGACAC ATTTCAAAAG GACCAGAAAT TAGATGCAAT 001520
001521 TGATGACTGA GTTGATCTCT AAGTGGATTT ATCTTTGAAA TAAAAATCAG TGGATATCCC ATATACCCGA CCCCACACCC 001600
001601 TCCCCCATTG TCAACATCCC CAACCAAAGT AGCACATTTG CTACAAATAG TGAACCTACA TTGCCAGATC ATTATCACCC 001680
001681 AAAGTCCATA GTTTACATTA GGGTTCACTC TTGGGGCTGT ATATTCTATA GGGTTGGACA AATATAGAAT AACATGAACC 001760
001761 CACCCTCATA GTATCATAGT ATCATACAGA GTATTTTCAC TGCCCTAAAA GTCCTCTGTA CACCACCTAT TTATCCCTTC 001840
001841 CTCCCCCCAG TCTCTGGCAG CCACTGAAGT TTGACTGTAT CCCTGTCTCC TTAGTTTTGC CTTTTCCGGA ATGCCATATA 001920
001921 TAAGGAATCA TACAGTATGC AGTCTTTCTT TCAATTTTGC TGTGAAACTA AAACTGCTAA GAAAATAAAG TCTTAAAAAG 002000
002001 AAAAATCAAT GGAGAAGGGA AAGGTAAGAT AACTAATCAG TAATTTGGAG GGGGGGGTGG AAAAACTAAT TGAATTATCT 002080
002081 CATTTCGTAT ATCCAAAATA AGTTCTATGA GGATTAAGGA GTTAAATATA TTTTAAAGCA GACCAGAAGG AAAAATATTT 002160
002161 GTCAAACCTC TGTATAGAAG TATTTCTCAA CTTAGAGGCA ATAGAAGGAA ATAATCAATG GATATGATTG GGTAATATCT 002240
002241 TAAACACTTC TACGTGTCCA GCTTTTTTTA GTGTAAGCTA AACCAGAAAA AAAATTGGCA GCAAACATGA AAAATAAAAA 002320
002321 CCTGAATGTC TATATTATAA GTAGGGCTCA TAGAGCTATA GTAAAATATA GAATCGCAGT GGTTGAGTAT AAACAGAAAA 002400
002401 TAAATCATAT AAGAAATAAA ATATATGGGG AAATTAATCT AGTAATAAAT GAAGTATACA TTAAAACTAA TAAAGATACA 002480
002481 AAAATAATGA GCACTGGTTA GCACATACAT TGTTGGTGTC CTTATGAAAT AATATGTCCA TTTGGAAAGC ATTTTGGCTG 002560
002561 TATATATCAG GTCAGAGTGC TCCTACATTT TGACTCAGCA GTCCCTCTTC TGGGAATCTA CCCAAAGAAA TACCCCAGCA 002640
002641 GATGGAAAAA GTTATACACT GCAAGATGTA TATCATAGCA TTATTTATAT TGAAGACTTG GCCAGAAATT AAAGATCAAA 002720
002721 GAGTAGGGGT GATATCCAAT TCCTTGTAGT TACTTGGACC ACAATGTTCC TTCACATTCT GTGCCTCTGC ACATGTCATT 002800
002801 TAGAGTTTCT GCCTGGGGCA TTCTCCTTGT CTCTTGTCTG TGAGCTGATT TCTCTTATTC TTCACTACTT TTGTTCAATG 002880
002881 TCTCGGAGGC TCCTTCTTTA TTCCCTCAAC ACCCGCTGTG TTCTTCCATC CCGGCACTTA AAGGCTGCCT GTAAGAACTT 002960
002961 GTTTACCTAC CTGTCTCCTC AATCTGCTGT GACTCCTTCT AGGGTGTGAC CTAACCCTTG GTAATCTTAG AATCACCAGA 003040
003041 GCTCTGCAGA TATGGCAGGT GCTTGATAAA TATTTATTAA AGGAATGAAT GAGTCATTTG CTCACAGTTT TTACAGCTCT 003120
003121 TGAAGATACT GCTTATAGAG TATGTGGCAA TATATGGATA CATTATAATG TTAAATTTTA AAGGCACTAT ATATGGAATT 003200
003201 ATCTCCAAGT AATATTGTTA AATTTTAAAA ACAAGGAACA AAAACAGTGT ACACAGTAGG CTATCATTTT GTGAAAAGAT 003280
003281 AAGCTCTGTT GGCAAAAGAA TATCCATACA CATATTTCCT TGTTAATGGG TAGGCTATCT ATGGAAGAAA ACGAAATAAA 003360
003361 CTGGTAACAG CGAGTTAGGG TGGAATGGGG GGCAAACTGG GGGACTAAAG ATACTGATAG GAAGGAGACA TGTTTTATTT 003440
003441 TTAAATAACC CCTTTAGTCT GTTTTTGAAA TTTGTACCAT GTATATGATA CACGTTTTGA AAAATAAAGC AGTCTACAAA 003520
003521 ACGGTATGCA GTTGTATGAA AAAAAGTTAG GAGTTAATTT TTGCCAAATA AGAACAGTTA TAAGGCTGGG TGATTTTGTT 003600
003601 TTCATGTTTC TGGTTTGCAA TTTTTAAGTG TGGTTAAATT GCTTTTACAA TTAAGAAGAT AAAATTATAC ATAAAGATGT 003680
003681 AAGAAAGTAA CTAAGTAATC ATATTTATTT CACGCAACTC CTTCACAACT CAATAAATGC TGATCTATTT TAAGGAGAAA 003760
003761 AGGAGAAAAA TTTTCTGTCT GAATACTGCT AAGGCCACCT CCCTTATAAC TCTGAGGTTG AAAAATGGTT AATAAGTTTA 003840
003841 ACTACTGACA AGTTAATCAG CTCTTCAAGT GGCAGTGAAA ACAGGTAAAG CGTGTTCTTC TAATAGTTGC CTGGCTGAGG 003920
003921 AGCAAGATAT GACAACAGCG TTTTAACCTA AGACACCAAA CTTAACGAGC TTGTCTGTCT CCACCTGTGC TTTATTGTTC 004000
004001 CTGCCCACCT TCCTGCCTCC CTCCTCTTGG GCTCTGTCAT TGTCTGCCTC TCTGCTCCTC TCTGTTTTTC TTTTGCTTCT 004080
004081 CTGTAGCCGC CCTGCTGGTT GCCTCCACAC CCCTCCTGCC GCCTTCCTCT CAGAACCCTA GCAGCTGTGT TCAGCGTCCC 004160
004161 CCTCTTAAAG CAGCCACATG CTCTGGGAAG ACTGGCCCTC TGCGTCACAT CTGGCAGTCC CATTCCTCTT GCTAATGTGT 004240
004241 GGTTCAGGAA TGGGCAGGTG GCCCAATTCT GGTCAACAAG CCATGAGAAG AATCTGCTAG AGGTTTCCTT GCATCTAAGA 004320
004321 AATGGTCTCT TTCGCCCTTG GACTTGGTTG GGTTTGGATG TGATGCCTCG ATCCTCTACA GCCTGATGAC AAAGACCACA 004400
004401 CCAAGGATGG CAGAGTGGAA GGATAAAAAG AACCAGGGCC TTCCAGCCTT TGTTGTATGG CTGACTTAAC CAACCTTGAA 004480
004481 GGTGGCCTTA CTCACGACTT CCTGTCAGAT GAGACAAATT TCTTGGTTAA GATACTTTGA CAGGGGTTTT CCGTTTCATA 004560
004561 CAACTGAAAG TATCCTGACT TAAACACCTT TTGAGGTTCT GCCCCAAGCC TTTATTGATT TTCTCCTCTT GAATCACCAC 004640
004641 CAGCATTTCT GCATTTCTCT TAGTACCACG TTTTTTCTAT TCCCCACACC CACCACCCCC CCTCTTCCCT AGCTGTTTCT 004720
004721 GTCTCAAACA ATACAGGGAA ACATTTTAGG GTGTCTGACA GATCACACCC ACCCCTTTCT CAGACCTTCT CAACCACAAG 004800
004801 GTGCTCTCCC ACCTCCAGCG AGGTTTGTAC TTCACAGTTG ATGGGGTCCC GGTAGATTCA GGCTGGGCCA ACCTGAGAGT 004880
004881 CTGTCTCAGT AACAAAGGGA ACCCTGCGCC CAGCTGAAGT GCAGGCAATG TTTACCCGAA AGCTGGGGGT GGGTCCGCAT 004960
004961 TTCCAGGAGG AAGCCAAGAC TCTCCCTCAT CCTCTGCAGA AAGAGGAAGG AAGCCTAAAC AGAGAAAAAC AGTAGCAGAG 005040
005041 TCTGACAGCC TGGGTGCTGT GTATGGTTGT GCATTTTGTG CTCTGAAAGG CTCCCATCCC TGCTAAAGGC CTGCTTCTAG 005120
005121 ATTTAAAGAG GACTCTGTCT TTGAGGTCTA TGAGATTTTT AGATGGAATC ATTCATCACA TCCCATTTGG GCTTTAGATA 005200
005201 CTTTGAAGTG AGTATCTCAA ATTTTCAAAT CAAAGAACCC TGATTAAAAC AAAAGGTCTT CTCACCTCAA ATCCTTTATG 005280
005281 AAATGAGGTC AACTTAAAAT ACTTTTTAGT TAGCTATGAT ATAAAATTAA ACTATATTTT TGTTATAGTT AATGAATTAA 005360
005361 TATTGTATCT TCTGTTCCTT AGTCGAAAAG TGAGCAGTTA AATGGTCTAT TTCTCTAACA AAACAATTAA CTGATTCCTC 005440
005441 AGGATTTACC GCCCACTGTG ACCCTCTTGC CCTGTGCCCA CACATACCTA CTAAAGACCA GTTTTCAGAT CCCCAGGAAT 005520
005521 AACATAAAAT GGGGCATATC TGTCTCGTAA ATATGATTGA ATAAGCATAT GTAGAGCACA GTTGTTTCAC TTTTAAGAAT 005600
005601 CAATTATAAT CATTTGTACA ATAAAATGAA TTATGTCTTT CACACAAT
[back to top]

Predicted Small Protein

Name NONHSAT097492_smProtein_401:649
Length 83
Molecular weight 9065.5215
Aromaticity 0.0609756097561
Instability index 49.5975609756
Isoelectric point 10.9357299805
Runs 9
Runs residual 0.0261456023651
Runs probability 0.0183222438125
Amino acid sequence MGFILEMGTRKKRWLKGVLARDAILALGSIATTQTRTAHAPPYPDRIVPKKPACLLTLGR
NPQRIPGRSSGESELGFWITNH
Secondary structure LLEEEELLLLLHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLEEELLLL
LLLLLLLLLLLLLLLLEEEELL
PRMN -
PiMo -