NONHSAT130074

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT130074

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4843 nt

Genomic location

chr9+:4876530..4890193

Exon number

7

Exons

4876530..4877092,4879020..4879993,4880015..4880954,4882186..4882687,4883205..4883965,4886064..4886806,4889834..4890193

Genome context

Sequence
000001 GATAGAAATG ATAGCAAATA TATTTTGTAA ATCAATTAAT GAATTGCAAA CATGAAAACA CAACAGGGAA CTGTCTTGCT 000080
000081 GGACCAGAAA TAGCAAAGCA TTCCTAAGAG AAGGATGCAG ACAGGAAACC CAAAGTAAAT CAGTCTTGAA ACCATAAGCC 000160
000161 AGGCTGCTCC TCTGTCAAGA AACTGTCTAA GCCCAAGACT TACTCAGAAG AGTACCAATC AAGAATTACC AAACATCTGA 000240
000241 GACAATAATA CCGTGAAAGA AAGAAAAACA ACTCAGCAAA TGAAAGTCCT TCCTAAGGAG GCAGTGTAGA TGGGAGGTAG 000320
000321 AGGGCTGAGC CATACCCTCC CTTAAAGAAT CCCATGCGAC CTGGGCATGC CCACTTCCCA GTAGACTGGC TCTTCCCAGC 000400
000401 ATGGTAACCA GTGGTGGTTG TGCTGAGCAA ATGCTTCTGT GATTGGGGAG GTGCTGGCAG GGATGTAGGC ACACAAAGAT 000480
000481 GCTGTGGATA GTCTGGATTT CTTGCTAACT TGGCCTGGGG TTTTGGAGCC TTGGGTCCAG GATGCTCCTG GAAGAAGGGT 000560
000561 CAGGTGGTGT GATCCTATCC CATAGCTCCT TAATCTTCCC TATATCCTAG TACCCAGATT TTAAAAAATA GGGCTCAGGC 000640
000641 TGGGCTCTGA GGTGAGGGAA GAGGAAAGTT GGGGCTAGCC AGGGCCCTGA GGCATGGGCG GGGCGTGGTG TGACATGGTA 000720
000721 AGGGGTGGGA GTGGGCAGGA GAGACTGCCT GGGTGAGGCT TGCCTCAGAT GGAGAAGCCT GACCTGGATC CACTTGCTGC 000800
000801 CTGTAGGAGC TCAGTCGCCA AAACCTGTCT GGCCTAAAGA CAGGCACAGT TTGATTTAAT AGTTGGTACT GACCTCACTT 000880
000881 GAACATTAAA TCAATAAATG CCAACTTAAT GGAGCTGTTG CTGGGAAACT AAGGAGTATC AAATATCCAA TATCTGCACA 000960
000961 GAGGAGATTG TTTTGATCTG TTTTGTTTTT CTAAAGCTGC AACTTGCTCC TTGGTGCACT TTAGGTTATT CCTTTTTGGT 001040
001041 AGGAAAGTGG CTCCCACACT CTCCTCTCCA CCCACCCTTT GAAGAGGCTT GGAAATGCAC TCAGCAGCTG CCCCCATGAT 001120
001121 AAGCCGAAGG GCCTGTCCTG TCCAGGTTGG CAGGTCCTCC CAGCAGGCAG AGTGAGGAGT TCATGAATGA GGGAGGAGAC 001200
001201 TATGGTGTCT CAGAAGGAAA ACTCTAGAGG AAAAACACTT GAGTAACAAT GCCCAGTTCA ACCTCTGGAT CTCTCTTGCA 001280
001281 GCTTGCTGAG GCTCTATTTC CAGCTGAACT GTGTAGCTCT CAGCGTGGGT GCCTTTTATC CATCACTGGT GTCATGCCAG 001360
001361 GTCATCTTCA TCTTTTCTTA GCTAACTGAG GAGGACAGTC ATCCAAGCTG GGGTCTCCTC TCCTGTGTGT CTCTCCTAGG 001440
001441 GATGGAGTTG CCAGGCTTTG TGCTTGTTTT ACTTCTTCTC CAGCCTTTCC TTCCCCTCCC TTTCCTGTTC CCTAGGTTGG 001520
001521 TCATAGCAGA GGTTGTGTGG GAAAAAGAGG ACAATCCCCA TGTTCTTCAA CACGGCTGCC GCAAACACTA CCCAGAAACA 001600
001601 GGCTGGGAAT GGGGACTTGA ATGCCAGCTG TGGGGTGCCT GCCTTGCAGC CAGCTCTGCC AAAGCTTGTG GTTATGATTG 001680
001681 TCTCCTTTCT TCTGCTTCAC AGCCAGTCAT AAATGCCATC TTCCTTGGAT GTCTGAGTCC CTTGGAAACT GGGCTCTGAA 001760
001761 CTCGATTCCA CTGGGGTTAA CCATGGGCTA TGCTATCAGA ATTTCCACCT AGGCCTCCAG TTTCCTTAGA GCAAAAGGAA 001840
001841 AACAAATTGT GAAAGGGGAG TAGCAGAGTG GGAAGGTGGT CTAAGGACAG CTGTTCTGAC CTGAGTAATA TTACATCCCT 001920
001921 GATGGGAGGC AGCAAGTGCA GCACCTTCCA CACCTGAAAC CATCACTTAA ATAGCGCTTC ATGGTTTGGG GACTGTGACA 002000
002001 GTTTCCTAGG GGGCTATAAC AGAGTACTGT AAACTGGGTG GCTTAAAACA ACAGAAATTT ATCATCTCAC AGTTCTGGAG 002080
002081 GCCAGAAGTC CAAAATTAAG TTGCTGGCAG TTGGTTCCTT CTGAGAGTTC TGAGGGAGAA TCTTTTCCTT GCCTCTCTCT 002160
002161 TAGCTTCTGG TAATCCCAGA CATTCCTTGG CTTATTGATG CTTATAGATG TCCTTTGAAT AGTAGACGCA ATCCTCCATT 002240
002241 TTCATATGGT GTTCTCCTTG GGTCTCTTTC TTCATGTCTG TCTTCTGTTA AAGATACCAA TCATATTGAA TTAGAAGTCC 002320
002321 ACCTACTCCA GTATAACCTC ATCTTATCTA ATTAAATCTT CAACAACCCT ATTTCTAAAT AATGTTACAT TCTGAGGTTC 002400
002401 TGGGATTAGG ACTTCAACAT ATCTTTTTTT AGGGGATATA TAGTTCAACC CATAATGGAG ATCAAAGGCT GAAGGATTTT 002480
002481 ATCTTAAGGA TACAGGAGTA TCTTGTGGAA TACGAGGGAA GGCATACAGC TGGGCCTTTG GAGAGAACTT GGAAGCTATC 002560
002561 AGGACCCTGG CAGCTTCTTT CTGCCCAGAT GCTTCATTCT CTCTCTCTGC AATCCTCCTT TTTCTGCTTC TTAGTGTACC 002640
002641 CAGTGGGAAG AAAGTGGTGT CTCTACAACC TCCCGGTTTT GATCTTCTTC ATTCAAGGGA CCAGCCTAAT TTAAATGTGC 002720
002721 AAGGAAAGAA AGCTGATTGA TCCAGCTTGA GTCAGATTTC CACCCCCTAA TCAATTGTGT TCAGTGTGTG TGGGGGAAGG 002800
002801 GGGCATACAC CAGAAACAAC ATACCCAGAG GAGAAGGATC CAGTATGTGT CAGTTTTAAT CTCCTGCAAG TGTAAAAGCT 002880
002881 CCCTAGAATC ATCCTAAGGA TCATTTTATC CTAAGACCTT CATTTTGCAG ATAAATGGAA GTCTCAAAAA GTTAAAGAGT 002960
002961 TTGAGATCTC ACCCATACAG ACTGTGTAGT ATCCCTGTGT TGGGCACCAT TCTTGCACAT GGTAGGTGCT TTGTGAATAT 003040
003041 TTGTCAGGAA AACCAAAAAG AATGAAACAT AAGAGAACAA CAAGAAAAAG AATCTTTACC TTTACCCTTT CTCTTGGAGG 003120
003121 AATATTTGAC TTCTTATTCT TGTGAAATCT GGGAGATTAG GAAAAAAGCT TCTGAAATGA AGGTTATTTC AGAAGTAATT 003200
003201 TTTATGGGAG AGCTTCCAGA ATTCTTAGCC TCAGTAATGA TAATGGCTAA CATTTTGGAG CAGCTACCAT CTGCCAGGCT 003280
003281 TGGCAGTGCT AAGTGCATTG TCCCTTTTAA TCCTCGCATG AACTCTTTGG AATAGGTACT ATAGGTGCTG AGCCTGGCTC 003360
003361 TTCAGAACAA TAAGCATCCT GACAAATGAC TGGTCCCTTG AAGACAGTAC CAGGTCTTAT TCTTTGCACT TGGCCGGAGA 003440
003441 CTAGCCAGAT GCTGAGGTGT GCTCAGTAAA TGGGGATCAT GGCTGAGAGG CCCTTGCAGG AAGCTAGAAC CTCACACACT 003520
003521 TCCTCTGGTT CCAGAGCAGG GGTGTGTAGA ATCTGTAGGG AGACACTGGT GGGTGTTTTC ACAGGAGACT GTAGTGCCCA 003600
003601 AGCAGCTTTA AAAGGGCTAA GGGAATTTTC TGGGATGCTC TACGCTTATG TGGGAGCTGA CTTTGCTTTG GGCCAAAAAT 003680
003681 CTGTGGGTGG TGTTCAGATC TCCTTCTATG TTCTTGTAAA TAAAAAACCT TTGAAATGCA GCAAATGCTC ACGTAAATAT 003760
003761 ATCCCAGTCA AGATACAGAA CATTTCCATC ATGCTAGCAA GTTCCTTCCT GCCTCTTTCT GGTTAGTCTC TCCTGTCTCC 003840
003841 TGCTCAGAGG CGATGTTCTG ATTCTACCAA TTTGTTTTGT CTGATGATCA CGAGTCAGTA TTGCCTCTTC TAGGACTTTA 003920
003921 AATAAATAAA ACCATACAAT ATATCACCTT TTGTGTCTGA CTCCTTTTGC TCAAACTACT AAGTCAGCCA TGCTATTACA 004000
004001 TGTATCAATA GTTCAGTCCT TAGCCGGGCA TGGTGGTGCA CACCTGTGGT CCCAGCTACT TGGGAGGCTG AGGCAGGAGT 004080
004081 ATCACTTGAA CCCAGGAGGC GGAGGCTGCA GTGAGCTGAG ATGGGGCCAC TGCACTCCAG CCTGGGTGAT AGAGTGAGAC 004160
004161 TTGGTCTCAT GAAAAAAAAA AAAATCAGTC CTTTTTTTAA TGCTGAGTTG TATTCCATTG TATGAATATA TCATATCCTC 004240
004241 TCACCAGTTG ATGAATGTTC ATATCATTTC CAGTTTGGAG CTATTATAAA AATAACTGCT TAAGTCTGTA TTTTTTAAAC 004320
004321 AAACATCCAG CTAATTCTGA TGCAGGTGGA CCACATGTTG AGAAACACTG CTCTAACACA GAGAATTGAA GCTTATTGCC 004400
004401 CAAAGCCACA TAACCAAGAA GGCTTTGACG CCAGAGCGTG GATTCAAGAG CCCATGCTTT TCCTACTTGG ACAGTAGATG 004480
004481 TCAAGGAAGG TGGTCTGATT GAGCAACTTT ACTCTCCACA TGGATGTCTA ACGGCTATTT AATCTTCCTA GAGATGCTTC 004560
004561 CAGTTTTCCA AATTAGTTTA GATTACATGA ATCTCTAATT AGTGGAATTT ACTGTAACAA GATAAAATGG AAAAAAAGGT 004640
004641 GATGAGAGAT ATTTTGGGGC AAGTACTCTC ACTGATTCTG ACTTTTTTTC AGATTCTCCC CCTAGGATTA ATAATCTAGA 004720
004721 TTATTCTGGA TTTTTATAAC TTTTTGGGTC TTTTAAAGAT CCCTCCCTGC TTCTTTTTTT CCCACAACTA CCGCCATAAG 004800
004801 CCTGGTTGAG AAGGGGCTAT CTGAGTGAGC ACAGCCATAG GCG
[back to top]

Predicted Small Protein

Name NONHSAT130074_smProtein_3989:4156
Length 56
Molecular weight 6029.8513
Aromaticity 0.0727272727273
Instability index 71.7054545455
Isoelectric point 6.95587158203
Runs 9
Runs residual 0.0121212121212
Runs probability 0.0233272488175
Amino acid sequence MLLHVSIVQSLAGHGGAHLWSQLLGRLRQEYHLNPGGGGCSELRWGHCTPAWVIE
Secondary structure LEEEEEHHHHHHLLLHHHHHHHHHHHHHHHEEELLLLLLLLLLLLLLLLLLEEEL
PRMN -
PiMo -