NONHSAT104514

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT104514

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

3120 nt

Genomic location

chr5+:148931009..148946472

Exon number

2

Exons

148931009..148931686,148944031..148946472

Genome context

Sequence
000001 ATCTCGGGCG CTCGCTCCGA AGGATCGGCA GCGAGAGGGC GGAACCGGAT CCGTGAGAGT CGCGGCCCAC GCTCCGAGCG 000080
000081 TCACCCTGTC ACCGCCCAGC TGCGCATGAG CGCTGCAACT CCTGCTGACT GCGCAGGCGT GCCGCGGCCT GTCCTCACCC 000160
000161 CCGTCCCGGG CTCTCTTCCA CTTCCGGTCG GCGGCGGCTC AGCAGGTGTT ACCTGTTACT CGCCTCGGAC CCGCCGGGTC 000240
000241 TCTGGAAGGC TGAGTGGCCC CCCGGCCTTG ATATTGGGAG GAGTCACCTT CATGTTTCGT GGCAGATGAA AGCTTAAGGC 000320
000321 CTCCTTAGGA GTTGGGTATT GACCTGTGCT CTTTGAGGCG GTTTTTAACC TTTGATCTTT CTTCTGGCCG TTCCCCCGGA 000400
000401 CCGGCCTATT CCAACCCAGT TTCTGTTTGG AAAGAGGCTG TCGCCTCTGG TGGAGAGGAA GAGAAAAGCA GCCAGCGTCG 000480
000481 AGGAATCTCT CAGGGCTCTC GGTTGTAGCA GAGCCTAGCT AATTACCCTT CATAATTAGA ATTGTCCTAT CTACTACGGA 000560
000561 GTGTTATAGA CCCTCTTTCC ACAGGAGGGG AAGAAAGAAA GAAGCCGGAA TCCCCCTGCC ATCCTATGTC TTTGCATTAT 000640
000641 TGCCTCACTA CCACATATGA TCTTCAGAAG AAAACCTGGA CTACAAAACA CCCTGGGAAG GAAACTGCTC ATCTCTTTCA 000720
000721 AGATTAAGGG AAGATATAAA AACAGACATC CTCggccggg cgcggtggct cccgcctgta attccagcac tttgggaggc 000800
000801 caaggcaggt ggatcacctg aggccaggag ttcgaggcca gcctgaccaa catggagaaa ccccgtctct actaaaaata 000880
000881 caaagttagc tgggcgtggt ggtgcatgcc tgtaatccca gctactcaag aggctgaagc aggagaatcg cctgaaccca 000960
000961 ggaggcggag gttgccgtga accgagatcg cgccattgca ctccagcctg ggcaacaaga gtgaaactct ctcaaaaaaa 001040
001041 caaaaCAACT CCCAGACATC CTCTTAGTAG ATCCCATGGT ATGGCACAGC AAGGTATGTT AATGAACCCA TGGCACATCG 001120
001121 TGATCTTAAT GATTATACAA TGCTGCCATC TTGAGGATAT GATCAGTAAT TGTTATTCTA GCCAGCCTTC TGTGGCCTAG 001200
001201 CATCAGGCAC AGCCAGAAAG CTGAATCTGG ACAGTCAGAT GTTAAATACC CAGCAAGGAG CTACAGTCAA GGAGGCCAAA 001280
001281 ATAACATTTG AGGAGCCAGG CAAGTGTTAG GAGATTGAGC AAACCTGTGG TGCTTTCAGA ACAAGGTAAA TGAAAAGGAC 001360
001361 CAAATTAGGG CACATCAATT CATGTATGCA GGATAAGAAA AACGGGGcct tttatttact cccaaagcac cctgttcttc 001440
001441 cttctagtat agtgccatct cactgtatta taattgctat aaactggttt ctcccagtgg accgatgagc catgacagca 001520
001521 gagaataagt cctgttatag ttatatctgc attgtgtcct cagcattagc acactgggta ataagtattt agtaaatatt 001600
001601 ttgtggctta atgagtgaCA GGCATAGGCA GCAGGGGACA AAGTCAAAAT AATGGCAGAG CCAGTTTTCC AGATTTCCAG 001680
001681 ATCAGTGCTA CTATCTACTA TACAACATAT atttatgaat gggcaaaagc atactggcta ccaagagaca ctgaaatata 001760
001761 cctagccttt taggtggcca ccagcaatga tagccaTGAG GCATGTCTCT AAGGCTGTAA GCACTGGGCT tatttattga 001840
001841 ataccacttt ggaccagtca ctgcactaag tgccaggact gcaatgaaga acaaaaaaga tagccccagc ccttatgagg 001920
001921 tttacagcct agATCTAGAC TATACTACTG ATCAGACAGG GCCTTTCATG GTCATCATGA CTTGGTGAGG ACATGTGATT 002000
002001 CCACAATCTG GTCTTAGCTT TGTAATGTCA GAAACTCTAA tttttttttt tttttttttt tttttttttt tttttgtgag 002080
002081 acagagtttc cctcttgttg cccaggctag agtgcaatgg tgcgatctct gctcactgca acctccgcct cccaggtaca 002160
002161 agcaattcac ctgtctcagc ctgccaagta gcttggatta caggcatgtg ccaccacgcc tggctttttt atatttagta 002240
002241 gagacaatgt ttcaccatgt tagttaggct catcacgaac tcctgacctc aggtgatcca cccgccttgg ccgcccaaag 002320
002321 tgctgggatt acaggcctgc accaccgtgc ctggccCAGA AACTCTATTT TATCTATGGC TTTGGAAGAA GTTTAGACTC 002400
002401 ATTGAACCTT CAAAAACCCT ATGTGggaca ggcgaggtgg ttcacacctg taatcccagc actttgggag gctgaggtgg 002480
002481 gtggatcacc tgaggtcagg ggttcaagac cagcctggcc aacatgatga aaccctgtct ctactaaaaa tacaaaaatt 002560
002561 agctggtcat ggtagcaggc acctgtaatc ccagctactc gggaggctga ggcaggagaa ttcacttgag cctgggaggc 002640
002641 agaagttgca gtgagccaag attgtgccac tgaactccag cctgggtgac agagcgagac tgtctcaaCC CCCCCATCCT 002720
002721 CCCCCCCAAA AAAACCAACC Aaacaacaac aacaacaaca acaaAAAACC TTATGTTGGG GGTATCGTTA TCACCCCAGa 002800
002801 tttttatacc cagctcacac aaatgaggaa actaagtatt aagacaggct agttaacttg tccaaagtta caaaagatag 002880
002881 taagtggagg agccaggaca tgaaatttca agttcactgt ccttccagtg tgtgacaAGG ACCAGTTTAT TCACTTGTTA 002960
002961 ATGGTTATAG CTTTCTCTGT CAATCCAACA GAACCGTTAT CTCACTGAGA ACTACATGTC ATTCTACTCC CCACTTTCAA 003040
003041 TTTTATATGT ATATATGTGT GTACATATAT CTCTTCTAAT TATATATATA ATCATGTCAC TTATTCATta aatacttcag 003120
[back to top]

Predicted Small Protein

Name NONHSAT104514_smProtein_2024:2266
Length 81
Molecular weight 9665.2677
Aromaticity 0.2375
Instability index 58.11875
Isoelectric point 8.61224365234
Runs 10
Runs residual 0.0138157894737
Runs probability 0.064122946476
Amino acid sequence MSETLIFFFFFFFFFFFVRQSFPLVAQARVQWCDLCSLQPPPPRYKQFTCLSLPSSLDYR
HVPPRLAFLYLVETMFHHVS
Secondary structure LLLHHHHHHHHHHHEEEEELLLHHHHHHHHHHHHHHHLLLLLLLLLEEEEELLLLLLLLL
LLLHHHHHHHHHHHHHHHLL
PRMN LLLLHHHHHHHHHHHHHHHHHHLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLL
LLLLLLLLLLLLLLLLLLLL
PiMo ooooTTTTTTTTTTTTTTTTTTiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiii