NONHSAT101661

From LncRNAWiki
Jump to: navigation, search

Please input one-sentence summary here.

Annotated Information

Transcriptomic Nomeclature

Please input transcriptomic nomeclature information here.

Function

Please input function information here.

Regulation

Please input regulation information here.

Expression

Please input expression information here.

Allelic Information and Variation

Please input allelic information and variation information here.

Evolution

Please input evolution information here.

You can also add sub-section(s) at will.

Labs working on this lncRNA

Please input related labs here.

References

Please input cited references here.

Basic Information

Transcript ID

NONHSAT101661

Source

NONCODE4.0

Same with

,

Classification

intergenic

Length

4572 nt

Genomic location

chr5+:60474133..60479163

Exon number

3

Exons

60474133..60476024,60476179..60476280,60476586..60479163

Genome context

Sequence
000001 CAGGTAATGC CTTGGTTTGC CCAGCTCTTG GTAGAATTTT GAACTGACAG CGGATTACAT TGTGTCATGA GGTGGGAAGG 000080
000081 GCTGGGAAGG CAACATCCAC ATATCCCCAT TATGATCTAG TCATTGTTTC CATTTCAAAG GAGCGATATA TTCAGCTCAG 000160
000161 CACTTCCGTC TATTTAAGAT ATGTTAGCTC CATGAATCAG AGTGAATCTA CATATTCCCA TTGTGAACCT GACATTGAAA 000240
000241 TATACGTTTT CCAGTTTCAA TCTACTCACC TGTTATGAGT GAAGCTTACC TTTAAAAAAA AAAAAAAAGG AGCAGAGATA 000320
000321 TTTAATATTC CCAATATTTG CTTTTGATGA ACTGATACTT TTGAACTTTT AAAACTATTT TATGTCTTCT GTACTCACAG 000400
000401 CTCGTATACC CCCACAAGAG TGGAGAAGCC ACAGGGAAAT ACAACTGGAT CTTACTGTGA AAGGCAGGAT TATGAGGTCT 000480
000481 CGATATCAAG GCTGCCTAGA GGCTGGTATA AAAAAATGCT GATGTCCACA TAACTTGTAA CTCTTAAGTA AATCAAACCT 000560
000561 GTAATTCAGC TTGGCTGCTT CTAAGTCATT TTGATTCCTG CCAACAGAAC ATTAATGAAC TTAGCTTCTC CGCAAGTTTA 000640
000641 TTCATGTGTA ATACCCAATT CCTTCTTCCT CGTTCCAGCA GTTGCCCCTG GTACATCTGA TAAGGTGGAA AGCTAGACTA 000720
000721 GGTCATACAA ATCCAGGTCA TTACTTAAAC CCAGATTATT ATAGACTGTT CTCTAATTTT TCTGTCTGGT TTTAGAGTGG 000800
000801 CAAACAGAAG AAAGTTTCTA GGGCTTGGGT TTGGAAAAAA AATCCATACA CTAATAATTC TTGTCTTGAA GAAACATTTT 000880
000881 ATAAAATTTA GCTTATTCTA ATGCCTTTAT ATTTCCTACT TTTATTCTCC TTTTGACAAT GATAAATAAT TCAATTATAT 000960
000961 TGCAGTTTTG CTTTTGGATC TTGGTTTACC TCCATGGATT TACAGTGATG GGAATATTCT AGAACAAGTA GTCATCTAGA 001040
001041 TTTCCCCAAA TGCCCTGAGA CTGATATAAG AAGCATACTA AAATGGACAT GAACCTAATT TTTCGAAGTA TAAAATAATA 001120
001121 TAAACcagtt gtgaccaaac ttgcctacaa attagaattg cctggagatc tctaaaatgc caaagcccgg gacatatccc 001200
001201 aaaccaatta aatcactatt tctcgaggtg ggacacaggc gtcaggagcg ctttaactcc caggatgatt cctatttgta 001280
001281 gacaagctta ggaaccactg ACCTAGACAA CTGTGTTACT TGAGGATTCT AAGTCGGCAC AGCTGTGGCG TGTTCACTAC 001360
001361 CCAGCCCTGC CCTTGGCATG ATGTTCTCTC CTATCCAGCT TCTCCATCAT CTTCCTCCAC CTGATTCGTA GAGACCTCTG 001440
001441 GACATCTGAG CCTAAGCAGG GCCACTAAAT GAGAGAGAAT CCCATCACTG AAAAGCAACA ATGAATATAT TAAGGACCAC 001520
001521 AGGTGGGTGT AAAATAGATT TTTGCAAATT TTGTGAATTG TTATTTGTTA TATTTTTCCT GATAACAAAA GTTTTATGTC 001600
001601 AAAGATTTTT GTCTTCCAGT TTTAGTTTCT TATGCAGTAG TTTTAGGAAA CTCTGGTGTG CCAAGATGAT ATTTTTTTTT 001680
001681 CATGCTTGGG TGATTGTTTT AGAATTCCTT CTTTGAGTTC AAGAACATCT TTAAAATATC TAACTCAGGG GTATTATGTG 001760
001761 TGTTGAGTGA GAATTGTCTA TTTGCCTTCT TTAAAGAAAC ACAATTTGAC ACAAAGATAG GTATTAAAAT TTTTATTTTC 001840
001841 TGTGTATACA CAGCACATTA GTTATTTATT TAAGCAACCA CTGGCTGTCC AAGAGAGATA GAAGAAATAT TGCAACTGAA 001920
001921 GCTGCTAGAT GGTTATGATT ATCTGAATAG GTGTGATCTT GACTTTAGCG AATTAACAGG TAAGCTTCTT ACTAATTTTA 002000
002001 TTCTATAAGT GATTATTGGG GGCAGAGGGT AGAAAAAAAG TGAGAATACA TGTCTGTCTC ATGATAAACA AAAGCCTGAG 002080
002081 ATGTTACAGT TACTTACTAA AATCACTTTT ATCATATGAA GATTTTGGAT AAATCATTAT TAAGGAAACT AAGGACAGCT 002160
002161 GTGGCATGGA ATCAGCGGGC TTTCAGTTTG AATCCCAGTT ACTCACAAAT GTAAAGTGGT ATTTTCATTC CTCTTTTGTT 002240
002241 AAAATGCTTT TTTTAAATGT AGGAAGTTTA CATTTTGTCC TAGATCACAT TGGCTTCTGT TCCCACAGGA AATGTTATAT 002320
002321 GCTATATGTC AGTTTTCATT TCATTAAAAG TGAAAGTAAA CATGGTATGC ATCAGCCAAA CAGTAACTAA TCCCGACATA 002400
002401 ATTTCAGTAT TAAATGAGGC TTGGAATATG TATTTGAATG GGCATGATAG TTTGGGAAAC TCTAATTCAG AGATGTCCCA 002480
002481 CTTATAACAG TGATGTTGGA GAATAGGGTT ACCAGTTATG GCCTATATTT GCTCTGTCAT ATCACGTTCT TACAGTATGT 002560
002561 AATTATATTT GGAGAAATAT CCTTTGAGCC ATAGTATTTT GGAGTCAAGT ATTAGGCTAT AATTTAGACA TTGTTTTAGT 002640
002641 AGAATAAAGT AACTTTATAA GATTGTTCCA GATCTTTGTT CTATATCTCT GTCTTTTATC CTTCCCCCTT GCAGAAGTCA 002720
002721 TTCCTATTTA CCTTATTTAA AAAACGAGAG AAAGTATAAC TCAGGCTGAA ATAAAGCACC AGACTAAAAA GAGATGCCCA 002800
002801 TTTTCATTCA ATATTTATAT TTGCCAGAAT AAAGGGAGAC AAATAATTAT TTTGCCTTTA AGGCCTTGAT TCTGGATGAT 002880
002881 AGATTAAGTA GACAGACTGT TGACCACATG TGACTTAAGG AATATTGTTA ATTATTTTAC CAATCTTTGG ATCAAGGTTT 002960
002961 GCTTGAAGTA AGTAAGCTGT GCTGTGGATA AACCTTTGGA TGATAAAGAG TGAGCTGTAT TGCCTTCAAT ATTGTTTGCA 003040
003041 TCAGAACATT ACTAAGTGGT ATCGATTTTG AAAAATAAAA TTCAACTTTC TGAGAACATA GACCAGATTC AAGTATTTTT 003120
003121 ATCTTTTTTT CCCCCCtttt ttcagagaat ggaatctctc cacgttgccc aagcaggtct cgaactcctg tacttgagct 003200
003201 attctcccac ctctgcctcc ctaagtgctg ggattacaag cgtgagccac tgttcatggc cTTAGGGACT TTTAAATGTT 003280
003281 AAATTGATTT TGACCTACCC AGGAGCTAGA GAATACCAAT GATTACAGAG GGGAGAATAA AATTGCCGTA TAGATTTTCT 003360
003361 GTGCTCCTTT AGAAAATCCA TTCTGGAGTA AAGAGATGGG CAATTGGTCA ACTGAGTTTC TTTCTTGTAT AATATGTCAG 003440
003441 AGTTTGATAT TGTCAGTCTG TTAACCAAGA TTTTTATTGG GATATTTGCT GAAAGAGGCT CTAAGAATAT CCCATCACCA 003520
003521 GTAATTAGAA TTTGTTACAG GTTTCAGTCA GGGTAGTTGA AAGGGAGCTA ATTTGGTAAC ACAATAACCT AGTACCCAAC 003600
003601 TAATTCAGGA GCATGAAGAA AAATTGGTTG ATTAAGGCTT AAGTCAGATA AATAATACTA TAGGTTGACA TGGCTGGGTT 003680
003681 AGCTCAGGTT TGTATATTTG TTTTTTCAAA GAGGCAGACT TTTCCTAAGA TTAGAATCCA CAGTAATGAT GGAAGCAGGA 003760
003761 AACTTAGGGG CAGGCCATAG GTGTAAAATC AAAGTTGAGC TGCTTCTGTG TTACAGCTGT AAGCACAGGA AGTTGCCCAT 003840
003841 TTTTTTCAAG CACTATTTCT GCATAGTCAT ATATCAGCTA GTGTTGTCAA CATAGACACA AGAGACAATA GGAATACGTA 003920
003921 CATGCAATTC CACCATCAGC TAAAAAACCC ACATAAGTCT TTGAAGCTCT CAAATAACAC GCATGACACT GAATGGTACA 004000
004001 TTTTGATGCA CTTTGAAACT CTTATCTCTT TAGTTGGTAA GTACATTTGA ACACTTAATA TGTGCTCAGC AGAAGAGAAG 004080
004081 GAGAACAAGG GAAGTCTAAT ATAAAATCAG GATTTTTTCT TTTCTCTTTG AAGCCAATTA GAAtgtgtgt atgtgtgtgt 004160
004161 gtgtgtgtgt gtgtgtgtgc gtgtgtgtgt gtgtgtgtgt gtATTATTTA GTGTAGTAAT CCTTGGGTTT TTTTTAAAAA 004240
004241 AACGTGTTTT ACtatgaagt gtaacataca agaaagtgat aacatatgac atatacagct tacaaatggt tgtaagaatt 004320
004321 aatacacttt gtaagtgcca cccatggtaa aaataagtag gtggagaagg aggagaagaa aagaagaatt taactagcat 004400
004401 gtcagaagtc tcatggatct ctttccagtc acaaccctct caccaagctc gattcctatt ccctgagtaa gttctaatct 004480
004481 tacttttctt tttttttttt tttttgagag agagtctgac tctgtcaccc aagctagagt ggtgcaatct cggttcactg 004560
004561 caatctctgc ct
[back to top]

Predicted Small Protein

Name NONHSAT101661_smProtein_2165:2389
Length 75
Molecular weight 8731.9771
Aromaticity 0.175675675676
Instability index 41.7783783784
Isoelectric point 8.69122314453
Runs 8
Runs residual 0.0345418589321
Runs probability 0.0464140611199
Amino acid sequence MESAGFQFESQLLTNVKWYFHSSFVKMLFLNVGSLHFVLDHIGFCSHRKCYMLYVSFHFI
KSESKHGMHQPNSN
Secondary structure LLLLLLEEHHHHHHHHHHHHHHHHHHHHHHLLLHHHHHHHHLLLLLLLEEEEEEEEEEEE
EELLLLLLLLLLLL
PRMN LLLLLLLLLLLLLLLLLLLLLHHHHHHHHHHHHHHHHHHLLLLHHHHHHHHHHHHHHHHH
HLLLLLLLLLLLLL
PiMo iiiiiiiiiiiiiiiiiiiiiTTTTTTTTTTTTTTTTTTooooTTTTTTTTTTTTTTTTT
Tiiiiiiiiiiiii