Genomics enters the deep learning era.

Etienne Routhier, Julien Mozziconacci
Author Information
  1. Etienne Routhier: LPTMC, Sorbonne Université, Paris, France.
  2. Julien Mozziconacci: StrInG Lab, Museum National d'Histoire Naturelle, Paris, France.

Abstract

The tremendous amount of biological sequence data available, combined with the recent methodological breakthrough in deep learning in domains such as computer vision or natural language processing, is leading today to the transformation of bioinformatics through the emergence of deep genomics, the application of deep learning to genomic sequences. We review here the new applications that the use of deep learning enables in the field, focusing on three aspects: the functional annotation of genomes, the sequence determinants of the genome functions and the possibility to write synthetic genomic sequences.

Keywords

References

  1. Methods. 2019 Aug 15;166:40-47 [PMID: 30922998]
  2. Bioinformatics. 2019 Sep 15;35(18):3294-3302 [PMID: 30753280]
  3. IEEE Trans Nanobioscience. 2019 Apr;18(2):136-145 [PMID: 30624223]
  4. Bioinformatics. 2020 Jan 15;36(2):496-503 [PMID: 31318408]
  5. Bioinformatics. 2019 Jul 15;35(14):i108-i116 [PMID: 31510655]
  6. Sci Rep. 2019 Jun 11;9(1):8484 [PMID: 31186519]
  7. Nat Methods. 2021 Oct;18(10):1196-1203 [PMID: 34608324]
  8. BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):478 [PMID: 29219068]
  9. Nat Genet. 2019 Jan;51(1):12-18 [PMID: 30478442]
  10. Genome Biol. 2021 Mar 31;22(1):94 [PMID: 33789710]
  11. Bioinformatics. 2018 Mar 1;34(5):732-738 [PMID: 29069282]
  12. Cell Rep. 2020 May 19;31(7):107663 [PMID: 32433972]
  13. RNA Biol. 2018;15(12):1468-1476 [PMID: 30486737]
  14. Genome Res. 2020 Dec 18;: [PMID: 33355297]
  15. Nucleic Acids Res. 2017 Jun 20;45(11):e99 [PMID: 28334830]
  16. Bioinformatics. 2019 Jun 1;35(11):1837-1843 [PMID: 30351403]
  17. Nat Commun. 2019 Mar 1;10(1):998 [PMID: 30824707]
  18. Brief Bioinform. 2021 Jul 20;22(4): [PMID: 33059369]
  19. Nat Genet. 2020 Aug;52(8):769-777 [PMID: 32601476]
  20. Science. 2019 Oct 18;366(6463):310-312 [PMID: 31624201]
  21. J Comput Biol. 2019 Jun;26(6):509-518 [PMID: 30785347]
  22. Nucleic Acids Res. 2018 Jun 20;46(11):e69 [PMID: 29617928]
  23. Nat Methods. 2020 Nov;17(11):1118-1124 [PMID: 33046896]
  24. Bioinformatics. 2020 Jan 1;36(1):81-89 [PMID: 31298694]
  25. Genome Biol. 2019 Mar 1;20(1):48 [PMID: 30823901]
  26. BMC Bioinformatics. 2018 Nov 20;19(Suppl 14):418 [PMID: 30453896]
  27. Molecules. 2019 Nov 07;24(22): [PMID: 31703384]
  28. Pac Symp Biocomput. 2016;22:254-265 [PMID: 27896980]
  29. Bioinformatics. 2017 Jul 15;33(14):i234-i242 [PMID: 28881981]
  30. Bioinformatics. 2019 May 15;35(10):1660-1667 [PMID: 30295703]
  31. Bioinformatics. 2019 Dec 15;35(24):5235-5242 [PMID: 31077303]
  32. PLoS Comput Biol. 2018 Oct 4;14(10):e1006484 [PMID: 30286077]
  33. Nat Commun. 2018 Aug 7;9(1):3135 [PMID: 30087331]
  34. IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):667-676 [PMID: 31634140]
  35. Bioinformatics. 2022 Apr 28;38(9):2397-2403 [PMID: 35238376]
  36. Nucleic Acids Res. 2016 Jun 20;44(11):e107 [PMID: 27084946]
  37. Bioinformatics. 2017 Jul 15;33(14):i92-i101 [PMID: 28881969]
  38. PeerJ Comput Sci. 2020 Jun 15;6:e278 [PMID: 33816929]
  39. Bioinformatics. 2018 Sep 1;34(17):3038-3040 [PMID: 29668842]
  40. Nat Biotechnol. 2015 Aug;33(8):831-8 [PMID: 26213851]
  41. Nucleic Acids Res. 2019 Nov 18;47(20):10597-10611 [PMID: 31544924]
  42. PLoS One. 2017 Feb 3;12(2):e0171410 [PMID: 28158264]
  43. Nat Methods. 2020 Nov;17(11):1111-1117 [PMID: 33046897]
  44. Nat Methods. 2015 Feb;12(2):147-53 [PMID: 25486063]
  45. Bioinformatics. 2019 Jul 15;35(14):i269-i277 [PMID: 31510640]
  46. Nat Biotechnol. 2018 Nov;36(10):983-987 [PMID: 30247488]
  47. Nat Mach Intell. 2022 Jan;4(1):41-54 [PMID: 35966405]
  48. Nucleic Acids Res. 2015 Jul 1;43(W1):W39-49 [PMID: 25953851]
  49. Nat Struct Mol Biol. 2017 Oct;24(10):870-878 [PMID: 28869609]
  50. Nat Genet. 2021 Mar;53(3):354-366 [PMID: 33603233]
  51. Bioinformatics. 2019 Aug 15;35(16):2730-2737 [PMID: 30601980]
  52. Genome Res. 2020 Dec;30(12):1815-1834 [PMID: 32732264]
  53. BMC Bioinformatics. 2018 Dec 31;19(Suppl 19):524 [PMID: 30598068]
  54. Cell. 2019 Jan 24;176(3):535-548.e24 [PMID: 30661751]
  55. Genome Biol. 2018 Jun 26;19(1):80 [PMID: 29945655]
  56. Bioinformatics. 2018 May 15;34(10):1705-1712 [PMID: 29329398]
  57. BMJ. 2020 Apr 7;369:m1328 [PMID: 32265220]
  58. Nat Genet. 2022 May;54(5):725-734 [PMID: 35551308]
  59. Nat Genet. 2019 Jun;51(6):973-980 [PMID: 31133750]
  60. Nat Mach Intell. 2021 Mar;3(3):258-266 [PMID: 34322657]
  61. Nat Rev Genet. 2015 Jun;16(6):321-32 [PMID: 25948244]
  62. Nat Rev Genet. 2019 Jul;20(7):389-403 [PMID: 30971806]
  63. Comput Struct Biotechnol J. 2020 Jun 17;18:1466-1473 [PMID: 32637044]
  64. Genome Res. 2018 May;28(5):739-750 [PMID: 29588361]
  65. Bioinformatics. 2018 Sep 1;34(17):2889-2898 [PMID: 29648582]
  66. BMC Genomics. 2019 Apr 4;20(Suppl 2):193 [PMID: 30967126]
  67. Genome Biol. 2017 Apr 11;18(1):67 [PMID: 28395661]
  68. Bioinformatics. 2019 Nov 1;35(22):4577-4585 [PMID: 31081512]
  69. Cell Syst. 2020 Jul 22;11(1):49-62.e16 [PMID: 32711843]
  70. Nat Commun. 2021 Feb 26;12(1):1167 [PMID: 33637701]
  71. Methods Mol Biol. 2018;1807:9-20 [PMID: 30030800]
  72. Bioinformatics. 2018 Sep 1;34(17):i629-i637 [PMID: 30423062]
  73. Quant Biol. 2020 Mar;8(1):64-77 [PMID: 34084563]
  74. Genome Res. 2022 Mar;32(3):512-523 [PMID: 35042722]
  75. Bioinformatics. 2019 Jul 1;35(13):2177-2184 [PMID: 30481258]
  76. Inf Fusion. 2019 Oct;50:71-91 [PMID: 30467459]
  77. NAR Genom Bioinform. 2020 Feb 19;2(1):lqaa009 [PMID: 33575556]
  78. Brief Funct Genomics. 2019 Feb 14;18(1):41-57 [PMID: 30265280]
  79. Nat Methods. 2015 Oct;12(10):931-4 [PMID: 26301843]
  80. Nat Commun. 2020 Dec 1;11(1):6141 [PMID: 33262328]
  81. PLoS One. 2019 Jun 17;14(6):e0218073 [PMID: 31206543]
  82. Interdiscip Sci. 2019 Dec;11(4):628-635 [PMID: 30588558]
  83. PeerJ. 2019 Nov 1;7:e7990 [PMID: 31695967]
  84. Elife. 2020 Jan 27;9: [PMID: 31985400]
  85. Science. 2012 Aug 17;337(6096):816-21 [PMID: 22745249]
  86. Bioinformatics. 2017 Jul 01;33(13):1930-1936 [PMID: 28334114]
  87. Epigenetics. 2017 Jul 3;12(7):505-514 [PMID: 28524769]
  88. PLoS Comput Biol. 2019 Dec 19;15(12):e1007560 [PMID: 31856220]
  89. Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:2394-2397 [PMID: 30440889]
  90. Bioinformatics. 2020 Mar 1;36(5):1405-1412 [PMID: 31598637]
  91. Front Genet. 2019 Mar 27;10:267 [PMID: 30972108]
  92. Biol Cybern. 1980;36(4):193-202 [PMID: 7370364]
  93. Genome Res. 2017 Dec;27(12):2015-2024 [PMID: 29097404]
  94. J Chem Inf Model. 2019 Jan 28;59(1):615-624 [PMID: 30485088]
  95. Quant Biol. 2019 Jun;7(2):122-137 [PMID: 34113473]
  96. PLoS One. 2019 Sep 11;14(9):e0222271 [PMID: 31509583]
  97. Cell. 2019 Jun 27;178(1):91-106.e23 [PMID: 31178116]
  98. Bioinformatics. 2019 Apr 1;35(7):1125-1132 [PMID: 30184052]
  99. J Cell Biol. 1989 Feb;108(2):229-41 [PMID: 2645293]
  100. Bioinformatics. 2016 Jun 15;32(12):i121-i127 [PMID: 27307608]
  101. PLoS Comput Biol. 2020 Jul 20;16(7):e1008050 [PMID: 32687525]
  102. Genome Res. 2016 Jul;26(7):990-9 [PMID: 27197224]
  103. Cells. 2019 Dec 14;8(12): [PMID: 31847308]
  104. Bioinformatics. 2018 Apr 15;34(8):1261-1269 [PMID: 29155928]
  105. Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8 [PMID: 19458158]
  106. J Chem Inf Model. 2020 Jun 22;60(6):2773-2790 [PMID: 32250622]
  107. IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):355-364 [PMID: 30835229]
  108. Bioinformatics. 2022 Apr 12;38(8):2102-2110 [PMID: 35020807]

MeSH Term

Deep Learning
Neural Networks, Computer
Genomics
Computational Biology

Word Cloud

Created with Highcharts 10.0.0deeplearningsequencegenomicsequencesgenomesGenomicstremendousamountbiologicaldataavailablecombinedrecentmethodologicalbreakthroughdomainscomputervisionnaturallanguageprocessingleadingtodaytransformationbioinformaticsemergencegenomicsapplicationreviewnewapplicationsuseenablesfieldfocusingthreeaspects:functionalannotationdeterminantsgenomefunctionspossibilitywritesyntheticenterseraBioinformaticsDeepEpigenomicsGeneticsMetagenomicsNeuralnetworksPersonalizedmedecineReviewSynthetic

Similar Articles

Cited By