sMOL Explorer: an open source, web-enabled database and exploration tool for Small MOLecules datasets.

Supawadee Ingsriswang, Eakasit Pacharawongsakda
Author Information
  1. Supawadee Ingsriswang: Information Systems Laboratory, BIOTEC Central Research Unit, National Center for Genetic Engineering and Biotechnology (BIOTEC), Klongluang, Pathumthani, 12120, Thailand. supawadee@biotec.or.th

Abstract

sMOL Explorer is a 2D ligand-based computational tool that provides three major functionalities: data management, information retrieval and extraction and statistical analysis and data mining through Web interface. With sMOL Explorer, users can create personal databases by adding each small molecule via a drawing interface or uploading the data files from internal and external projects into the sMOL database. Then, the database can be browsed and queried with textual and structural similarity search. The molecule can also be submitted to search against external public databases including PubChem, KEGG, DrugBank and eMolecules. Moreover, users can easily access a variety of data mining tools from Weka and R packages to perform analysis including (1) finding the frequent substructure, (2) clustering the molecular fingerprints, (3) identifying and removing irrelevant attributes from the data and (4) building the classification model of biological activity.
AVAILABILITY: sMOL Explorer is an Open Source project and is freely available to all interested users at http://www.biotec.or.th/ISL/SMOL/.

MeSH Term

Computer Graphics
Computer Simulation
Database Management Systems
Databases, Factual
Information Storage and Retrieval
Internet
Models, Chemical
Models, Molecular
Molecular Weight
Pharmaceutical Preparations

Chemicals

Pharmaceutical Preparations

Word Cloud

Created with Highcharts 10.0.0sMOLdatacanExplorerusersdatabasetoolanalysismininginterfacedatabasesmoleculeexternalsearchincluding2Dligand-basedcomputationalprovidesthreemajorfunctionalities:managementinformationretrievalextractionstatisticalWebcreatepersonaladdingsmallviadrawinguploadingfilesinternalprojectsbrowsedqueriedtextualstructuralsimilarityalsosubmittedpublicPubChemKEGGDrugBankeMoleculesMoreovereasilyaccessvarietytoolsWekaRpackagesperform1findingfrequentsubstructure2clusteringmolecularfingerprints3identifyingremovingirrelevantattributes4buildingclassificationmodelbiologicalactivityAVAILABILITY:OpenSourceprojectfreelyavailableinterestedhttp://wwwbiotecorth/ISL/SMOL/Explorer:opensourceweb-enabledexplorationSmallMOLeculesdatasets

Similar Articles

Cited By