
Quantitative structure-activity relationship (QSAR) and Quantitative structure-property relationship (QSPR) are mathematical models for the prediction of the chemical, physical or biological properties of chemical compounds. Usually, they are based on structural (grounded on fragment contribution) or calculated (centered on QSAR three-dimensional (QSAR-3D) or chemical descriptors) parameters. Hereby, we describe a Graph Theory approach for generating and mining molecular fragments to be used in QSAR or QSPR modeling based exclusively on fragment contributions. Merging of Molecular Graph Theory, Simplified Molecular Input Line Entry Specification (SMILES) notation, and the connection table data allows a precise way to differentiate and count the molecular fragments. Machine learning strategies generated models with outstanding root mean square error (RMSE) and R2 values. We also present the software Charming QSAR & QSPR, written in Python, for the property prediction of chemical compounds while using this approach.
Artificial intelligence, Computational chemistry, Materials Science, gradient boosting machine, Graph, Computational Chemistry, Theoretical computer science, Machine learning, QA1-939, Materials Chemistry, FOS: Mathematics, Environmental Chemistry, support vector machine, QSAR Modeling, Biology, Accelerating Materials Innovation through Informatics, fragment based QSAR, Computer science, Chemistry, fragment based QSPR, Computational Theory and Mathematics, Chemical Properties, Biological system, Computer Science, Physical Sciences, Environmental Science, Principles and Applications of Green Chemistry, Quantitative structure–activity relationship, random forest, Molecular graph, Mathematics, Computational Methods in Drug Discovery, Molecular descriptor
Artificial intelligence, Computational chemistry, Materials Science, gradient boosting machine, Graph, Computational Chemistry, Theoretical computer science, Machine learning, QA1-939, Materials Chemistry, FOS: Mathematics, Environmental Chemistry, support vector machine, QSAR Modeling, Biology, Accelerating Materials Innovation through Informatics, fragment based QSAR, Computer science, Chemistry, fragment based QSPR, Computational Theory and Mathematics, Chemical Properties, Biological system, Computer Science, Physical Sciences, Environmental Science, Principles and Applications of Green Chemistry, Quantitative structure–activity relationship, random forest, Molecular graph, Mathematics, Computational Methods in Drug Discovery, Molecular descriptor
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 31 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
