Downloads provided by UsageCounts
This repository contains an artificial dataset constructed for studying uncertainty characterization and quantification in chemical ML applications. The data are designed to be noise-free: the targets are group-additivity calculations of enthalpy of formation, not directly calculated or measured enthalpies of formation.

Where did the targets come from: The data files contain SMILES and targets for a simple group-additivity calculation of enthalpy of formation at 298 K. The group-additivity coefficients were fitted to the molecules of the qm9 computational chemistry database. Only fragments that appeared in at least 100 molecules were considered. The coefficients were rounded to 3 decimal places. Each group extends only to a bond radius of 1 from its central atom.

Where did the SMILES come from: The group-additivity coefficients were applied to the gdb11 computational chemistry dataset. The gdb11 dataset contains 26.4M molecular SMILES and attempts to cover all possible organic molecules of up to 11 heavy atoms composed of the elements C, H, O, N, and F. Molecules containing groups not represented in the group-additivity coefficients were excluded, leaving 7,906,815 SMILES. Although these SMILES contain chiral centers, their chirality is not specified. No SMILES are repeated.

Scripts used for generating data subsets and added-noise datasets are also included.
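
The target construction described above can be illustrated with a minimal sketch. It is not the script shipped with the repository. It assumes a group is keyed by the central heavy atom's element plus the sorted elements of its directly bonded neighbors (bond radius 1), and it ignores bond order for brevity. The function names `radius1_groups` and `group_additivity`, the key format, and the coefficient values are all hypothetical placeholders, not values from the dataset.

```python
from collections import Counter

def radius1_groups(atoms, bonds):
    """Count radius-1 groups: one group per heavy atom, keyed by its
    element and the sorted elements of its bonded neighbors.
    (Assumed group definition; bond order is ignored in this sketch.)"""
    neighbors = {i: [] for i in range(len(atoms))}
    for a, b in bonds:
        neighbors[a].append(atoms[b])
        neighbors[b].append(atoms[a])
    return Counter(
        f"{atoms[i]}-({','.join(sorted(neighbors[i]))})"
        for i in range(len(atoms))
        if atoms[i] != "H"  # hydrogens only appear as neighbors
    )

def group_additivity(groups, coefficients):
    """Sum coefficient * count over all groups. Return None when any
    group has no coefficient, mirroring the exclusion of such molecules."""
    if any(g not in coefficients for g in groups):
        return None
    return round(sum(n * coefficients[g] for g, n in groups.items()), 3)

# Ethanol (CH3-CH2-OH) written as an explicit atom list and bond list.
ethanol_atoms = ["C", "C", "O", "H", "H", "H", "H", "H", "H"]
ethanol_bonds = [(0, 1), (1, 2), (0, 3), (0, 4), (0, 5), (1, 6), (1, 7), (2, 8)]

# Placeholder coefficients, chosen only to make the arithmetic easy to follow.
placeholder_coefficients = {
    "C-(C,H,H,H)": 1.5,
    "C-(C,H,H,O)": -2.25,
    "O-(C,H)": 0.125,
}

ethanol_groups = radius1_groups(ethanol_atoms, ethanol_bonds)
ethanol_target = group_additivity(ethanol_groups, placeholder_coefficients)

# A molecule with an unrepresented group (here, methane's carbon) is excluded.
methane_groups = radius1_groups(["C", "H", "H", "H", "H"], [(0, 1), (0, 2), (0, 3), (0, 4)])
methane_target = group_additivity(methane_groups, placeholder_coefficients)
```

Because each target is a deterministic sum of rounded coefficients, the same group counts always yield the same value, which is the sense in which the data are noise-free. In practice the atom and bond lists would come from parsing SMILES with a cheminformatics toolkit rather than being written by hand.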
| Indicator | Description | Value |
| --- | --- | --- |
| Selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 |
| Popularity | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| Influence | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| Impulse | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| Views | | 27 |
| Downloads | | 25 |

Views provided by UsageCounts