Downloads provided by UsageCounts
Here, we introduce QM7-X, a comprehensive dataset of > 40 physicochemical properties for ~4.2 M equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. To span this fundamentally important region of chemical compound space (CCS), QM7-X includes an exhaustive sampling of (meta-)stable equilibrium structures---comprised of constitutional/structural isomers and stereoisomers, e.g., enantiomers and diastereomers (including cis-trans-and conformational isomers)---as well as 100 non-equilibrium structural variations thereof to reach a total of ~4.2 M molecular structures. Computed at the tightly converged quantum-mechanical PBE0+MBD level of theory, QM7-X contains global (molecular) and local (atom-in-a-molecule) properties ranging from ground state quantities (such as atomization energies and dipole moments) to response quantities (such as polarizability tensors and dispersion coefficients). By providing a systematic, extensive, and tightly converged dataset of quantum-mechanically computed physical and chemical properties, we expect that QM7-X will play a critical role in the development of next-generation machine-learning based models for exploring greater swaths of CCS and performing in silico design of molecules with targeted properties. The dataset is provided in eight HDF5 based files (compressed in .XZ files). One can also find here a README file with technical usage details and examples of how to access the information stored in the dataset (see createDB.py). *The paper explaining the generation of data stored in QM7-X will be published soon.
JH, LMS, and AT acknowledge financial support from the European Research Council (ERC-CoG grant BeStMo). BGE and RAD are grateful for support from start-up funding through the College of Arts and Sciences at Cornell University. The results presented in this publication have been partially obtained using the HPC facilities of the University of Luxembourg. This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357.
machine learning, molecular property, Physicochemical properties, Machine learning, qm7, Non-equilibrium structures, chemistry, Organic molecules, Quantum mechanics
machine learning, molecular property, Physicochemical properties, Machine learning, qm7, Non-equilibrium structures, chemistry, Organic molecules, Quantum mechanics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 4 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 1K | |
| downloads | 679 |

Views provided by UsageCounts
Downloads provided by UsageCounts