
This is the analysis dataset used in the paper "Isolating Unisolated Upsilons with Anomaly Detection in CMS Open Data". This dataset is a distillation of the CMS Open Dataset DoubleMuon primary dataset in NANOAOD format from RunH of 2016. It contains $pp$ collisions at $\sqrt{s}$ = 13 GeV coming from validated luminosity runs, corresponding to 8.7 fb$^{-1}$ of integrated luminosity. Individual data files are pickled dictionaries and should be read in using the $\texttt{pickle}$ python package. The values are $\texttt{awkward}$ arrays. The keys correspond to a selection of muon and jet variables. Muon variables: `Muon_pt`, `Muon_eta`, `Muon_phi`, `Muon_charge`, `Muon_pfRelIso03_all`, `Muon_pfRelIso04_all`, `Muon_tightId`, `Muon_jetIdx`, `Muon_ip3d`, `Muon_jetRelIso`, `Muon_dxy`, `Muon_dz` Jet variables: `Jet_pt`, `Jet_eta`, `Jet_phi`, `Jet_mass`, `Jet_nConstituents`, `Jet_btagCSVV2`, `Jet_btagDeepB`, `Jet_btagDeepFlavB`, `MET_pt`, `MET_sumEt`, `PV_npvsGood`, `Jet_nMuons`, `Jet_qgl`, `Jet_muEF`, `Jet_chHEF`, `Jet_chEmEF`, `Jet_neEmEF`, `Jet_neHEF` In addition, we store the triggering information for all 40 triggers listed on the Open Data record page. The events are split into (28*2) files -- for both muon and jet observables, there are 28 pickled dictionaries corresponding to the 28 ROOT files in the corresponding CMS Open Data record. All scripts used to process this data are available at this repository. Please refer to the accompanying paper for more details on the analysis.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
