
arXiv: 2001.06358
Arguing for the need to combine declarative and probabilistic programming, Bárány et al. (TODS 2017) recently introduced a probabilistic extension of Datalog as a “purely declarative probabilistic programming language.” We revisit this language and propose a more principled approach towards defining its semantics based on stochastic kernels and Markov processes—standard notions from probability theory. This allows us to extend the semantics to continuous probability distributions, thereby settling an open problem posed by Bárány et al. We show that our semantics is fairly robust, allowing both parallel execution and arbitrary chase orders when evaluating a program. We cast our semantics in the framework of infinite probabilistic databases (Grohe and Lindner, LMCS 2022) and show that the semantics remains meaningful even when the input of a probabilistic Datalog program is an arbitrary probabilistic database.
FOS: Computer and information sciences, Datalog, measure theory, Computer Science - Logic in Computer Science, Probability in computer science (algorithm analysis, random structures, phase transitions, etc.), Computer Science - Databases, stochastic kernels, Database theory, probabilistic programming, probabilistic databases, generative Datalog, Databases (cs.DB), Logic in Computer Science (cs.LO)
FOS: Computer and information sciences, Datalog, measure theory, Computer Science - Logic in Computer Science, Probability in computer science (algorithm analysis, random structures, phase transitions, etc.), Computer Science - Databases, stochastic kernels, Database theory, probabilistic programming, probabilistic databases, generative Datalog, Databases (cs.DB), Logic in Computer Science (cs.LO)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 9 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
