Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Report . 2026
License: CC BY
Data sources: ZENODO
ZENODO
Report . 2026
License: CC BY
Data sources: Datacite
ZENODO
Report . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Catelingo: Constraint-Based Semantic Validity Verification for Language Models

Authors: Miya, Shinobu;

Catelingo: Constraint-Based Semantic Validity Verification for Language Models

Abstract

Large language models frequently produce outputs that are syntactically fluent yet semantically invalid.While recent verification approaches focus on reasoning chains or factual knowledge, many semantic failures occur without explicit reasoning or missing facts.Such errors include temporal inconsistencies, numerical impossibilities, and semantic type clashes, which cannot be reliably detected by reasoning-based or knowledge-based methods. This paper introduces Catelingo, a constraint-based semantic validity verifier that detects these failures by checking the satisfiability of explicit semantic constraints.Rather than evaluating truth, likelihood, or factual correctness, Catelingo defines semantic validity as constraint satisfiability induced by an input sentence.The proposed approach requires neither reasoning chains nor knowledge retrieval, and does not rely on model retraining. We implement a toy version of Catelingo using a small, sense-level lexicon and explicit constraint propagation over syntactic dependency structures.Experiments demonstrate that Catelingo correctly detects semantic no-go cases involving temporal ordering violations, numerical range violations, and semantic type incompatibilities.We further show that metaphorical expressions can be selectively permitted through explicit degeneration rules, and that domain adaptation can be achieved by switching constraint profiles rather than retraining models. These results suggest that constraint-based semantic verification provides a lightweight and scalable complement to existing reasoning- and knowledge-based verification methods, addressing a class of semantic failures that they do not cover.An open-source reference implementation and deterministic test cases are provided for reproducibility.The source code for the verification, Catelingo, is open-sourced at https://github.com/ShinobuMiya/Catelingo This work is intended as a design-oriented technical report and is not tied to a specific benchmark or leaderboard.

Keywords

Hallucinations, semantic verification, constraint satisfaction, symbolic methods

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Green