Name: MOSAIC: Multiple Observers Spotting AI Content
Keywords: Large Language Model, FOS: Computer and information sciences, Computer Science - Computation and Language, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing, Model Ensembling, Computation and Language (cs.CL), Artificial Text Detection

Publication: Article, Preprint, Conference object
Date: 01 Jan 2025
Embargo end date: 01 Jan 2024
Publisher: Association for Computational Linguistics (ACL)
Journal: Findings of the Association for Computational Linguistics: ACL 2025

Authors: Dubois, Matthieu; Yvon, François; Piantanida, Pablo

doi: 10.18653/v1/2025.findings-acl.1244, 10.48550/arxiv.2409.07615

arXiv: http://arxiv.org/abs/2409.07615


Abstract

The dissemination of Large Language Models (LLMs), trained at scale, and endowed with powerful text-generating abilities, has made it easier for all to produce harmful, toxic, faked or forged content. In response, various proposals have been made to automatically discriminate artificially generated from human-written texts, typically framing the problem as a binary classification problem. Early approaches evaluate an input document with a well-chosen detector LLM, assuming that low-perplexity scores reliably signal machine-made content. More recent systems instead consider two LLMs and compare their probability distributions over the document to further discriminate when perplexity alone cannot. However, using a fixed pair of models can induce brittleness in performance. We extend these approaches to the ensembling of several LLMs and derive a new, theoretically grounded approach to combine their respective strengths. Our experiments, conducted with various generator LLMs, indicate that this approach effectively leverages the strengths of each model, resulting in robust detection performance across multiple domains. Our code and data are available at https://github.com/BaggerOfWords/MOSAIC.
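To make the two baseline ideas mentioned in the abstract concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the MOSAIC method and not code from the authors' repository. The function names (`perplexity`, `cross_entropy`, `contrast_score`) are hypothetical, the inputs are toy probabilities rather than real LLM outputs, and the ratio used to contrast two models is just one plausible choice, since the abstract does not give a formula.

```python
import math

def perplexity(token_probs):
    """Perplexity of a document, given the probability a detector model
    assigned to each token that actually occurs in it."""
    if not token_probs:
        raise ValueError("need at least one token probability")
    avg_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(avg_nll)

def cross_entropy(dists_a, dists_b):
    """Average cross-entropy between two models' next-token distributions,
    taken position by position over the document (toy vocabularies)."""
    total = 0.0
    for dist_a, dist_b in zip(dists_a, dists_b):
        total += -sum(p * math.log(q) for p, q in zip(dist_a, dist_b) if p > 0)
    return total / len(dists_a)

def contrast_score(token_probs, dists_a, dists_b):
    """Hypothetical two-model score: log-perplexity under one model divided by
    the cross-entropy between both models. This is an assumed illustration of
    'comparing probability distributions', not the paper's formula."""
    return math.log(perplexity(token_probs)) / cross_entropy(dists_a, dists_b)

# Toy usage: tokens the detector finds very predictable give a lower perplexity,
# which single-model detectors read as a hint of machine-generated text.
predictable = [0.9, 0.8, 0.95]
surprising = [0.1, 0.2, 0.05]
print(perplexity(predictable) < perplexity(surprising))
```

With real models, the per-token probabilities and next-token distributions would come from the detector LLMs' outputs. Any decision threshold on these scores would have to be tuned on labelled data.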

ACL 2025 Findings. Code can be found at https://github.com/BaggerOfWords/MOSAIC

Related Organizations

Inserm (France)
University of Paris-Saclay (France)
Mila - Quebec Artificial Intelligence Institute (Canada)
Sorbonne University (France)
French National Centre for Scientific Research (France)


Impact by BIP!

selected citations: 0. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.


Green

Related to Research communities

UArctic