Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Doctoral thesis . 2026
License: CC BY
Data sources: ZENODO
ZENODO
Thesis . 2026
License: CC BY
Data sources: Datacite
ZENODO
Thesis . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Semantic Veracity Analyzer: Detecting Vocal Inconsistencies via FFT Peak Gradient Analysis for ASR Pre-filtering, AI Self-Feedback, and Psychiatric Voice Biomarker Applications

Authors: Fausto, Juan;

Semantic Veracity Analyzer: Detecting Vocal Inconsistencies via FFT Peak Gradient Analysis for ASR Pre-filtering, AI Self-Feedback, and Psychiatric Voice Biomarker Applications

Abstract

Open-source Python system that detects vocal inconsistencies through Fast Fourier Transform (FFT) spectral decomposition and multi-feature gradient analysis. The system combines peak amplitude gradient consistency, MFCC delta variance, spectral flux, Lippold microtremor energy (8-12 Hz), and Praat-derived vocal features (jitter, shimmer, HNR, formants F1-F4) into a Naturalness × Involuntary Stress Cartesian mapping. Key finding: deceptive speech exhibits reduced voluntary variation (jitter -27%, MFCC delta variance -40%, spectral flux -52%) but elevated involuntary microtremor (+17%) — consistent with the over-control hypothesis of deception (Zuckerman et al., 1981). Three proposed applications: (1) ASR truth pre-filtering to reduce hallucinations from contaminated input, (2) AI self-feedback where TTS systems analyze their own synthetic voice for uncertainty signals, (3) psychiatric voice biomarker monitoring for mood disorders. Includes a section on synthetic voice signal equivalence — demonstrating that FFT processes synthetic and organic voice identically at the signal level — and the proposal of "Logos Probabilis" as a taxonomic framework for probabilistic intelligence systems. Co-authored with Claude (Anthropic), who contributed Section 6 (AI perspective on self-feedback) and Section 7 (signal equivalence hypothesis). Code: https://github.com/sfaustodev/NLP-AI

Keywords

- Computer Science → Artificial Intelligence - Computer Science → Sound - Engineering → Signal Processing - Medicine → Psychiatry, Signal processing, jitter, Lie Detection, Artificial Intelligence/standards, FFT, Fast Fourier Transform, voice stress analysis, lie detection, deception detection, MFCC, jitter, shimmer, HNR, microtremor, Lippold, ASR, automatic speech recognition, voice biomarkers, psychiatric monitoring, AI safety, AI welfare, synthetic voice, signal processing, spectral analysis, Praat, Parselmouth, open source, microtremor, FFT, open source, Artificial Intelligence, synthetic voice, ai welfare, lippold, Psychiatry, Artificial Intelligence/ethics, praat, voice stress analysis, hnr, automatic speech recognition, psychiatric monitoring, parselmouth, Signal Processing, Computer-Assisted, shimmer, deception detection, spectral analysis, Fast fourier transform, asr, MFCC, voice biomarkers, Computer Science

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average