Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article
Data sources: ZENODO
addClaim

Semantic Preservation in Multilingual Telemedicine Using Hybrid ASR-LLM Systems: A Framework for Clinical Intent Alignment

Authors: Aryan Raj Thakur;

Semantic Preservation in Multilingual Telemedicine Using Hybrid ASR-LLM Systems: A Framework for Clinical Intent Alignment

Abstract

This paper addresses the critical challenge of semantic drift and latency in multilingual telemedicine through a novel hybrid edge-cloud architecture. While teleconsultation demand increases globally, existing cross-lingual tools often fail to accurately preserve complex clinical intent or meet real-time performance requirements. We propose a framework that couples edge-based automatic speech recognition using OpenAI Whisper-v4 with cloud-based large language model reasoning via Gemini 3.1. Central to this architecture is a semantic alignment layer that maps natural language inputs to standardized medical ontologies, such as SNOMED-CT and ICD-11. Such hybrid approaches are essential for maintaining diagnostic integrity in distributed digital health environments . Evaluation using a simulated multilingual dataset demonstrates that the proposed system achieves a semantic fidelity score of 0.95 and an average end-to-end latency of 200ms. These results indicate that ontology-aware reasoning significantly outperforms standalone translation pipelines in preserving clinical intent. This research advances the methodology for real-time medical translation by addressing phonetic substitution errors and colloquial symptom mapping, which are common points of failure in generalpurpose models . The proposed framework provides a robust foundation for scalable, high-fidelity multilingual teleconsultation..

Powered by OpenAIRE graph
Found an issue? Give us feedback