Using Large Language Models to Compare Explainable Models for Smart Home Human Activity Recognition

Name: Using Large Language Models to Compare Explainable Models for Smart Home Human Activity Recognition
Keywords: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction, Human-centered computing → Empirical studies in ubiquitous and mobile computing; HCI design and evaluation methods, Human-Computer Interaction (cs.HC)

Fiori, Michele; Civitarese, Gabriele; Bettini, Claudio

Found an issue? Give us feedback

downloadFull-Text

Archivio Istituziona...arrow_drop_down

Archivio Istituzionale della Ricerca dell'Università degli Studi di Milano

Conference object . 2024

Full-Text: https://air.unimi.it/bitstream/2434/1105811/3/3675094.3679000.pdf

Data sources: Archivio Istituzionale della Ricerca dell'Università degli Studi di Milano

https://doi.org/10.1145/367509...

Article . 2024 . Peer-reviewed

License: CC BY ND

Data sources: Crossref

arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

https://dx.doi.org/10.48550/ar...

Article . 2024

License: CC BY

Data sources: Datacite

Using Large Language Models to Compare Explainable Models for Smart Home Human Activity Recognition

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 05 Oct 2024Embargo end date: 01 Jan 2024Publisher:ACMJournal:Companion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing

Authors: Fiori, Michele; Civitarese, Gabriele; Bettini, Claudio;

doi: 10.1145/3675094.3679000 , 10.48550/arxiv.2408.06352

arXiv: 2408.06352

handle: 2434/1105811

Using Large Language Models to Compare Explainable Models for Smart Home Human Activity Recognition

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

Recognizing daily activities with unobtrusive sensors in smart environments enables various healthcare applications. Monitoring how subjects perform activities at home and their changes over time can reveal early symptoms of health issues, such as cognitive decline. Most approaches in this field use deep learning models, which are often seen as black boxes mapping sensor data to activities. However, non-expert users like clinicians need to trust and understand these models' outputs. Thus, eXplainable AI (XAI) methods for Human Activity Recognition have emerged to provide intuitive natural language explanations from these models. Different XAI methods generate different explanations, and their effectiveness is typically evaluated through user surveys, that are often challenging in terms of costs and fairness. This paper proposes an automatic evaluation method using Large Language Models (LLMs) to identify, in a pool of candidates, the best XAI approach for non-expert users. Our preliminary results suggest that LLM evaluation aligns with user surveys.

Accepted for publication at UbiComp / ISWC 2024's XAIforU workshop

Related Organizations

University of Milan
Italy
University of Milan
Italy

Keywords

FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction, Human-centered computing → Empirical studies in ubiquitous and mobile computing; HCI design and evaluation methods, Human-Computer Interaction (cs.HC)

2 Research products, page 1 of 1

ContextGPT: Infusing LLMs Knowledge into Neuro-Symbolic Activity Recognition Models
2024HasVersion
llm-xar software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

3

Top 10%

Average

Green

hybrid

Using Large Language Models to Compare Explainable Models for Smart Home Human Activity Recognition

Using Large Language Models to Compare Explainable Models for Smart Home Human Activity Recognition

2 Research products, page 1 of 1

ContextGPT: Infusing LLMs Knowledge into Neuro-Symbolic Activity Recognition Models

llm-xar software on GitHub