A Transformer-Based Multimodal Emotion Recognition System Integrating Text, Image, and Speech Data

Emotion recognition is a vital part of human-computer interaction: it helps machines interpret human feelings and behavior and respond to people more effectively. Traditional methods rely on a single type of data, such as text, speech, or images, which limits how well they capture complex emotions. This paper presents a transformer-based system that recognizes emotions from text, image, and speech data jointly. The framework uses a dedicated encoder for each modality to extract features and attention mechanisms to combine those features across modalities. Experiments show that the multimodal system outperforms single-modality methods by a considerable margin in accuracy, precision, recall, and F1-score.
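The abstract does not specify how the attention-based combination is implemented, so the following is only a minimal sketch of one common option: scaled dot-product attention pooling over one embedding per modality. The function name `attention_fuse`, the query vector, and the toy embedding values are all hypothetical and chosen for illustration; in a real system the embeddings would come from the modality-specific encoders and the query would be learned.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(query, modality_embeddings):
    """Fuse per-modality embeddings with scaled dot-product attention.

    query: list of floats (hypothetical; would be learned in practice).
    modality_embeddings: dict mapping modality name to an embedding
    of the same length as the query.
    Returns (attention weight per modality, fused embedding).
    """
    dim = len(query)
    names = list(modality_embeddings)
    # Score each modality by its scaled dot product with the query.
    scores = [
        sum(q * k for q, k in zip(query, modality_embeddings[name])) / math.sqrt(dim)
        for name in names
    ]
    weights = softmax(scores)
    # The fused vector is the attention-weighted sum of the embeddings.
    fused = [
        sum(w * modality_embeddings[name][i] for w, name in zip(weights, names))
        for i in range(dim)
    ]
    return dict(zip(names, weights)), fused


# Toy embeddings standing in for the outputs of the three encoders.
embeddings = {
    "text": [0.9, 0.1, 0.0, 0.2],
    "image": [0.2, 0.8, 0.1, 0.0],
    "speech": [0.5, 0.3, 0.6, 0.1],
}
query = [1.0, 0.0, 0.0, 0.0]

weights, fused = attention_fuse(query, embeddings)
print(weights)
print(fused)
```

With this toy query, the text embedding has the largest dot product and therefore receives the largest weight. A full transformer would typically apply multi-head attention over token-level sequences rather than a single pooled vector per modality.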
