
The analysis of emotions expressed in spoken language holds a pivotal role in human communication, artificial intelligence, and human-computer interaction. While emotion recognition in text has seen considerable advancements, recognizing emotional states in spoken speech presents distinct challenges and opportunities. This research introduces an innovative approach that harnesses Mel Frequency Cepstral Coefficients (MFCC) and Long Short-Term Memory (LSTM) networks to facilitate deep emotion recognition in spoken speech signals. This study explores the profound potential of the MFCC-LSTM framework, a combination of established audio feature extraction and deep learning. Mel Frequency Cepstral Coefficients offer a powerful representation of spectral features over time, while LSTM networks excel at modeling temporal dependencies. This system classifies emotional states such as sadness, angry, neutral, and happiness from the speaker's utterances. Several performance assessments were carried out on the suggested MFCC-LSTM model. There is significant improvement in the recognition rates when compared with other models that are currently available. The proposed hybrid model reached 96 % recognition success.
sentiments, speech emotion recognition, mel frequency cepstral coefficients, deep learning, long short-term memory network, TA1-2040, Engineering (General). Civil engineering (General)
sentiments, speech emotion recognition, mel frequency cepstral coefficients, deep learning, long short-term memory network, TA1-2040, Engineering (General). Civil engineering (General)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
