Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Bandırma Onyedi Eylu...arrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
addClaim

Emotion Detection from Speeches Using Cochleagram Features

Authors: Özer, İlyas; Efe, Serhat Berat; Özbay, Harun; Özbay, Harun; Özer, İlyas; Efe, Serhat Berat;

Emotion Detection from Speeches Using Cochleagram Features

Abstract

Human-machine interaction is a popular field of study and its importance is increasing day by day. Detecting emotions from speeches has also been an exciting topic in the field of humanmachine interaction. With the spread of technologies such as digital personal assistants, speech-totext models and call center systems, which are the applications of speech, the systems that can detect emotions from speeches that can be seen as the next natural step of this process have become more valuable. In addition, since speech is a physiological process and it is reflected in the speeches of people's mental and physical health, emotion detection studies can be used to evaluate physical, and mental health status. In this study, an approach that uses the features of cochleagram and convolutional neural networks (CNN) was proposed to detect emotions from speech. In the proposed approach, speech recordings are converted to overlapping cochleagrams. It is then converted to a grayscale image. Colored images are created with obtained grayscale images through color maps. After each image is set to the same size, it is classified with the CNN network. The proposed approach provided a 2.8% improvement in the classification performance of the data set, which corresponds to a relative error reduction rate of 19.46% compared to state-of-the-art techniques.

Country
Turkey
Related Organizations
Keywords

speech emotion recognition, convolutional neural networks, cochleagram

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average