Voice Analysis Using PRAAT Software and Classification of User Emotional State
DOI:
https://doi.org/10.9781/ijimai.2019.03.004Keywords:
Classification, Emotion recognition, Voice Analysis, PRAAT, EMORECAbstract
During the last decades the field of IT has seen an incredible and very rapid development. This development has shown that it is important not only to shift performance and functional boundaries but also to adapt the way human-computer interaction to modern needs. One of the interaction possibilities is a voice control which nowadays can‘t be restricted only to direct commands. The goal of adaptive interaction between man and computer is the human needs understanding. The paper deals with the user's emotional state classification based on the voice track analysis, it describes its own solution - the measurement and the selection process of appropriate voice characteristics using ANOVA analysis and the use of PRAAT software for many voice aspects analysis and for the implementation of own application to classify the user's emotional state from his/her voice. In the paper are presented the results of the created application testing and the possibilities of further expansion and improvement of this solution.Downloads
References
Abramson, L., Marom, I., Petranker R., & Aviezer, H. “Is fear in your head? A comparison of instructed and real-life expressions of emotion in the face and body,” Emotion, vol. 17, no. 3, pp. 557-565, 2017, doi:10.1037/emo0000252.
Baron-Cohen, S. “The Eyes as Window to the Mind.” American Journal of Psychiatry, vol. 174, no. 1, pp. 1-2, 2017, doi:10.1176/appi. ajp.2016.16101188.
Busso, C., S. Parthasarathy, A. Burmania, M. Abdelwahab, N. Sadoughi, and E. M. Provost. “MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception.” IEEE Transactions on Affective Computing, vol. 8, no. 1, pp. 67-80, 2017.
Cowie, R. “The enduring basis of emotional episodes: Towards a capacious overview”, 2015 International Conference on Affective Computing and Intelligent Interaction, ACII 2015, pp. 98. 2015. doi:10.1109/ ACII.2015.7344557.
Damasio, A. R. “Descartes’ error: emotion, reason, and the human brain. “ New York : Avon Books, 1994.
Davletcharova, A., et al. “Detection and Analysis of Emotion from Speech Signal.” Procedia Computer Science, vol. 58, pp. 91-96, 2015, doi:10.1016/j.procs.2015.08.032.
Duffy, E. “An Explanation of “emotional” Phenomena without the use of the Concept emotion.” Journal of General Psychology, vol. 25, no. 2, pp. 283-293, 1941.
Ekman, P. “What Scientists Who Study Emotion Agree about.” Perspectives on Psychological Science, vol. 11, no. 1, pp. 31-34, 2016, doi:10.1177/1745691615596992.
Ekman, P. “Respect in a Pinch.” Science, vol. 337, no. 6099, pp. 1173. 2012. doi:10.1126/science.337.6099.1173.
Haq, S., T. Jan, M. Asif, A. Ali, and N. Ahmad. “Speaker-Dependent Human Emotion Recognition in Unimodal and Bimodal Scenarios.” Proceedings of the Pakistan Academy of Sciences, vol. 52, no. 1, pp. 15- 25, 2015.
Karadogan S. and Larsen J. “Combining Semantic and Acoustic Features for Valence and Arousal Recognition in Speech.” in Proceedings of IEEE 3rd International Workshop on: Cognitive Information Processing, Baiona, pp. 1-6, 2012.
Koolagudi, S. G., Y. V. S. Murthy, and S. P. Bhaskar. “Choice of a Classifier, Based on Properties of a Dataset: Case Study-Speech Emotion Recognition.” International Journal of Speech Technology, vol. 21, no. 1, pp. 167-183, 2018, doi: 10.1007/s10772-018-9495-8.
Kragel, P. A., et al. “Decoding Spontaneous Emotional States in the Human Brain.” PLoS Biology, vol. 14, no. 9, 2016, doi:10.1371/journal. pbio.2000106.
Lövheim, H. “A New Three-Dimensional Model for Emotions and Monoamine Neurotransmitters.” Medical Hypotheses, vol. 78, no. 2, 2012, pp. 341-348.
Perlman, M. and G. Lupyan. “People can Create Iconic Vocalizations to Communicate various Meanings to Naïve Listeners.” Scientific Reports vol. 8, no. 1, 2018, doi: 10.1038/s41598-018-20961-6.
Revathi, A., N. Sasikaladevi, R. Nagakrishnan, and C. Jeyalakshmi. “Robust Emotion Recognition from Speech: Gamma Tone Features and Models.” International Journal of Speech Technology, 2018, doi:10.1007/ s10772-018-9546-1.
Sharar, S. R., A. Alamdari, C. Hoffer, H. G. Hoffman, M. P. Jensen, and D. R. Patterson.. “Circumplex Model of Affect: A Measure of Pleasure and Arousal during Virtual Reality Distraction Analgesia.” Games for Health Journal, vol. 5, no. 3, pp. 197-202, 2016, doi:10.1089/g4h.2015.0046.
Shi, Y., K. A. Peng, B. Chen, Y. Gong, J. Chen, Y. Li, and Q. -J Fu. “Interaction between Speech Variations and Background Noise on Speech Intelligibility by Mandarin-Speaking Cochlear Implant Patients.” Speech Communication, vol. 104, no. 89-94, 2018, doi:10.1016/j. specom.2018.09.007.
Smith, J., A. Tsiartas, V. Wagner, E. Shriberg, and N. Bassiou. 2018. “Crowdsourcing Emotional Speech”. ICASSP, IEEE international conference on acoustics, speech and signal processing – proceedings. 5139 p. 2018. doi:10.1109/ICASSP.2018.8462224.
Swain, M., A. Routray, and P. Kabisatpathy. “Databases, Features and Classifiers for Speech Emotion Recognition: A Review.” International Journal of Speech Technology, vol. 21, no. 1, pp. 93-120, 2018, doi: 10.1007/s10772-018-9491-z.
Trabelsi, I., D. B. Ayed, and N. Ellouze. “Evaluation of Influence of Arousal-Valence Primitives on Speech Emotion Recognition.” International Arab Journal of Information Technology, vol. 15, no. 4, 2018, pp. 756-762.
Truesdale, D. M. and M. D. Pell. “The Sound of Passion and Indifference.” Speech Communication, vol. 99, pp. 124-134. 2018, doi:10.1016/j. specom.2018.03.007.
Uribe, A., A. Gómez, M. Bastidas, O. L. Quintero, and D. Campo. “A Novel Emotion Recognition Technique from Voiced-Speech.” Paper presented at the 2017 IEEE 3rd Colombian Conference on Automatic Control, CCAC 2017 - Conference Proceedings. 2017. doi:10.1109/CCAC.2017.8276451.
Downloads
Published
-
Abstract87
-
PDF46






