Detail publikace

Enhancing multilingual recognition of emotion in speech by language identification

SAGHA, H. MATĚJKA, P. GAVRYUOKOVA, M. POVOLNÝ, F. MARCHI, E. SCHULLER, B.

Originální název

Enhancing multilingual recognition of emotion in speech by language identification

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

We investigate, for the first time, if applying model selection based on automatic language identification (LID) can improve multilingual recognition of emotion in speech. Six emotional speech corpora from three language families (Germanic, Romance, Sino-Tibetan) are evaluated. The emotions are represented by the quadrants in the arousal/valence plane, i. e., positive/ negative arousal/valence. Four selection approaches for choosing an optimal training set depending on the current language are compared: within the same language family, across language family, use of all available corpora, and selection based on the automatic LID. We found that, on average, the proposed LID approach for selecting training corpora is superior to using all the available corpora when the spoken language is not known.

Klíčová slova

multilingual emotion recognition, language identification, language families

Autoři

SAGHA, H.; MATĚJKA, P.; GAVRYUOKOVA, M.; POVOLNÝ, F.; MARCHI, E.; SCHULLER, B.

Vydáno

8. 9. 2016

Nakladatel

International Speech Communication Association

Místo

San Francisco

ISSN

1990-9772

Periodikum

Proceedings of Interspeech

Číslo

9

Stát

Francouzská republika

Strany od

2949

Strany do

2953

Strany počet

5

URL

BibTex

@inproceedings{BUT163404,
  author="SAGHA, H. and MATĚJKA, P. and GAVRYUOKOVA, M. and POVOLNÝ, F. and MARCHI, E. and SCHULLER, B.",
  title="Enhancing multilingual recognition of emotion in speech by language identification",
  booktitle="17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION - Proceedings (INTERSPEECH 2016)",
  year="2016",
  journal="Proceedings of Interspeech",
  number="9",
  pages="2949--2953",
  publisher="International Speech Communication Association",
  address="San Francisco",
  doi="10.21437/Interspeech.2016-333",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2016/pdfs/0333.PDF"
}