Publication detail

Automatic Language Identification System

ČERNOCKÝ, J. MATĚJKA, P. BURGET, L. SCHWARZ, P.

Original Title

Automatic Language Identification System

Type

article in a collection out of WoS and Scopus

Language

English

Original Abstract

This paper presents the language identification (LID) systemdeveloped in Speech@FIT. The system consists of twoparts: Acoustic LID determines the language directly on thebasis of features derived from the speech signal. We haveimproved existing approaches by adding discriminative trainingof acoustic models. In phonotactic LID, speech is firsttranscribed by phoneme recognizer into strings or graphs (lattices)of phonemes. On these, language models are trainedto capture statistics of sequences of phonemes. We have pioneeredthe use of so called îanti-modelsî for this task. All experimentalresults are reported on standard NIST 2003 data;comparison with other published results is favorable to oursystem.

Keywords

speech processing, automatic language identification

Authors

ČERNOCKÝ, J.; MATĚJKA, P.; BURGET, L.; SCHWARZ, P.

RIV year

2006

Released

24. 1. 2006

Publisher

University of Defence in Brno

Location

Brno

Pages from

1

Pages to

6

Pages count

6

URL

BibTex

@inproceedings{BUT22285,
  author="Jan {Černocký} and Pavel {Matějka} and Lukáš {Burget} and Petr {Schwarz}",
  title="Automatic Language Identification System",
  booktitle="Sborník příspěvků z odborného semináře {"}Nové technologie v radiokomunikacích{"}",
  year="2006",
  pages="1--6",
  publisher="University of Defence in Brno",
  address="Brno",
  url="http://www.fit.vutbr.cz/~cernocky/publi/2006/acr2006.pdf"
}