Combining greedy algorithms with expert guided manipulation for the definition of a balanced prosodic Spanish-Catalan radio news corpus

D. Escudero-Mancebo*, C. González-Ferreras, J. M. Garrido, E. Rodero, L. Aguilar, A. Bonafonte

*Corresponding author for this work

Research output: Chapter in Book (Chapter, Research, peer-reviewed)

1 Citation (Scopus)

Abstract

This article reports the process of building a bilingual (Spanish-Catalan) text corpus that is balanced in parallel for both languages with respect to prosodic features. We propose an expert guideline for text manipulation that, in combination with greedy algorithms, significantly improves the quality of the selected corpus. Applying this methodology to a radio news corpus empirically supports the proposed strategy.

Original language: English
Title of host publication: 5th International Conference on Speech Prosody 2010
ISBN (Electronic): 9780000000002
Publication status: Published - 2010

Publication series

Name: Proceedings of the International Conference on Speech Prosody
ISSN (Print): 2333-2042
