SEA_AP: a segmentation and labelling tool for prosodic analysis

  1. Paula López Otero
  2. Laura Docío Fernández
  3. Carmen García Mateo
  4. Marta Martínez Maquieira
  5. Rocío Varela Fernández
  6. Elisa Fernández Rei
Journal:
Dialectologia

ISSN: 2013-2247

Year of publication: 2016

Issue: 6

Pages: 233-244

Type: Article

Abstract

This paper introduces a tool that segments and labels speech into phones, syllables and/or words, starting from a speech signal and its corresponding orthographic transcription. It also integrates acoustic analysis scripts for the Praat program, with the aim of reducing the time spent on analysing, correcting, smoothing and plotting the melodic curve. The tool is implemented for Galician, Spanish and Brazilian Portuguese. With this application we aim to help automate some of the tasks of segmentation, labelling and prosodic analysis, which require a large investment of time and human resources.
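
As a rough illustration of what a segmentation into phones, syllables and words could look like once it is handed to Praat, the following minimal Python sketch stores each level as a tier of labelled time intervals and serializes the tiers in Praat's long TextGrid text format. The abstract does not state the tool's output format, so the use of TextGrid here is an assumption; the names `Interval`, `to_textgrid` and `EXAMPLE_TIERS`, as well as the time values for the word "casa", are hypothetical and invented for the example.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Interval:
    start: float  # start time in seconds
    end: float    # end time in seconds
    label: str    # phone, syllable or word label


def to_textgrid(tiers: Dict[str, List[Interval]], duration: float) -> str:
    """Serialize named interval tiers to Praat's long TextGrid text format.

    Each tier is expected to cover 0..duration with contiguous intervals.
    """
    lines = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        '',
        'xmin = 0',
        f'xmax = {duration}',
        'tiers? <exists>',
        f'size = {len(tiers)}',
        'item []:',
    ]
    for t, (name, intervals) in enumerate(tiers.items(), start=1):
        lines += [
            f'    item [{t}]:',
            '        class = "IntervalTier"',
            f'        name = "{name}"',
            '        xmin = 0',
            f'        xmax = {duration}',
            f'        intervals: size = {len(intervals)}',
        ]
        for i, iv in enumerate(intervals, start=1):
            # Praat text files escape a double quote by doubling it.
            text = iv.label.replace('"', '""')
            lines += [
                f'        intervals [{i}]:',
                f'            xmin = {iv.start}',
                f'            xmax = {iv.end}',
                f'            text = "{text}"',
            ]
    return '\n'.join(lines) + '\n'


# Invented alignment for the single word "casa" (illustrative values only).
EXAMPLE_TIERS = {
    "phones": [
        Interval(0.00, 0.08, "k"),
        Interval(0.08, 0.18, "a"),
        Interval(0.18, 0.28, "s"),
        Interval(0.28, 0.40, "a"),
    ],
    "syllables": [
        Interval(0.00, 0.18, "ka"),
        Interval(0.18, 0.40, "sa"),
    ],
    "words": [
        Interval(0.00, 0.40, "casa"),
    ],
}

if __name__ == "__main__":
    print(to_textgrid(EXAMPLE_TIERS, duration=0.40))
```

Saving the printed text as a `.TextGrid` file next to the matching recording should allow both to be opened together in Praat for manual correction of the boundaries.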
