Automatic generation of nominal phrases for Portuguese and Galician

  1. Domínguez Vázquez, María José 1
  2. Simões, Alberto 3
  3. Bardanca Outeiriño, Daniel 2
  4. Caíña Hurtado, María 1
  5. Iglesias Allones, José Luis 1
  1. Universidade de Santiago de Compostela, Instituto da Lingua Galega - ILG, Santiago de Compostela, Spain
  2. Universidade de Santiago de Compostela, CiTIUS, Santiago de Compostela, Spain
  3. 2Ai, School of Technology, IPCA, Barcelos, Portugal
Natural Language Processing

ISSN: 2977-0424

Year of publication: 2024

Pages: 1-25

Type: Article

DOI: 10.1017/NLP.2024.32 WoS: WOS:001327415700001 Open access (publisher)




This paper presents XeraWord, an innovative tool for automatically generating nominal phrases. XeraWord can be used for a range of tasks, from language teaching and the creation of examples in lexicography to the development of resources for natural language processing. In this area, Xera was the first experiment, allowing the automatic generation of nominal phrases in three languages: German, French and Spanish. The tool has since been extended to support two further languages, Portuguese and Galician.

We start by presenting the theory behind the development of Xera and its new version, XeraWord: the base methodology applied and the natural language processing resources used to support it. We then present TraduWord, a tool developed specifically to construct resources for new languages, which allows the semi-automatic translation of the data required for nominal phrase generation. We discuss the advantages and disadvantages of this tool, analysing the quality of the translated resources and the amount of manual work required to validate and correct them.

