Estructura argumental del nombre: generación automática

  1. María José Domínguez Vázquez
Journal:
Revista signos: estudios de lingüística

ISSN: 0035-0451, 0718-0934

Year of publication: 2022

Volume: 55

Issue: 110

Pages: 732-761

Type: Article

DOI: 10.4067/S0718-09342022000300732 (open access via publisher)


Abstract

This study presents the linguistic and theoretical foundations, and the methodological phases, that underpin the development of two prototypes for the automatic generation of the argument structure of simple and complex noun phrases in German, Spanish, and French. The design of these tools, Xera and Combinatoria, combines several theoretical and methodological approaches, including valency grammar, semantic prototypes, WordNet ontologies, and natural language processing and generation. Both generators offer a new approach not only to research on lexical combinatorial patterns but also to a more automated development of valency dictionaries.
