La macroestructura del diccionario (Selección del léxico y lematización)

  1. Rojo, Guillermo (Universidade de Santiago de Compostela, Santiago de Compostela, España)

     ROR https://ror.org/030eybx10

Book:
Lexicografía hispánica
  1. Torner, Sergi (coord.)
  2. Battaner, Paz (coord.)
  3. Renau, Irene (coord.)

Publisher: Routledge/Taylor & Francis Group ; Taylor & Francis

ISBN: 978-1-032-30937-8, 978-0-429-24435-3

Year of publication: 2024

Pages: 219-232

Type: Book chapter

Abstract

Three important changes have occurred in Spanish lexicography in the twenty-first century: the shift from a prescriptive approach to a more descriptive one; the generalized use of computational resources (electronic lexicography); and the use of large textual corpora. These corpora provide data from which to select lemmas, identify and organize word senses and sub-senses, extract syntactic patterns and collocations, and select real instances to illustrate meanings. In this chapter, we address two especially important aspects in the field of corpus studies that have so far hardly been investigated for Spanish: lexicon selection and the main problems of automatic lemmatization as practiced in reference corpora.