Publicacións en colaboración con investigadores/as de Universidad del País Vasco/Euskal Herriko Unibertsitatea (18)

2021

  1. A Methodology to Measure the Diachronic Language Distance between Three Languages Based on Perplexity

    Journal of Quantitative Linguistics, Vol. 28, Núm. 4, pp. 306-336

2018

  1. Measuring language distance among historical varieties using perplexity. Application to European Portuguese.

    COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings of the 5th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial 2018

2017

  1. A perplexity-based method for similar languages discrimination

    VarDial 2017 - 4th Workshop on NLP for Similar Languages, Varieties and Dialects, Proceedings

  2. From language identification to language distance

    Physica A: Statistical Mechanics and its Applications, Vol. 484, pp. 152-162

2016

  1. TweeTMT: A parallel microblog corpus

    Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016

  2. TweetLID: a benchmark for tweet language identification

    Language Resources and Evaluation, Vol. 50, Núm. 4, pp. 729-766

  3. TweetMT: A parallel microblog corpus

    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION

  4. TweetMT: a parallel microblog corpus

    10th conference on International Language Resources and Evaluation (LREC'16) (European Language Resources Association), pp. 2936-2941

2014

  1. Overview of TweetLID: Tweet language identification at SEPLN 2014

    CEUR Workshop Proceedings

  2. TweetNorm es corpus: An annotated corpus for Spanish microtext normalization

    Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

2013

  1. Introducción a la tarea compartida Tweet-Norm 2013: normalización léxica de tuits en español

    XXIX Congreso de la Sociedad Española de Procesamiento de Lenguaje Natural: SEPLN 2013