Centro de Investigación en Tecnoloxías Intelixentes (CITIUS)
Singular Center
Universidad del País Vasco/Euskal Herriko Unibertsitatea
Lejona, Spain
Publications in collaboration with researchers from Universidad del País Vasco/Euskal Herriko Unibertsitatea (18)
2024
2023
2021
-
A Methodology to Measure the Diachronic Language Distance between Three Languages Based on Perplexity
Journal of Quantitative Linguistics, Vol. 28, No. 4, pp. 306-336
2020
-
Artificial intelligence within the interplay between natural and artificial computation: Advances in data science, trends and applications
Neurocomputing, Vol. 410, pp. 237-270
-
Measuring language distance of isolated European languages
Information (Switzerland), Vol. 11, No. 4
2019
-
Contextualized translations of phrasal verbs with distributional compositional semantics and monolingual corpora
Computational Linguistics, Vol. 45, No. 3, pp. 395-421
2018
-
Measuring language distance among historical varieties using perplexity. Application to European Portuguese.
COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings of the 5th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial 2018
2017
-
A perplexity-based method for similar languages discrimination
VarDial 2017 - 4th Workshop on NLP for Similar Languages, Varieties and Dialects, Proceedings
-
From language identification to language distance
Physica A: Statistical Mechanics and its Applications, Vol. 484, pp. 152-162
2016
-
TweetMT: A parallel microblog corpus
Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
-
TweetLID: a benchmark for tweet language identification
Language Resources and Evaluation, Vol. 50, No. 4, pp. 729-766
-
TweetMT: A parallel microblog corpus
LREC 2016 - Tenth International Conference on Language Resources and Evaluation
-
TweetMT: a parallel microblog corpus
10th conference on International Language Resources and Evaluation (LREC'16) (European Language Resources Association), pp. 2936-2941
2015
-
Overview of TweetMT: A shared task on machine translation of tweets at SEPLN 2015
CEUR Workshop Proceedings
-
TweetNorm: a benchmark for lexical normalization of Spanish tweets
Language Resources and Evaluation, Vol. 49, No. 4, pp. 883-905
2014
-
Overview of TweetLID: Tweet language identification at SEPLN 2014
CEUR Workshop Proceedings
-
TweetNorm es corpus: An annotated corpus for Spanish microtext normalization
Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
2013
-
Introducción a la tarea compartida Tweet-Norm 2013: normalización léxica de tuits en español
XXIX Congreso de la Sociedad Española de Procesamiento de Lenguaje Natural: SEPLN 2013