
Francisco Casacuberta
Publications
2023
How Much Does Tokenization Affect Neural Machine Translation? Journal Article
In: Lecture Notes in Computer Science, vol. 13451, pp. 545–554, 2023.
2022
Limitations and Challenges of Unsupervised Cross-lingual Pre-training Inproceedings
In: Proceedings of the 15th Conference of the Association for Machine Translation in the Americas (AMTA 2022), pp. 175–187, 2022.
English-Russian Data Augmentation for Neural Machine Translation Inproceedings
In: Proceedings of the First Workshop on Corpus Generation and Corpus Augmentation for Machine Translation (CoCo4MT 2022), pp. 1–9, 2022.
Turning Machine Translation Metrics into Confidence Measures Inproceedings
In: Proceedings of the 27th International Conference on Application of Natural Language to Information Systems (NLDB 2022), pp. 479–489, 2022.
PRHLT's Submission to WLAC 2022 Inproceedings
In: Proceedings of the Seventh Conference on Machine Translation (WMT 2022), pp. 1182–1186, 2022.
Findings of the Word-Level AutoCompletion Shared Task in WMT 2022 Inproceedings
In: Proceedings of the Seventh Conference on Machine Translation (WMT 2022), pp. 812–820, 2022.
On the Effectiveness of Quasi Character-Level Models for Machine Translation Inproceedings
In: Proceedings of the 15th Conference of the Association for Machine Translation in the Americas (AMTA 2022), pp. 131–141, 2022.
Few-Shot Regularization to Tackle Catastrophic Forgetting in Multilingual Machine Translation Inproceedings
In: Proceedings of the 15th Conference of the Association for Machine Translation in the Americas (AMTA 2022), pp. 188–199, 2022.
Spanish Lipreading in Realistic Scenarios: the LLEER project Inproceedings
In: Proceedings of the XII Jornadas en Tecnologías del Habla and VIII Iberian SLTech Workshop (iberSPEECH 2022), pp. 241–245, 2022.
Incremental Vocabularies in Machine Translation through Aligned Embedding Projections Inproceedings
In: Proceedings of the 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2022), 2022.
An Interactive Machine Translation Framework for Modernizing the Language of Historical Documents Inproceedings
In: Proceedings of the 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2022), pp. 41–53, 2022.
On the Use of Mouse Actions at the Character Level Journal Article
In: Information, vol. 13, no. 6, 2022.
Neural Models for Measuring Confidence on Interactive Machine Translation Systems Journal Article
In: Applied Sciences, vol. 12, no. 3, 2022.
2021
Quasi Character-Level Transformers to Improve Neural Machine Translation on Small Datasets Inproceedings
In: Proceedings of the 5th International Workshop on Advances in Natural Language Processing (ANLP 2021), 2021.
A Comparison of Character-Based Neural Machine Translations Techniques Applied to Spelling Normalization Inproceedings
In: Proceedings of the 25th International Conference on Pattern Recognition (ICPR 2020), pp. 326–338, 2021.
Confidence Measures for Interactive Predictive Neural Machine Translation Inproceedings
In: Proceedings of the XI Jornadas en Tecnologías del Habla and VII Iberian SLTech Workshop (iberSPEECH 2020), pp. 195–199, 2021.
2020
MISMIS: Misinformation and Miscommunication in social media: aggregating information and analysing language Journal Article
In: Procesamiento del Lenguaje Natural, no. 65, pp. 101–104, 2020.
Combining Embeddings of Input Data for Text Classification Journal Article
In: Neural Processing Letters, pp. 1–29, 2020.
Modernizing historical documents: A user Study Journal Article
In: Pattern Recognition Letters, vol. 133, pp. 151–157, 2020.
NICE: Neural Integrated Custom Engines Inproceedings
In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (EAMT 2020), pp. 329–338, 2020.
The CARABELA Project and Manuscript Collection: Large-Scale Probabilistic Indexing and Content-based Classification Inproceedings
In: Proceedings of the 17th International Conference on Frontiers in Handwriting Recognition (ICFHR 2020), pp. 85–90, 2020.
Minería de argumentación en el Referéndum del 1 de Octubre de 2017 Journal Article
In: Procesamiento del Lenguaje Natural, no. 65, pp. 59–66, 2020.
A User Study of the Incremental Learning in NMT Inproceedings
In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (EAMT 2020), pp. 319–328, 2020.
2019
Filtering of Noisy Parallel Corpora Based on Hypothesis Generation Inproceedings
In: Proceedings of the Fourth Conference on Machine Translation (WMT 2019), pp. 284–290, 2019.
Enriching Character-Based Neural Machine Translation with Modern Documents for Achieving an Orthography Consistency in Historical Documents Inproceedings
In: Proceedings of the 20th International Conference on Image Analysis and Processing (ICIAP 2019), pp. 59–69, 2019.
Incremental Adaptation of NMT for Professional Post-editors: A User Study Inproceedings
In: Proceedings of the Machine Translation Summit 2019, pp. 219–227, 2019.
Multi-input CNN for Text Classification in Commercial Scenarios Inproceedings
In: Proceedings of the 15th International Work-Conference on Artificial Neural Networks (IWANN 2019), pp. 596–608, 2019.
Vector sentences representation for data selection in statistical machine translation Journal Article
In: Computer Speech & Language, vol. 56, pp. 1–16, 2019.
Demonstration of a Neural Machine Translation System with Online Learning for Translators Inproceedings
In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pp. 70–74, 2019.
Interactive-predictive System for Multimodal Sequence to Sequence Tasks Inproceedings
In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), pp. 81–86, 2019.
Online Learning for Effort Reduction in Interactive Neural Machine Translation Journal Article
In: Computer Speech & Language, vol. 58, pp. 98–126, 2019.
Discriminative ridge regression algorithm for adaptation in statistical machine translation Journal Article
In: Pattern Analysis and Applications, vol. 22, no. 4, pp. 1293–1305, 2019.
2018
A Machine Translation Approach for Modernizing Historical Documents Using Backtranslation Inproceedings
In: Proceedings of the 15th International Workshop on Spoken Language Translation (IWSLT 2018), pp. 29–47, 2018.
Active Learning for Interactive Neural Machine Translation of Data Streams Inproceedings
In: Proceedings of the 22nd Conference on Computational Natural Language Learning (CoNLL 2018), pp. 151–160, 2018.
Creating the best development corpus for Statistical Machine Translation systems Inproceedings
In: Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT 2018), pp. 99–108, 2018.
Data selection for NMT using Infrequent n-gram Recovery Inproceedings
In: Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT 2018), pp. 219–226, 2018.
Are Automatic Metrics Robust and Reliable in Specific Machine Translation Tasks? Inproceedings
In: Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT 2018), pp. 89–98, 2018.
Spelling Normalization of Historical Documents by Using a Machine Translation Approach Inproceedings
In: Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT 2018), pp. 129–137, 2018.
NMT-Keras: a Very Flexible Toolkit with a Focus on Interactive NMT and Online Learning Journal Article
In: The Prague Bulletin of Mathematical Linguistics, vol. 111, pp. 113–124, 2018.
Egocentric video description based on temporally-linked sequences Journal Article
In: Journal of Visual Communication and Image Representation, vol. 50, pp. 205–216, 2018.
2017
VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering Inproceedings
In: Proceedings of the 8th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2017), pp. 372–380, 2017.
Segment-based interactive-predictive machine translation Journal Article
In: Machine Translation, vol. 31, no. 4, pp. 163–185, 2017.
The CLIN27 Shared Task: Translating Historical Text to Contemporary Language for Improving Automatic Linguistic Annotation Journal Article
In: Computational Linguistics in the Netherlands Journal, vol. 7, pp. 53–64, 2017.
Adapting Neural Machine Translation with Parallel Synthetic Data Inproceedings
In: Proceedings of the Second Conference on Machine Translation (WMT 2017), pp. 138–147, 2017.
Historical Documents Modernization Inproceedings
In: Proceedings of the 20th Annual Conference of the European Association for Machine Translation (EAMT 2017), pp. 295–306, 2017.
Neural Networks Classifier for Data Selection in Statistical Machine Translation Inproceedings
In: Proceedings of the 20th Annual Conference of the European Association for Machine Translation (EAMT 2017), pp. 283–294, 2017.
Log-Linear Weight Optimization Using Discriminative Ridge Regression Method in Statistical Machine Translation Journal Article
In: Lecture Notes in Computer Science, vol. 10255, pp. 32–41, 2017.
Traducción automática neuronal Journal Article
In: Tradumàtica, no. 15, pp. 66–74, 2017.