Carlos D. Martínez
Associate Professor
Publications
2023
Comparing Speaker Adaptation Methods for Visual Speech Recognition for Continuous Spanish Journal Article
In: Applied Sciences, vol. 13, no. 11, 2023.
Consistent Nested Named Entity Recognition in Handwritten Documents via Lattice Rescoring Proceedings Article
In: Proceedings of the 17th International Conference on Document Analysis and Recognition (ICDAR 2023), pp. 255–268, 2023.
Evaluation of Different Tagging Schemes for Named Entity Recognition in Handwritten Documents Proceedings Article
In: Proceedings of the 17th International Conference on Document Analysis and Recognition (ICDAR 2023), pp. 3–16, 2023.
2022
Enhancing the Design of a Conversational Agent for an Ethical Interaction with Children Proceedings Article
In: Proceedings of the XII Jornadas en Tecnologías del Habla and VIII Iberian SLTech Workshop (iberSPEECH 2022), 2022.
Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish Proceedings Article
In: Proceedings of the XII Jornadas en Tecnologías del Habla and VIII Iberian SLTech Workshop (iberSPEECH 2022), pp. 41–45, 2022.
Spanish Lipreading in Realistic Scenarios: the LLEER project Proceedings Article
In: Proceedings of the XII Jornadas en Tecnologías del Habla and VIII Iberian SLTech Workshop (iberSPEECH 2022), pp. 241–245, 2022.
Evaluation of Named Entity Recognition in Handwritten Documents Proceedings Article
In: Proceedings of the 15th IAPR International Workshop on Document Analysis Systems (DAS 2022), 2022.
Guidelines to Develop Trustworthy Conversational Agents for Children Proceedings Article
In: Proceedings of the 20th International Conference on the Ethical and Social Impacts of ICT (ETHICOMP 2022), pp. 359–377, 2022.
LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild Proceedings Article
In: Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022), pp. 2750–2758, 2022.
2021
Generation of Synthetic Sign Language Sentences Proceedings Article
In: Proceedings of the XI Jornadas en Tecnologías del Habla and VII Iberian SLTech Workshop (iberSPEECH 2020), pp. 235–239, 2021.
Analysis of Visual Features for Continuous Lipreading in Spanish Proceedings Article
In: Proceedings of the XI Jornadas en Tecnologías del Habla and VII Iberian SLTech Workshop (iberSPEECH 2020), pp. 220–224, 2021.
2020
Study of the influence of lexicon and language restrictions on computer assisted transcription of historical manuscripts Journal Article
In: Neurocomputing, vol. 390, pp. 12–27, 2020.
2019
Inclusión de actividades de seguimiento en la asignatura de Percepción y su impacto en los resultados académicos Proceedings Article
In: Proceedings of the Jornada de Innovación Docente JIDINF 2019, pp. 73–77, 2019.
Image-speech combination for interactive computer assisted transcription of handwritten documents Journal Article
In: Computer Vision and Image Understanding, vol. 180, pp. 74–83, 2019.
2018
Sign Language Gesture Classification Using Neural Networks Proceedings Article
In: Proceedings of the X Jornadas en Tecnologías del Habla and VI Iberian SLTech Workshop (iberSPEECH 2018), pp. 127–131, 2018.
Exploring E2E speech recognition systems for new languages Proceedings Article
In: Proceedings of the X Jornadas en Tecnologías del Habla and VI Iberian SLTech Workshop (iberSPEECH 2018), pp. 102–106, 2018.
The Vicomtech-PRHLT Speech Transcription Systems for the IberSPEECH-RTVE 2018 Speech to Text Transcription Challenge Proceedings Article
In: Proceedings of the X Jornadas en Tecnologías del Habla and VI Iberian SLTech Workshop (iberSPEECH 2018), pp. 267–271, 2018.
Improving Transcription of Manuscripts with Multimodality and Interaction Proceedings Article
In: Proceedings of the X Jornadas en Tecnologías del Habla and VI Iberian SLTech Workshop (iberSPEECH 2018), pp. 92–96, 2018.
Advances on the Transcription of Historical Manuscripts based on Multimodality, Interactivity and Crowdsourcing Proceedings Article
In: Proceedings of the X Jornadas en Tecnologías del Habla and VI Iberian SLTech Workshop (iberSPEECH 2018), pp. 174–178, 2018.
Comparing different feedback modalities in assisted transcription of manuscripts Proceedings Article
In: Proceedings of the 13th IAPR International Workshop on Document Analysis Systems (DAS 2018), pp. 115–120, 2018.
Transcription of Spanish Historical Handwritten Documents with Deep Neural Networks Journal Article
In: Journal of Imaging, vol. 4, no. 1, 2018.
Multimodality, interactivity, and crowdsourcing for document transcription Journal Article
In: Computational Intelligence, vol. 34, no. 2, pp. 398–419, 2018.
2017
Using Speech and Handwriting in an Interactive Approach for Transcribing Historical Documents Book Chapter
In: Handwriting: Recognition, Development and Analysis, Nova Science Publishers, Inc., pp. 277–295, 2017, ISBN: 978-1-53611-937-4.
Multimodal Crowdsourcing for Transcribing Handwritten Documents Journal Article
In: IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 25, no. 2, pp. 409–419, 2017.
Improving the automatic segmentation of subtitles through conditional random field Journal Article
In: Speech Communication, vol. 88, pp. 83–95, 2017.
Interactive Layout Detection Proceedings Article
In: Proceedings of the 8th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2017), pp. 161–168, 2017.
Sign language gesture recognition using HMM Proceedings Article
In: Proceedings of the 8th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2017), pp. 419–426, 2017.
Spanish Sign Language Recognition with Different Topology Hidden Markov Models Proceedings Article
In: Proceedings of the 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), pp. 3349–3353, 2017.
Baseline Detection on Arabic Handwritten Documents Proceedings Article
In: Proceedings of the 17th ACM Symposium on Document Engineering (DocEng 2017), pp. 193–196, 2017.
2016
Collaborator Effort Optimisation in Multimodal Crowdsourcing for Transcribing Historical Manuscripts Proceedings Article
In: Proceedings of the IX Jornadas en Tecnologías del Habla and V Iberian SLTech Workshop (iberSPEECH 2016), pp. 234–244, 2016.
Context, multimodality, and user collaboration in handwritten text processing: the CoMUN-HaT project Proceedings Article
In: Proceedings of the IX Jornadas en Tecnologías del Habla and V Iberian SLTech Workshop (iberSPEECH 2016), pp. 375–383, 2016.
Read4SpeechExperiments: A Tool for Speech Acquisition from Mobile Devices Proceedings Article
In: Proceedings of the IX Jornadas en Tecnologías del Habla and V Iberian SLTech Workshop (iberSPEECH 2016), pp. 411–417, 2016.
Comparing rule-based and statistical methods in automatic subtitle segmentation for Basque and Spanish Proceedings Article
In: Proceedings of the IX Jornadas en Tecnologías del Habla and V Iberian SLTech Workshop (iberSPEECH 2016), pp. 251–260, 2016.
Dialogue Act Annotation of a Multiparty Meeting Corpus with Discriminative Models Proceedings Article
In: Proceedings of the IX Jornadas en Tecnologías del Habla and V Iberian SLTech Workshop (iberSPEECH 2016), pp. 241–250, 2016.
Impact of automatic segmentation on the quality, productivity and self-reported post-editing effort of intralingual subtitles Proceedings Article
In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 3049–3053, 2016.
Collaborator Effort Optimisation in Multimodal Crowdsourcing for Transcribing Historical Manuscripts Book Chapter
In: Advances in Speech and Language Technologies for Iberian Languages, Springer, pp. 234–244, 2016, ISBN: 978-3-319-49168-4.
A first approach to Arrhythmogenic Cardiomyopathy detection through ECG and Hidden Markov Models Proceedings Article
In: Proceedings of the XXXIV Congreso Anual de la Sociedad Española de Ingeniería Biomédica (CASEIB 2016), pp. 38–41, 2016.
A Multimodal Crowdsourcing Framework for Transcribing Historical Handwritten Documents Proceedings Article
In: Proceedings of the 16th ACM Symposium on Document Engineering (DocEng 2016), pp. 157–163, 2016.
An Interactive Approach with Off-line and On-line Handwritten Text Recognition Combination for Transcribing Historical Documents Proceedings Article
In: Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS 2016), pp. 269–274, 2016.
2015
Combining Handwriting and Speech Recognition for Transcribing Historical Handwritten Documents Proceedings Article
In: Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR 2015), pp. 126–130, 2015.
Multimodal Output Combination for Transcribing Historical Handwritten Documents Proceedings Article
In: Proceedings of the 16th International Conference on Computer Analysis of Images and Patterns (CAIP 2015), pp. 246–260, 2015.
Unsegmented Dialogue Act Annotation and Decoding with N-Gram Transducers Journal Article
In: IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 23, no. 1, pp. 198–211, 2015.
2014
Detection of sexist language by using text classification techniques Proceedings Article
In: Proceedings of the VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (iberSPEECH 2014), pp. 159–168, 2014.
A study of the quality of automatic speech recognition in distributed systems Proceedings Article
In: Proceedings of the VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (iberSPEECH 2014), pp. 119–128, 2014.
An iterative multimodal framework for the transcription of handwritten historical documents Journal Article
In: Pattern Recognition Letters, vol. 35, pp. 195–203, 2014.
A comparative study between generative and discriminative statistical models for unsegmented dialogue annotation Proceedings Article
In: Proceedings of the VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (iberSPEECH 2014), pp. 178–187, 2014.
The Percepción Smart Campus system Proceedings Article
In: Proceedings of the VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (iberSPEECH 2014), pp. 359–366, 2014.
Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers Book Chapter
In: Human Language Technology Challenges for Computer Science and Linguistics, Springer, pp. 264–275, 2014, ISBN: 978-3-319-08957-7.
Active Learning to Speed-Up the Training Process for Dialogue Act Labelling Book Chapter
In: Human Language Technology Challenges for Computer Science and Linguistics, Springer, pp. 253–263, 2014, ISBN: 978-3-319-08957-7.
Speech Recognition on the Percepción Project Proceedings Article
In: Proceedings of the VIII Jornadas en Tecnologías del Habla and IV Iberian SLTech Workshop (iberSPEECH 2014), pp. 321–330, 2014.