The Pattern Recognition and Human Language Technology (PRHLT) research center is composed of researchers from the Universitat Politècnica de València (UPV) working in the areas of Multimodal Interaction, Pattern Recognition, Image Processing (Image Analysis, Computer Vision, Handwritten Text Recognition, Document Analysis) and Language Processing (Speech Recognition and Understanding, Machine Translation, Information Retrieval).

The PRHLT center is an active research entity with important ongoing research projects, technology transfer activities, and research publications.


Big data and deep learning

“Machine Learning is the new electricity.” Deep Learning is a technique that belongs to the Machine Learning field. Machine Learning techniques learn from data, and the amount of available data now grows exponentially year after year. Machine Learning techniques therefore have great potential to solve very complex problems. Big data is the perfect partner, and deep learning techniques are becoming standard thanks to hardware and software advances. In PRHLT we have [...]



Speech processing and dialogue systems

Speech processing covers applications such as speech recognition and understanding, speech-to-speech translation, speech interaction with mobile devices, speaker and domain adaptation, and multimodal speech recognition. Tasks related to dialogue systems include speech-based and multimodal dialogue systems, statistical dialogue models, and automatic dialogue annotation.



Handwritten Text Recognition

Both off-line HTR (document images) and on-line HTR (tablet or e-pen signals) are considered. No prior character or word segmentation is needed. The technology relies on character-level optical models based on Convolutional-Recurrent Neural Networks and Hidden Markov Models, along with Finite-State Lexical and N-Gram Language Models. After model training, for each given text line image, a holistic (“Viterbi”) search provides both an optimal transcription and the corresponding word and character segmentations. [...]
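
As a rough illustration of how a Viterbi search can return a transcription and its segmentation at the same time, the Python sketch below decodes a toy two-character Hidden Markov Model over discrete frame labels. The model, its probabilities, and the function names (`viterbi`, `segments`) are assumptions made up for this example. It does not reproduce the PRHLT system, which combines neural optical models with lexical and language models.

```python
import math


def log(p):
    """Natural log that maps probability 0 to -inf instead of raising."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi(obs, states, log_start, log_trans, log_emit):
    """Return (best_log_prob, best_state_path) for an observation sequence."""
    # best[t][s]: log-probability of the best path that ends in state s at frame t
    best = [{s: log_start[s] + log_emit[s][obs[0]] for s in states}]
    back = [{}]
    for t in range(1, len(obs)):
        best.append({})
        back.append({})
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p] + log_trans[p][s]) for p in states),
                key=lambda item: item[1],
            )
            best[t][s] = score + log_emit[s][obs[t]]
            back[t][s] = prev
    # Trace the back-pointers from the best final state.
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(obs) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return best[-1][last], path


def segments(path):
    """Collapse a frame-level path into (label, first_frame, last_frame) runs."""
    runs = []
    for t, label in enumerate(path):
        if runs and runs[-1][0] == label:
            runs[-1] = (label, runs[-1][1], t)
        else:
            runs.append((label, t, t))
    return runs


# Hypothetical left-to-right model: character "a" followed by character "b".
states = ["a", "b"]
log_start = {"a": log(1.0), "b": log(0.0)}
log_trans = {
    "a": {"a": log(0.5), "b": log(0.5)},
    "b": {"a": log(0.0), "b": log(1.0)},
}
log_emit = {
    "a": {"x": log(0.9), "y": log(0.1)},
    "b": {"x": log(0.2), "y": log(0.8)},
}

score, path = viterbi(["x", "x", "y", "y"], states, log_start, log_trans, log_emit)
print(path)            # ['a', 'a', 'b', 'b']
print(segments(path))  # [('a', 0, 1), ('b', 2, 3)]
```

The decoded path gives the transcription ("ab"), and the runs returned by `segments` give the frame boundaries of each character, so no segmentation has to be supplied beforehand.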



Computer vision

General statistical and syntactic pattern recognition techniques are applied to image analysis and recognition. Applications include OCR and document analysis, medical diagnosis, biometric identification, and image and video retrieval. Related topics: relevance-based image retrieval and biometrics.



Language translation

The activities of the Machine Translation group began some years ago with the use of finite-state models for speech-to-speech translation and for text-to-text translation in limited domains. The group has developed a number of translation models with their corresponding learning algorithms, as well as a number of prototypes for speech translation and computer-assisted translation. Currently, the Machine Translation group is devoted to the development of new interactive-predictive techniques for computer-assisted translation, techniques for [...]



Natural Language Processing

Social media data analysis: author profiling, stance detection, deceptive opinion detection, irony detection and sentiment analysis, mixed-script text analysis, and plagiarism and social copying detection. Author profiling: given a text, what are the author’s traits? The focus is on inferring traits such as gender, age, native language, language variety, and personality from a stylistic analysis of the author’s texts. This is of interest for areas such [...]


Current Projects

Misinformation and Miscommunication in social media: FAKE news and HATE speech (MISMIS-FAKEnHATE)

Although social media are the default channel people use to share information, ideas and opinions, they may paradoxically contribute to the polarization of society, as we recently witnessed in the last presidential elections in the USA and in the Brexit referendum. Every user ends up receiving only the information that matches her personal beliefs and viewpoints, with the risk of intellectual isolation (filter bubble), where beliefs may [...]

Duration: 1 January 2019 to 31 December 2021

DeepHealth: Deep-Learning and HPC to Boost Biomedical Applications for Health

Scientific discovery and innovation in health are expected to advance quickly under the so-called “fourth paradigm of science”, which relies on unifying the traditionally separate and heterogeneous environments of high-performance computing and big data analytics. Under this paradigm, the DeepHealth project will put HPC computing power at the service of biomedical applications, and apply Deep Learning (DL) techniques to large and complex biomedical datasets to support new and more efficient ways of [...]

Duration: 1 January 2019 to 31 December 2021

IBEM: Indexing and search of mathematical expressions on a large scale in massive corpus of printed documents

Large databases of digitized printed scientific documents now exist, and many of them include mathematical expressions. Searching for textual information in these documents is already widely supported by the search engines of the most widely used web browsers. However, searching massive collections of digitized printed scientific documents with queries that are themselves mathematical expressions is a scarcely explored research area. The methods that currently [...]

Duration: 1 November 2018 to 31 October 2020

Social profiling of users (Perfilado social de usuarios)

The proliferation of social networks and the enormous amount of information they generate (big data) give companies a great opportunity to get to know their customers better. However, the volume of data is usually so unmanageable that the main challenge for companies lies in selecting, from that whole corpus, the useful information that can bring them the most value. The main objective of this project [...]

Duration: 13 March 2019 to 31 December 2020

HisClima: Two Centuries of Climate Data (Dos Siglos de Datos Climáticos)

The aim of the project is to create an intelligent platform for extracting information from thousands of handwritten logbook entries that contain data on daily weather conditions, spanning a period of two hundred years and a wide geographical range, which can be of great use to climate change researchers.

Duration: 30 April 2019 to 30 April 2021

Deep learning for adaptive and multimodal interaction in pattern recognition (DeepPattern)

Pattern recognition and machine learning systems are not free of errors in their predictions, so in many cases interaction with the user is necessary. In the interactive pattern recognition paradigm, the multimodal feedback provided by the user in each interaction with the system is exploited both to improve the system's predictions (predictive interactivity) and to improve the [...]

Duration: 1 January 2019 to 31 December 2022

Document Transcription with Interactive Ubiquitous Multimodal platforms (DocTIUM)

This project aims to take a step forward in the development of user-centric intelligent tools for extracting knowledge from historical data. Starting from the recognition of historical document images and including the user in the loop through engaging experiences, we will develop the concept of the big data of the past. As the use case driving the research, we will construct a browser of the memory of communities inspired by the [...]

Duration: 1 January 2019 to 31 December 2021

Latest News


TRANSKRIBUS, winner of the Horizon Impact Award 2020

Transkribus is one of the shortlisted projects for Horizon Impact Award 2020. Transkribus uses artificial intelligence (AI) to access and analyse historical documents and archives, [...]

An artificial intelligence system helps detect fake news and hate messages on social networks

The PRHLT center leads a project that labels the patterns of hate messages and fake news in order to then locate content of the same kind. See the [...]

Degrees of influence on Twitter

The PRHLT research center will develop a tool to allow Vodafone to get to know its customers better. The proliferation of social networks and the enormous amount of information generated by [...]


PRHLT Research Center
Universitat Politècnica de València
Ciudad Politécnica la Innovación
Edif. 8B Acceso N Planta 0
Camí de Vera, s/n
46022 Valencia (VLC), Spain
(+34) 96 387 81 70