Gabriel de Jesus

Gabriel de Jesus

Researcher (PhD) | IRNLPTetun

I am an affiliated researcher at INESC TEC Porto (Portugal) and the creator of Labadain. My research focuses on language inclusion and digital preservation of Tetun.

About Me

I work at the intersection of language technology and cultural preservation, developing computational algorithms, methods, and resources that advance digital inclusion for Tetun-speaking communities. I have authored and contributed to scientific publications in this area.

I have over 10 years of professional experience in digital governance, information systems, and leadership. My previous roles include Non-Executive Board Member of the National Communications Authority, IT Director at the Major Project Secretariat, and HRIS Manager for the USAID HRH2030 Program in Timor-Leste.

Research Interests

Information Retrieval (IR)

Advancing information retrieval for low-resource languages, particularly Tetun, through efficient indexing, and effective retrieval methods.

Natural Language Processing (NLP)

Developing core NLP capabilities for Tetun, including language modeling, linguistic resource creation, and reproducible research practices.

Retrieval-Augmented Generation (RAG)

Designing and improving RAG systems for Tetun by combining robust retrieval pipelines with generative models and reliable evaluation frameworks.

Agentic AI

Exploring autonomous AI systems for Tetun that integrate NLP and retrieval, with a focus on adaptability, reproducibility, and real-world impact.

Projects

Labadain Platforms

A suite of platforms for the Tetun language, designed to strengthen digital inclusion for Tetun speakers.

Labadain Chat

An AI assistant designed to support access in Tetun.

https://www.labadain.com/

Labadain Search

The first search engine for the Tetun language.

https://search.labadain.com/

Education

PhD in Informatics Engineering

2021-2025

Faculty of Engineering, University of Porto (FEUP), Portugal

Thesis: Text Information Retrieval in Tetun

My doctoral research established foundational methods and resources for Tetun IR, including datasets, algorithms, tools, and baselines that enable reproducible research and the development of search technologies for the language.

View Thesis

MSc in Computer Science

2011-2013

Faculty of Science, University of Porto (FCUP), Portugal

Dissertation: Relatórios e Análise de Tendências de Rede Social Desportiva Playnify

View Dissertation

BInf in Informatics Engineering

2005-2008

Fundação das universidades Portuguesas, Universidade Nacional Timor Lorosa'e (FUP/UNTL), Timor-Leste

Grants

PhD Research

Research project "Pesquisa e recomendação computacional de conteúdo noticioso".

DOI: 10.54499/SFRH/BD/151437/2021

Fundação para a Ciência e a Tecnologia (FCT), 2021–2025

Master Scholarship

Erasmus Mundus ACP scholarship for Master's studies in Computer Science.

European Union, 2011–2013

Summer Schools

Lisbon Machine Learning School (LxMLS)

A week-long intensive program covering machine learning foundations, natural language processing, and deep learning, with a strong focus on real-world applications and hands-on practice.

Lisbon, Portugal (July 18–23, 2022)

European Summer School in Information Retrieval (ESSIR)

Advanced training in information retrieval, including core IR models, evaluation methods, and modern approaches such as neural and multilingual retrieval systems.

Lisbon, Portugal (July 24–29, 2022)

Trainings

Retrieval Augmented Generation (RAG)

Hands-on sessions on designing, building, and optimizing RAG systems, including real-world applications and evaluating tradeoffs between cost, speed, and quality.

Coursera (August 22, 2025)

Course Certificate

Machine Learning Specialization

Hands-on sessions on building and training machine learning models, including supervised learning (linear and logistic regression), neural networks with TensorFlow, decision trees and ensemble methods, as well as applying best practices, unsupervised learning (clustering and anomaly detection), recommender systems, and deep reinforcement learning.

Coursera (February 23, 2023)

Course Certificate

Academic Services

Conference Reviewer

The 15th International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR), Padua, Italy, 2025.

ICTIR, 2025