Talks and Presentations

Keynotes, talks, and presentations on AI, language technology, and Tetun.

AI Perspective and Labadain Chat

Date: March 26, 2026

Event: Talk show at TVE Timor

Venue: Dili, Timor-Leste

This Q&A explores perspectives on AI development and its impact on the Timorese community, including how Labadain Chat was built.

Labadain Chat: History and Vision

Date: March 12, 2026

Event: Talk show at Radio Liberdade

Venue: Dili, Timor-Leste

This talk show explores the motivation behind Labadain Chat, its background, and the vision for its future.

Labadain Chat: AI for Tetun Speakers

Date: February 26, 2026

Event: Keynote Speaker at the FTH Round Table

Venue: Dili, Timor-Leste

This talk presents Labadain LIX-R361, an agentic AI system for Tetun that improves performance through LLM customization. The system supports use cases such as translation, news generation, and writing assistance, and the talk also highlights opportunities and challenges for AI in Timor-Leste.

AI for Tetun: Building Timor-Leste's Inclusive Digital Future

Date: November 21, 2025

Event: Keynote Speaker at the TLNOG2 Conference

Venue: Dili, Timor-Leste

This talk explores how AI can support Tetun, an official language of Timor-Leste and a low-resource language, by enabling inclusive digital access through language technologies, datasets, and information retrieval systems.

Labadain: The Foundation of Tetun Language Technology

Date: November 20, 2025

Event: Invited Talk at the DEI–FECT–UNTL National Seminar

Venue: Dili, Timor-Leste

This talk introduces Labadain as the foundation of Tetun language technology, showcasing how datasets, tools, and AI systems enable inclusive digital access for Tetun speakers.

Conference Proceedings Talk on the Labadain Crawler Pipeline for Low-Resource Languages

Date: May 22, 2024

Event: Conference proceedings talk at the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Venue: Torino, Italy

This talk presents Labadain Crawler, a web-based data collection pipeline designed for low-resource languages, detailing its architecture, language processing components, and its application to building a high-quality Tetun text corpus.

Conference Proceedings Talk on Labadain-30k+ Dataset Construction

Date: May 20, 2024

Event: Conference proceedings talk at the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages at LREC-COLING 2024

Venue: Torino, Italy

This talk presents Labadain-30k+, a manually audited Tetun text dataset, outlining the data collection pipeline, quality control process, and key insights from content analysis to support NLP and information retrieval research in a low-resource language.