Events

Priberam Machine Learning Lunch Seminar – Gonçalo Gomes

PA2 amphitheatre, Mathematics Building, Alameda campus

22 April, at 1 p.m., in PA2 amphitheatre, Mathematics Building, Alameda campus

Date: 22 April
Hour: 1 p.m.
Venue: PA2 amphitheatre, Mathematics Building, Alameda campus

Speaker: Gonçalo Gomes (IT/IST/INESC-ID)
Title: “Improving Evaluation Metrics for Vision-and-Language Models”

Abstract:

Evaluating image captions is essential for ensuring both linguistic fluency and accurate semantic alignment with visual content. While reference-free metrics such as CLIPScore have advanced automated caption evaluation, most existing work on learned evaluation metrics remains limited to pointwise English-centric assessments, with significant gaps in terms of reliability, interpretability, and multilingual inclusivity of vision-and-language evaluation metrics. In this seminar session I will explore extensions of current English-centric benchmarks to a multilingual scenario promoting the development of more inclusive frameworks. Additionally, I will present two extensions from CLIPScore metric aiming to improve its interpretability and reliability in real world applications. Leveraging a model-agnostic conformal risk control framework, I will explore the calibration of CLIPScore distributions values for task-specific control variables tackling both granular assessment for individual word errors within captions, and the calibration of these raw distribution scores producing a more reliable interval for captioning evaluation by improving the correlation between uncertainty estimations and prediction errors.

Speaker Bio:

Gonçalo Gomes received a MSc degree in Data Science and Engineering, from Instituto Superior Técnico, Universidade de Lisboa. He is currently a second-year PhD student at the same institution and a junior researcher at the Human Language Technologies Lab of INESC-ID and also at SARDINE Labs of Instituto de Telecomunicações (IT). His research interests focus on developing more informative and trustworthy evaluation frameworks for vision-and-language applications, particularly envisioning a more inclusive AI frameworks for non-English environments.

Priberam is a member of the IST Spin-Off Community®.

The Priberam Machine Learning Lunch Seminars are free of charge. Prior registration is required.

More information and registration.