Events

Priberam Machine Learning Lunch Seminar – João Gante

PA2 amphitheatre, Mathematics Building, Alameda campus

25 March, at 1 p.m., in PA2 amphitheatre, Mathematics Building, Alameda campus

Date: 25 March
Hour: 1 p.m.
Venue: PA2 amphitheatre, Mathematics Building, Alameda campus

Speaker: João Gante (Hugging Face)
Title: “From Llama 3 to Deepseek R1 and beyond: a year of LLMs in retrospective”

Abstract:

The year 2023 was ripe in open-source LLMs, and the community managed to surpass the original ChatGPT model. The wave continued in 2024-2025, and the gap to the best closed-source models is now reduced to a few months. This talk will go over the major model architecture, training and inference changes that pushed the state-of-the-art in LLMs and VLMs over the last year.

Speaker Bio:

João Gante is a Machine Learning Engineer in the Open-Source team at Hugging Face, leading text generation in the “transformers” library. João has 7 years of experience in the AI industry, as well as a PhD in AI applied to telecommunications from Instituto Superior Técnico.

Priberam is a member of the IST Spin-Off Community®.

The Priberam Machine Learning Lunch Seminars are free of charge. Prior registration is required.

More information and registration.