Date: 25 March
Hour: 1 p.m.
Venue: PA2 amphitheatre, Mathematics Building, Alameda campus
Speaker: João Gante (Hugging Face)
Title: “From Llama 3 to Deepseek R1 and beyond: a year of LLMs in retrospective”
Abstract:
The year 2023 was ripe in open-source LLMs, and the community managed to surpass the original ChatGPT model. The wave continued in 2024-2025, and the gap to the best closed-source models is now reduced to a few months. This talk will go over the major model architecture, training and inference changes that pushed the state-of-the-art in LLMs and VLMs over the last year.
Speaker Bio:
João Gante is a Machine Learning Engineer in the Open-Source team at Hugging Face, leading text generation in the “transformers” library. João has 7 years of experience in the AI industry, as well as a PhD in AI applied to telecommunications from Instituto Superior Técnico.
Priberam is a member of the IST Spin-Off Community®.
The Priberam Machine Learning Lunch Seminars are free of charge. Prior registration is required.