Date: March 25
Time: 1:00 PM
Venue: Anfiteatro PA2, Pavilhão de Matemática, Alameda campus
Speaker: João Gante (Hugging Face)
Title: “From Llama 3 to Deepseek R1 and beyond: a year of LLMs in retrospective”
Abstract:
The year 2023 was rich in open-source LLMs, and the community managed to surpass the original ChatGPT model. The wave continued in 2024-2025, and the gap to the best closed-source models has now narrowed to a few months. This talk will go over the major changes in model architecture, training, and inference that pushed the state of the art in LLMs and VLMs over the last year.
Biographical note:
João Gante is a Machine Learning Engineer in the Open-Source team at Hugging Face, leading text generation in the “transformers” library. João has 7 years of experience in the AI industry, as well as a PhD in AI applied to telecommunications from Instituto Superior Técnico.
Priberam is a member of the IST Spin-Off® Community.
The “Priberam Machine Learning Lunch Seminars” are free to attend, subject to prior registration.