Edoardo Ponti (@pontiedoardo) 's Twitter Profile
Edoardo Ponti

@pontiedoardo

Assistant Professor in #NLP at @EdinburghUni and visiting professor @nvidia | PhD @Cambridge_Uni | A Kleene star shines on the hour of our meeting

ID: 1035496901800587264

linkhttps://ducdauge.github.io/ calendar_today31-08-2018 11:57:54

425 Tweet

2,2K Followers

452 Following

Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

Thanks for acknowledging Dynamic Token Pooling as a predecessor to H-Net, Albert Gu! We had some decent ideas in that paper (e2e and entropy-based tokenisation), but it surprises me that it took 2 years (an eternity in NLP) to find the right recipe and scale better than BPE

Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

If you are at ICML Conference make sure to attend Adrian Lancucki’s invited talk on our inference-time *hyper*-scaling paper (and more!) at the tokenization workshop this Friday tokenization-workshop.github.io/schedule/

Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

We blend imitation (SFT) and exploration (RLVR) in post-training with a simple idea: Sample a prefix of an SFT demonstration, let your policy model complete it, and mix it with other RLVR rollouts Intuitively, the model relies more on hints for problems currently out of reach

Simone Scardapane (@s_scardapane) 's Twitter Profile Photo

*The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs* by Piotr Nawrot Edoardo Ponti Kelly Marchisio (St. Denis) Sebastian Ruder They study sparse attention techniques at scale, comparing to small dense models at the same compute budget. arxiv.org/abs/2504.17768

*The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs*
by <a href="/p_nawrot/">Piotr Nawrot</a> <a href="/PontiEdoardo/">Edoardo Ponti</a> <a href="/cheeesio/">Kelly Marchisio (St. Denis)</a> <a href="/seb_ruder/">Sebastian Ruder</a>

They study sparse attention techniques at scale, comparing to small dense models at the same compute budget.

arxiv.org/abs/2504.17768
Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

Reach out to Yifu Qiu if you’re looking for a research scientist starting next year! He is extremely talented and he’s been doing fantastic research on world models inside general-purpose LLMs/VLMs

Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

A feast of SAC Highlights awards at ACL 2025 for my students! Lead authors: - Nina Gregorio & Matteo Gay for "The Cross-linguistic Role of Animacy in Grammar Structures" - Giwon Hong for "Mixtures of In-Context Learners" Massive congrats! 2025.aclweb.org/program/awards/

Bonan Zhao / 赵博囡 (@bonanzhao) 's Twitter Profile Photo

My Lab at the University of Edinburgh🇬🇧 has funded PhD positions for this cycle! We study the computational principles of how people learn, reason, and communicate. It's a new lab, and you will be playing a big role in shaping its culture and foundations. Spread the words!

My Lab at the University of Edinburgh🇬🇧 has funded PhD positions for this cycle!

We study the computational principles of how people learn, reason, and communicate. 

It's a new lab, and you will be playing a big role in shaping its culture and foundations.

Spread the words!
Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

With SEMI🌓, you can integrate entirely new modalities (satellite images, galaxies, inertia measurements, molecules, ...) into LLMs with as few as 32 samples!

St John's College, Cambridge (@stjohnscam) 's Twitter Profile Photo

Were you or someone you know the first generation in the family to go to university? The John Crook Scholarship at St John's offers the chance to study Cambridge University for two further years. Apply by 6pm BST on 15 October 2025 via UCAS Info👇 joh.cam.ac.uk/apply/undergra…

Were you or someone you know the first generation in the family to go to university? 

The John Crook Scholarship at St John's offers the chance to study <a href="/Cambridge_Uni/">Cambridge University</a> for two further years. 

Apply by 6pm BST on 15 October 2025 via <a href="/ucas_online/">UCAS</a>

Info👇
joh.cam.ac.uk/apply/undergra…
ELLIS (@ellisforeurope) 's Twitter Profile Photo

🌍 11 ELLIS Members and Scholars from five countries have received ERC Starting Grants! Congratulations to all awardees! 👏 Last week European Research Council (ERC) awarded 478 grants totaling €761M to support early-career researchers across Europe. 🔗 Learn more: ellis.eu/news/erc-award…

Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

Scaling laws for test-time compute should not depend on the token budget, but on actual runtime! Generation can be sped up (KV cache compression, quantisation, etc.) while mostly maintaining quality, defining better Pareto frontiers: arxiv.org/pdf/2506.05345