Andrej Jovanović (@itsmaddox_j)'s Twitter Profile
Andrej Jovanović

@itsmaddox_j

ML Research @ISTAustria. Interested in distributed and collaborative machine learning | Prev MLMI @Cambridge_Eng

ID: 1712141720408797189

Website: https://maddox-j.github.io/ | Date: 11-10-2023 16:23:07

130 Tweets

117 Followers

197 Following

Cambridge ML Systems Lab (@camlsys)

Photon: A New SOTA for Decentralized LLM pre-training at MLSys Conference 2025. Poster today; talk on Thursday. Lorenzo Sani converting one person at a time to the merits of federated everywhere for everything. Paper: arxiv.org/abs/2411.02908

nic lane (@niclane7)

We're hiring at Cambridge ML Systems Lab for engineering support to scale our decentralized AI research. Find me at MLSys Conference this week, or reach out online. Help build the next version of Photon, and an opportunity to also collaborate with Flower. jobs.cam.ac.uk/job/51202/

Cohere Labs (@cohere_labs)

Next Friday, May 23rd, our ML Theory group is excited to host Frederik Kunstner for a session on "Heavy-tailed imbalance and why Adam outperforms gradient descent on language models." Special shoutout to Anier Velasco Sotomayor, Andrej Jovanović, and Thang Chu for organizing this event 🎉

Andrej Jovanović (@itsmaddox_j)

We are super excited to host Frederik next Friday 🪇 Cannot wait to dig deep into why Adam is such a robust optimiser, and why it is the de facto standard for many applications. Do tune in! 📺

Andrej Jovanović (@itsmaddox_j)

Join me to hear about decentralised training, why it works and what opportunities it can unlock 🚀. Many thanks to harsha for the invitation!

Cohere Labs (@cohere_labs)

Our ML Efficiency Group is excited to welcome Andrej Jovanović on Tuesday, June 17th for an insightful presentation of "Communication-efficient training for foundation models through federated learning"

Cohere Labs (@cohere_labs)

Be sure to join our ML Efficiency Group tomorrow, June 17th as they host Andrej Jovanović for a session on "Communication-efficient training for foundation models through federated learning" Learn more: cohere.com/events/Cohere-…

Simon Yu (@simon_ycl)

Introducing RL2: Ray-Less Reinforcement Learning for LLMs 🚀 Want to run RL experiments but tired of complicated abstractions? We've got you covered with <1K lines PPO/REINFORCE implementation: 🎯 Ray Less = Launch RL exps with torchrun just like SFT ⚡ Long-context

Ambroise Odonnat (@ambroiseodonnat)

Here is the recording with the slides for those interested! 🎤 youtu.be/UONvP1TL0-g?fe… 📊 drive.google.com/file/d/14ZIopS… 📑 arxiv.org/pdf/2410.02724 Cohere Labs

Cohere Labs (@cohere_labs)

Join our ML Theory group next week as they welcome Tony S.F. on July 3rd for a presentation on "Training neural networks at any scale" Thanks to Andrej Jovanović, Anier Velasco Sotomayor, and Thang Chu for organizing this session 👏 Learn more: cohere.com/events/Cohere-…

Cohere Labs (@cohere_labs)

Don't forget to join us tomorrow, July 3rd as we host Tony S.F. for a session on "Training neural networks at any scale" Learn more: cohere.com/events/Cohere-…

Eldar Kurtic (@_eldarkurtic)

The Hugging Face folks deserve far more credit for being a pillar of open-source and still managing to push out SOTA results across the board, along with a full write-up of the entire model’s lifecycle.