
Daniel D'souza
@mrdanieldsouza
Research Engineer @Cohere_Labs💙 | @UMichECE Alum 〽️ | 🇮🇳✖️🇺🇸 💫"The Universe Works in Mysterious Ways"💫
ID: 796430891652415496
https://www.danieldsouza.me
09-11-2016 19:14:56
1.1K Tweets
717 Followers
931 Following

Hidden gem: The Cohere Labs speaker series. Every week you can just drop into a call where some of the best ML/AI researchers present their latest findings. From Microsoft Research's Jianwei Yang on multimodal agents, to New York University's Eugene Vinitsky 🍒🦋 on Self-Play, to up-and-coming
🚨New pretraining paper on multilingual tokenizers 🚨 Super excited to share my work with Cohere Labs: One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers


Excellent work by Diana Abagyan💎 We show that a "universal" tokenizer, covering more than just the primary languages, greatly boosts new language adaptation without hurting pretraining performance 🚀 A critical study for multilingual LLMs given the huge cost of pretraining🔥



Can we train models for better inference-time control instead of over-complex prompt engineering❓ Turns out the key is in the data — adding fine-grained markers boosts performance and enables flexible control at inference🎁 Huge congrats to Daniel D'souza for this great work



