Daniel D'souza (@mrdanieldsouza) Twitter Tweets • TwiCopy

Daniel D'souza 

@mrdanieldsouza

Research Engineer @Cohere_Labs💙 | @UMichECE Alum 〽️ | 🇮🇳✖️🇺🇸 💫"The Universe Works in Mysterious Ways"💫

ID: 796430891652415496

Website: https://www.danieldsouza.me

Joined: 09-11-2016 19:14:56

1.1K Tweets

717 Followers

931 Following

Moritz Laurer

Hidden gem: The Cohere Labs speaker series. Every week you can just drop into a call where some of the best ML/AI researchers present their latest findings. From Microsoft Research's Jianwei Yang on multimodal agents, to New York University's Eugene Vinitsky 🍒🦋 on Self-Play, to up-and-coming

38 Likes

2 Replies

16 Retweets

Srishti Gureja

@srishti_gureja

a month ago

Our paper M-RewardBench got accepted to ACL main: arxiv.org/abs/2410.15522 We construct the first-of-its-kind multilingual RM evaluation benchmark and leverage it to look into the performances of several Reward Models in non-English settings along w/ other interesting insights.

100 Likes

5 Replies

10 Retweets

Cohere Labs

@cohere_labs

a month ago

How can we make language models more flexible to adapt to new languages after pretraining? 🌏 🧠 Our latest work investigates whether a tokenizer trained on more languages than the pretraining target can improve language plasticity without compromising pretraining performance.

79 Likes

1 Reply

20 Retweets

Diana Abagyan

@dianaabagyan

a month ago

🚨New pretraining paper on multilingual tokenizers 🚨 Super excited to share my work with Cohere Labs: One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers

90 Likes

3 Replies

29 Retweets

Ahmet Üstün

@ahmetustun89

a month ago

An excellent work by Diana Abagyan💎 We show that a "universal" tokenizer, covering more than just primary languages, greatly boosts new language adaptation without hurting pretraining performance 🚀 A very critical study for multilingual LLMs given huge cost of pretraining🔥

18 Likes

0 Replies

6 Retweets

Sara Hooker

@sarahookr

a month ago

Thanks AK for the spotlight on our work. I really believe strongly in this wider direction of taking the pressure off everyday users to be master prompt engineers and inferring controllability directly from tasks.

65 Likes

5 Replies

10 Retweets

Daniel D'souza 

@mrdanieldsouza

a month ago

Thanks for the feature AK ! 😄🤝

14 Likes

0 Replies

1 Retweet

Ahmet Üstün

@ahmetustun89

a month ago

Can we train models for better inference-time control instead of over-complex prompt engineering❓ Turns out the key is in the data: adding fine-grained markers boosts performance and enables flexible control at inference🎁 Huge congrats to Daniel D'souza for this great work

19 Likes

0 Replies

8 Retweets

Cohere Labs

@cohere_labs

a month ago

🤹 How do we move away from complicated and brittle prompt engineering at inference for under-represented tasks?🤔 🧠 Our latest work finds that optimizing training protocols improves controllability and boosts performance on underrepresented use cases at inference time 📈

18 Likes

2 Replies

10 Retweets

Daniel D'souza 

@mrdanieldsouza

a month ago

🤝Arbitration is the future 🤝 “Why rely on a single teacher 🧑🏻‍🏫 when you can synthetically generate a much higher quality dataset by relying on specialized teacher models? 🧑🏻‍🏫👩‍🏫👨🏿‍🏫” Check out this fantastic summary of our recently accepted ACL 2025 work ✨

20 Likes

0 Replies

4 Retweets

Cohere Labs

@cohere_labs

a month ago

We’re proud to have released 9 open models — all built to support research, experimentation, and real-world impact. 🌎 These models reflect our commitment to building powerful, accessible tools that can accelerate progress across machine learning and beyond.

42 Likes

1 Reply

9 Retweets

Daniel D'souza 

@mrdanieldsouza

a month ago

Yooo 👀 and there’s pizza!?! 🍕

8 Likes

0 Replies

3 Retweets
