Yiyang Nan (@yiyangnan)'s Twitter Profile
Yiyang Nan

@yiyangnan

Research Scholar @Cohere_Labs | Prev: @BrownUniversity @UMich

ID: 1022433920023846912

Link: http://nanyyyyyy.github.io | Joined: 26-07-2018 10:50:17

26 Tweets

124 Followers

693 Following

Saurabh Dash (@theycallmemr_) 's Twitter Profile Photo

🚨Preprint Alert
As promised, announcing the Aya Vision Technical Report – detailing the recipe to build SOTA multilingual multimodal models.
Cohere Labs (@cohere_labs) 's Twitter Profile Photo

How do we build multimodal systems that work effectively across the globe? 🌍 Today we release the Aya Vision Technical Report, the detailed recipe behind Aya Vision models, unifying state-of-the-art multilingual capabilities in multimodal and text tasks across 23 languages!

Shan Chen (@shan23chen) 's Twitter Profile Photo

‼️ 1/n Asking your reasoning model to think in a lower-resource language does degrade model performance at the moment. My awesome co-author already communicated the main points in the thread; I'll just share some random things we learned in my 🧵

Yong Zheng-Xin (Yong) (@yong_zhengxin) 's Twitter Profile Photo

🧵 Multilingual safety training/eval is now standard practice, but a critical question remains: Is multilingual safety actually solved?

Our new survey with Cohere Labs answers this and dives deep into:
- Language gap in safety research
- Future priority areas

Thread 👇
Brown CS (@browncsdept) 's Twitter Profile Photo

We're happy to announce that effective as of July 1, 2025, faculty members Stephen Bach and Srinath Sridhar have received named chairs. Steve is now the Eliot Horowitz Assistant Professor in CS and Srinath is the John E. Savage Assistant Professor in CS: cs.brown.edu/news/2025/06/0…
Stephen Bach (@stevebach) 's Twitter Profile Photo

Excited to release Trove, our lightweight, hackable, and easy-to-use toolkit for dense retrieval, which simplifies experiments. Trove makes it easy and efficient to explore all the combinations of your documents and queries for training, and streamlines multi-node evaluations.
Stella Li (@stellalisy) 's Twitter Profile Photo

Excited to share more about Spurious Rewards! Also keep an eye out for some new experiments and an arXiv version coming soon 👀🔜

Cohere Labs (@cohere_labs) 's Twitter Profile Photo

How can we make language models more flexible to adapt to new languages after pretraining? 🌏

🧠 Our latest work investigates whether a tokenizer trained on more languages than the pretraining target can improve language plasticity without compromising pretraining performance.
Diana Abagyan (@dianaabagyan) 's Twitter Profile Photo

🚨New pretraining paper on multilingual tokenizers 🚨

Super excited to share my work with Cohere Labs: One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers
Sara Hooker (@sarahookr) 's Twitter Profile Photo

Huge congrats to Diana Abagyan on her first first-author paper. It was a pleasure collaborating on this work, in which we ask what cheap interventions in pre-training can allow for more language plasticity downstream.

Sara Hooker (@sarahookr) 's Twitter Profile Photo

Thanks AK for the spotlight on our work. I believe strongly in this wider direction: taking the pressure off everyday users to be master prompt engineers, and inferring controllability directly from tasks.

Ahmet Üstün (@ahmetustun89) 's Twitter Profile Photo

Can we train models for better inference-time control instead of over-complex prompt engineering❓ Turns out the key is in the data — adding fine-grained markers boosts performance and enables flexible control at inference 🎁 Huge congrats to Daniel D'souza for this great work!

Jack Clark (@jackclarksf) 's Twitter Profile Photo

Amir Efrati Stephanie Palazzolo Worth reading this research, which showed it has already been turned into cheat-slop and that Meta was one of the worst culprits for gaming it: x.com/singhshiviii/s…

Yong Zheng-Xin (Yong) (@yong_zhengxin) 's Twitter Profile Photo

We see so much work this week about "emergent misalignment", but how is it fundamentally different from LLM jailbreaking research?

I wrote a short blog post about it:
yongzx.substack.com/p/emergent-mis…
Muhammad Khalifa (@mkhalifaaaa) 's Twitter Profile Photo

Last week, I gave a talk at Tsinghua University on scaling test-time compute with generative process verifiers/PRMs that verify reasoning with reasoning.

jietang, thank you for the invite!

TL;DR: A super data-efficient recipe to train generative process verifiers that can scale
Cohere Labs (@cohere_labs) 's Twitter Profile Photo

Proud to support South Korean research. 🇰🇷

Our new grant program is designed to encourage and support the Korean research ecosystem and social impact initiatives.

Let's innovate together.