
Yiran Zhao✈️ICLR2025
@yiran_zhao924
CS Ph.D. Candidate @NUSingapore
I’m on the job market and actively looking for a Research Scientist position starting in Fall 2025!
ID: 1455570848920702978
https://zhaoyiran924.github.io/ 02-11-2021 16:21:58
20 Tweet
113 Followers
207 Following










Thanks Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) for promoting our new model! Yes, we use the model extension method to elevate Babel's performance ceiling, and the results show that it works well.

Babel🗼A multilingual LLM supporting 25 languages, released by the Alibaba DAMO team. Model: huggingface.co/collections/To… Paper: huggingface.co/papers/2503.00… ✨ 9B/83B chat & base ✨ Supports 25 languages: English, Chinese, Hindi, Spanish, Arabic, French, Bengali, Portuguese, Russian,


🚀RL algorithms are shaping the post-training of LLMs, but how do their objectives connect? In this blog, I explore their relationships and provide a unified perspective through the Policy Gradient Theorem—the backbone of policy gradient methods. Dive in: lancelqf.github.io/note/llm_post_…

