Yiran Zhao✈️ICLR2025 (@yiran_zhao924) 's Twitter Profile
Yiran Zhao✈️ICLR2025

@yiran_zhao924

CS Ph.D. Candidate @NUSingapore
I’m on the job market and actively looking for a Research Scientist position starting in Fall 2025!

ID: 1455570848920702978

Link: https://zhaoyiran924.github.io/
Joined: 02-11-2021 16:21:58

20 Tweets

113 Followers

207 Following

Michael Qizhe Shieh (@mpulsewidth)

Greedy Coordinate Gradient is a useful method but takes a lot of time to run. We accelerated it by 5.6x using a method called probe sampling.

The key idea behind probe sampling is to use a smaller draft model to filter unpromising candidates in the search. But the difficulty
Yiran Zhao✈️ICLR2025 (@yiran_zhao924)

Introducing our exciting new research on interpreting how large language models process multilingual input! #Multilingual #LLM #NLP

Hou Pong (Ken) Chan (@kenchanhp)

🚀 We released SeaLLM v2.5, which obtains amazing performance on many Southeast Asia language benchmarks. 👇 Check out more about its highlights

Wenxuan Zhang (@wenxuan__zhang)

🚀 Big Update Alert! 
🌏 We're thrilled to announce a significant update on the SeaExam leaderboard, showcasing the best LLMs for Southeast Asian languages! 

Newly added:
✨ LLMs specialized for SEA languages: SeaLLMs-v3, SEA-LIONv2
🌐 Open LLMs emphasizing on multilingual
Wenxuan Zhang (@wenxuan__zhang)

🧠 Sharing some observations from our very recent studies on LLM pruning, as I feel many phenomena already evolved:
* Newer smaller models (~8B) are really tough to prune: e.g., pruning Llama-3 is much much more challenging than Llama-2 (because they are much more sophisticated

wing.nus (@wing_nus)

Did you know a tiny typo can throw off an LLM’s reasoning?🤯Our research shows how even small errors can disrupt AI's step-by-step thinking! We’re exploring what this means for LLM robustness and handling real-world, messy input. #AI aclanthology.org/2024.emnlp-mai… 
1/3
AK (@_akhaliq)

Alibaba just released Babel on Hugging Face

Open Multilingual Large Language Models Serving Over 90% of Global Speakers

introduce two variants: Babel-9B, designed for efficient single-GPU inference and fine-tuning, and Babel-83B, which sets a new standard for open multilingual
Adina Yakup (@adinayakup)

Babel🗼A multilingual LLM supporting 25 languages, released by the Alibaba DAMO team.

Model: huggingface.co/collections/To…
Paper: huggingface.co/papers/2503.00…

✨ 9B/83B chat & base
✨ Supports 25 languages: English, Chinese, Hindi, Spanish, Arabic, French, Bengali, Portuguese, Russian,

Qingfeng Lan (@qingfeng_lan)

🚀RL algorithms are shaping the post-training of LLMs, but how do their objectives connect? In this blog, I explore their relationships and provide a unified perspective through the Policy Gradient Theorem—the backbone of policy gradient methods. Dive in: lancelqf.github.io/note/llm_post_…

Yang Deng (@ydeng_dandy)

I have an opening for fully-funded 6-month visiting PhD student at SMU.

Time: August 2025 - March 2026
Eligibility: PhD students from universities in Europe, North/South America, South-East Asia.
Topic: NLP/LLM

Email me for more details if you are interested~

Wenxuan Zhang (@wenxuan__zhang)

🙋🏻‍♂️ I'm actively recruiting for multiple fully-funded positions in my group at SUTD!

Topic: NLP / LLM / Multimodal

Openings:
1 x Postdoc: start date is flexible
1 x PhD: earliest batch is Spring 2026
2-3 x Visiting students: fully funded for 6-12 months. Open to Bachelor /