Zhiyuan (@zhiyuancs) 's Twitter Profile
Zhiyuan

@zhiyuancs

PhD student in @NUSingapore

ID: 984846826325950464

Link: https://zhiyuanhubj.github.io/
Joined: 13-04-2018 17:32:35

39 Tweets

315 Followers

184 Following

Mengyue Yang ✈️ ICLR 2025 (@mengyue_yang_) 's Twitter Profile Photo

🚀 Excited to announce our World Models: Understanding, Modelling and Scaling Workshop at #ICLR2025! 🎉 Keynote speakers, panellists, and submission guidelines are live now! Check out: 👉 sites.google.com/view/worldmode… Join us as we explore World Understanding, Sequential Modelling,

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

We are excited to announce that our workshop will be held on April 28 in Singapore. Due to numerous requests for extensions, we have decided to extend the submission deadline by 4 days to February 6 (AoE). We look forward to receiving your submissions and can't wait to see you at

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

🚀 Call for Reviewers! 🚀 Our Workshop on Reasoning and Planning for LLMs at ICLR 2025 has received an overwhelming number of submissions! We are looking for reviewers to help ensure a high-quality selection process. 🔹 Max 2 papers per reviewer 🔹 Review deadline:

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

🚀 Exciting news! The ICLR 2025 LLM Reasoning & Planning Workshop is offering several Student Registration Grants to support early-career researchers 💡 Free ICLR registration for in-person full-time students! Apply by March 2, 2025. More info: …shop-llm-reasoning-planning.github.io Submit

Nuo Chen (@nuojohnchen) 's Twitter Profile Photo

Welcome to use JudgeLRM! Compare any Hugging Face language models by asking your own questions, and explore JudgeLRM’s reasoning and detailed comparisons! Demo: huggingface.co/spaces/nuojohn… Paper: huggingface.co/papers/2504.00… Model: huggingface.co/nuojohnchen/Ju… Code: github.com/NuoJohnChen/Ju… We

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

Although the ICLR main conference is coming to an end, we are excited to invite you to the Reasoning and Planning for LLMs Workshop, which will be held all day on Monday, April 28. We are honored to host an outstanding lineup of keynote speakers and panelists from Meta, OpenAI,

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

🚀 Beyond 'aha': toward Meta‑Abilities Alignment! By self‑synthesizing training tasks & self‑verifying rewards with zero human labels, LLMs systematically master core reasoning abilities, rather than relying on emergent 'aha' moments, and generalize across math ⚙️, code 💻, science 🔬. Meta‑ability

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

🚀 Beyond “aha”: toward Meta‑Abilities Alignment! Zero human annotation enables LRMs to master strong reasoning abilities, rather than relying on emergent “aha” moments, and to generalize across math ⚙️, code 💻, science 🔬. Meta‑ability alignment lifts the ceiling of further domain‑RL: 7B → 32B

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

I can’t believe this jaw‑dropping comic was generated by GPT just by feeding it our paper directly🤯! It perfectly illustrates how meta‑ability training makes LRMs think better.

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

🚨🚨Reviewed around 20 papers for @ACMMM—but our own reviews were hidden & forced on us without expertise match. Time to rethink AI community peer review. 🤔 Our author team were assigned nearly 20 papers with no regard for our areas of expertise, received only a single round of

Victor.Kai Wang (@victorkaiwang1) 's Twitter Profile Photo

Customizing Your LLMs in seconds using prompts 🥳! Excited to share our latest work with HPC-AI Lab, VITA Group, Konstantin Schürholt, Yang You, Michael Bronstein, Damian Borth: Drag-and-Drop LLMs (DnD). 2 features: tuning-free, comparable or even better than full-shot tuning. (🧵1/8)

Li Junnan (@lijunnan0409) 's Twitter Profile Photo

🚀Introducing GTA1 – our new GUI Agent that leads the OSWorld leaderboard with a 45.2% success rate, outperforming OpenAI's CUA! GTA1 improves two core components of GUI agents: Planning and Grounding. 🧠 Planning: A generic test-time scaling strategy that concurrently samples

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

How to determine which idea is most promising to scale up? Feedback from the chat with Sora 2 researcher: Even in big tech, you must prove a method's worth scaling up. Key hint? Under fixed compute and targeted perspectives (e.g., deep reasoning in LLMs or physical

Weihao Tan (@weihaotan64) 's Twitter Profile Photo

🚀Introducing Lumine, a generalist AI agent trained within Genshin Impact that can perceive, reason, and act in real time, completing hours-long missions and following diverse instructions within complex 3D open-world environments.🎮 Website: lumine-ai.org 1/6

Zhiyuan (@zhiyuancs) 's Twitter Profile Photo

Looking forward to chatting more about reasoning and information-theoretic views of LLM/VLM at NeurIPS — don’t miss our presentation! #NeurIPS2025
