Zhiyuan (@zhiyuancs) Twitter Tweets • TwiCopy

Mengyue Yang ✈️ ICLR 2025

a year ago

🚀 Excited to announce our World Models: Understanding, Modelling and Scaling Workshop at #ICLR2025! 🎉 Keynote speakers, panellists, and submission guidelines are live now! Check out: 👉 sites.google.com/view/worldmode… Join us as we explore World Understanding, Sequential Modelling,

thumb_up_off_alt82

chat_bubble_outline0

repeat15

shareShare

Zhiyuan

@zhiyuancs

a year ago

We are excited to announce that our workshop will be held on April 28 in Singapore. Due to numerous requests for extensions, we have decided to extend the submission deadline by 4 days to February 6 (AoE). We look forward to receiving your submissions and can't wait to see you at

thumb_up_off_alt14

chat_bubble_outline0

repeat5

shareShare

Zhiyuan

@zhiyuancs

10 months ago

🚀 Call for Reviewers! 🚀 Our Workshop on Reasoning and Planning for LLMs at ICLR 2025 ICLR 2026 has received an overwhelming number of submissions! We are looking for reviewers to help ensure a high-quality selection process. 🔹 Max 2 papers per reviewer 🔹 Review deadline:

thumb_up_off_alt17

chat_bubble_outline0

repeat7

shareShare

Zhiyuan

@zhiyuancs

10 months ago

🚀 Exciting news! The ICLR 2025 LLM Reasoning & Planning Workshop is offering several Student Registration Grants to support early-career researchers 💡 Free ICLR registration for in-person full-time students! Apply by March 2, 2025. More info: …shop-llm-reasoning-planning.github.io Submit

thumb_up_off_alt34

chat_bubble_outline0

repeat7

shareShare

Nuo Chen

@nuojohnchen

9 months ago

Welcome to use JudgeLRM! Compare any Hugging Face language models by asking your own questions, and explore JudgeLRM’s reasoning and detailed comparisons! Demo: huggingface.co/spaces/nuojohn… Paper: huggingface.co/papers/2504.00… Model: huggingface.co/nuojohnchen/Ju… Code: github.com/NuoJohnChen/Ju… We

thumb_up_off_alt1

chat_bubble_outline1

repeat2

shareShare

Zhiyuan

@zhiyuancs

8 months ago

Although the ICLR main conference is coming to an end, we are excited to invite you to the Reasoning and Planning for LLMs Workshop, which will be held all day on Monday, April 28. We are honored to host an outstanding lineup of keynote speakers and panelists from Meta, OpenAI,

thumb_up_off_alt24

chat_bubble_outline1

repeat10

shareShare

Zhiyuan

@zhiyuancs

7 months ago

🚀 Beyond 'aha': toward Meta‑Abilities Alignment! By self‑synthesizes training tasks & self‑verifies rewards with zero human labels, LLM systematically masters core reasoning abilities rather than aha emerging and generalize across math ⚙️, code 💻, science 🔬. Meta‑ability

thumb_up_off_alt17

chat_bubble_outline1

repeat2

shareShare

Zhiyuan

@zhiyuancs

7 months ago

🚀 Beyond “aha”: toward Meta‑Abilities Alignment! Zero human annotation enables LRMs masters strong reasoning abilities rather than aha emerging and generalize across math ⚙️, code 💻, science 🔬. Meta‑ability alignment lifts the ceiling of further domain‑RL—7B → 32B

thumb_up_off_alt97

chat_bubble_outline2

repeat18

shareShare

Zhiyuan

@zhiyuancs

7 months ago

I can’t believe this jaw‑dropping comic was generated by GPT just by feeding it our paper directly🤯! It perfectly illustrates how meta‑ability training makes LRMs think better.

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Zhiyuan

@zhiyuancs

6 months ago

🚨🚨Reviewed around 20 papers for @ACMMM—but our own reviews were hidden & forced on us without expertise match. Time to rethink AI community peer review. 🤔 Our author team were assigned nearly 20 papers with no regard for our areas of expertise, received only a single round of

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Victor.Kai Wang

@victorkaiwang1

6 months ago

Customizing Your LLMs in seconds using prompts🥳! Excited to share our latest work with HPC-AI Lab, VITA Group, Konstantin Schürholt, Yang You, Michael Bronstein, Damian Borth : Drag-and-Drop LLMs(DnD). 2 features: tuning-free, comparable or even better than full-shot tuning.(🧵1/8)

thumb_up_off_alt97

chat_bubble_outline4

repeat71

shareShare

Li Junnan

@lijunnan0409

5 months ago

🚀Introducing GTA1 – our new GUI Agent that leads the OSWorld leaderboard with a 45.2% success rate, outperforming OpenAI's CUA! GTA1 improves two core components of GUI agents: Planning and Grounding. 🧠 Planning: A generic test-time scaling strategy that concurrently samples

thumb_up_off_alt65

chat_bubble_outline2

repeat16

shareShare

Zhiyuan

@zhiyuancs

2 months ago

How to determine which idea is most promising to scale up? Feedback from the chat with Sora 2 researcher: Even in big tech, you must prove a method's worth scaling up. Key hint? Under fixed compute and targeted perspectives (e.g., deep reasoning in LLMs or physical

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Weihao Tan

@weihaotan64

a month ago

🚀Introducing Lumine, a generalist AI agent trained within Genshin Impact that can perceive, reason, and act in real time, completing hours-long missions and following diverse instructions within complex 3D open-world environments.🎮 Website: lumine-ai.org 1/6

thumb_up_off_alt900

chat_bubble_outline31

repeat149

shareShare

Zhiyuan

@zhiyuancs

19 days ago

Looking forward to chatting more about reasoning and information-theoretic views of LLM/VLM at NeurIPS — don’t miss our presentation! #NeurIPS2025

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare