Shenzhi Wang🌟 (@shenzhiwang_thu) 's Twitter Profile
Shenzhi Wang🌟

@shenzhiwang_thu

PhD Candidate @Tsinghua_Uni | Developer of 🔥Xwen-7B&72B-Chat🔥, Llama3-8B&70B-Chinese-Chat & 🔥Mistral-7B-v0.3-Chinese-Chat | Research Focuses: RL+LLM+Agent

ID: 1676443035184316416

Link: https://shenzhi-wang.netlify.app | Joined: 05-07-2023 04:09:12

330 Tweets

1.1K Followers

406 Following

Shenzhi Wang🌟 (@shenzhiwang_thu) 's Twitter Profile Photo

(6/n) Download #Xwen #LLM now! BF16 and all kinds of GGUF models are provided. HuggingFace🤗: huggingface.co/collections/sh…
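
For anyone pulling the weights programmatically, a minimal sketch using `huggingface_hub` follows; the repo id is a hypothetical placeholder (the collection link above is truncated), so substitute the actual Xwen repo name from the collection page.

```python
# Minimal sketch: download a model snapshot from the Hugging Face Hub.
# The repo id below is an assumption, not taken from the truncated link above.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(
    repo_id="shenzhi-wang/Xwen-7B-Chat",  # hypothetical repo id; check the collection page
    allow_patterns=["*.gguf"],            # drop this filter to also fetch the BF16 weights
)
print("Downloaded to:", local_dir)
```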

宝玉 (@dotey) 's Twitter Profile Photo

The Xwen models, open-sourced by the Xwen Team made up of members from Tsinghua University and other universities! Trained from the Qwen base models, they come in two sizes, Xwen-72B-Chat and Xwen-7B-Chat, and perform well. Worth following if you care about locally deployed small models.

Adina Yakup (@adinayakup) 's Twitter Profile Photo

Xwen 🔥 a series of open models based on Qwen2.5 models, developed by a brilliant research team of PhD students from the Chinese community. huggingface.co/collections/sh… ✨ 7B/72B ✨ Apache 2.0 ✨ Xwen-72B-Chat outperformed DeepSeek V3 on Arena Hard Auto

Shenzhi Wang🌟 (@shenzhiwang_thu) 's Twitter Profile Photo

@lmarena_ai @OpenAI Hey lmarena.ai (formerly lmsys.org) team, we'd love to see our open-sourced model, Xwen-72B-Chat, included in the Chatbot Arena! 🥹 It supports both English and Chinese, with strong performance across multiple benchmarks. We have sufficient inference compute for API integration. We've sent several

<a href="/lmarena_ai/">lmarena.ai (formerly lmsys.org)</a> <a href="/OpenAI/">OpenAI</a> Hey <a href="/lmarena_ai/">lmarena.ai (formerly lmsys.org)</a> team, we'd love to see our open-sourced model, Xwen-72B-Chat, included in the Chatbot Arena! 🥹 It supports both English and Chinese, with strong performance across multiple benchmarks. We have sufficient inference compute for API integration.

We've sent several
Shenzhi Wang🌟 (@shenzhiwang_thu) 's Twitter Profile Photo

@lmarena_ai @OpenAI Xwen-72B-Chat achieves 86.1 on Arena-Hard-Auto, surpassing DeepSeek-V3 (671B) with only ~1/10th the parameters! 😆 We believe Xwen-72B-Chat can perform well on Chatbot Arena, possibly ranking in the top 10.

<a href="/lmarena_ai/">lmarena.ai (formerly lmsys.org)</a> <a href="/OpenAI/">OpenAI</a> Xwen-72B-Chat achieves 86.1 on Arena-Hard-Auto, surpassing DeepSeek-V3 (671B) with only ~1/10th the parameters!
😆We believe Xwen-72B-Chat can perform well on Chatbot Arena, possibly ranking in the top 10.
LLaMA Factory (@llamafactory_ai) 's Twitter Profile Photo

In modern RLHF frameworks, we see many "batch size" configs. They are designed to maximize GPU utilization, but they also confuse users who are not familiar with systems. To provide a clear view for those who want to train their reasoning models, we explain

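As an illustration of the point (not any particular framework's actual config), here is a hypothetical sketch of how the commonly seen batch-size knobs relate to one another; all names are placeholders.

```python
# Illustrative sketch only: how RLHF/GRPO-style batch-size knobs typically relate.
# Every name here is a hypothetical placeholder, not a specific framework's config key.
rollout_batch_size = 512          # prompts sampled per rollout/generation phase
num_gpus = 8
micro_batch_size = 4              # sequences per GPU per forward/backward pass
gradient_accumulation_steps = 16  # micro-batches accumulated before one optimizer step

# Effective batch size seen by a single parameter update:
global_train_batch_size = micro_batch_size * gradient_accumulation_steps * num_gpus

# Rollout data is usually consumed in a whole number of optimizer steps.
assert rollout_batch_size % global_train_batch_size == 0
print(global_train_batch_size)    # 512 -> one update per rollout in this example
```
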
Qwen (@alibaba_qwen) 's Twitter Profile Photo

Today, we release QwQ-32B, our new reasoning model with only 32 billion parameters that rivals cutting-edge reasoning models such as DeepSeek-R1.

Blog: qwenlm.github.io/blog/qwq-32b
HF: huggingface.co/Qwen/QwQ-32B
ModelScope: modelscope.cn/models/Qwen/Qw…
Demo: huggingface.co/spaces/Qwen/Qw…
Qwen Chat:
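
A minimal sketch of trying the released checkpoint with Hugging Face transformers, using the Qwen/QwQ-32B repo linked above; the prompt and generation settings are illustrative.

```python
# Minimal sketch: load the released QwQ-32B checkpoint and run one chat turn.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/QwQ-32B"  # repo linked in the tweet above
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "How many r's are in the word 'strawberry'?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```
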
Binyuan Hui (@huybery) 's Twitter Profile Photo

🚀 Recently, I've been focusing on RL for LLM, and I'm excited to introduce QwQ-32B—the best open-source reasoning model under 100B scale. RL indeed holds some fascinating yet unexplored mysteries. You're all welcome to continue building more interesting things based on Qwen!

LLaMA Factory (@llamafactory_ai) 's Twitter Profile Photo

LLaMA-Factory now supports fine-tuning (Full/LoRA/QLoRA) of the QwQ-32B model. Time to unleash the power of reasoning models on your personal data 💥
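
For readers unfamiliar with what LoRA fine-tuning does, a rough sketch using the peft library directly is below. This is not LLaMA-Factory's own API (which drives such setups from config files), and the target module names are assumptions typical for Qwen-style attention layers.

```python
# Sketch of what LoRA adapter fine-tuning means under the hood, via peft directly.
# Not LLaMA-Factory's API; target module names are assumptions and should be verified.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/QwQ-32B", torch_dtype="auto", device_map="auto"
)
lora_cfg = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # only the small adapter matrices are trainable
```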

Rui Lu (@raylu_thu) 's Twitter Profile Photo

🚨Ever wonder why diffusion models generate nonsensical text? Our latest study at #ICLR2025 uncovers "Local Generation Bias"—a hidden training bias causing textual hallucinations! 🧠 Key finding: Diffusion models independently generate symbols locally without global context.

Shenzhi Wang🌟 (@shenzhiwang_thu) 's Twitter Profile Photo

Cooragent by our Tsinghua LeapLab (LeapLab@THU): an open-source multi-agent collaboration framework! 🚀 Tell it to "build an AI intelligence secretary" → it auto-scans, curates updates, and delivers daily reports. MIT Licensed | Dev-friendly. Try ⬇️ github.com/LeapLabTHU/coo… #AgenticAI

Yang Yue (@yangyue_thu) 's Twitter Profile Photo

Does RL Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Our new paper investigates this question and has sparked active discussions. In the video, a frequent Q&A starts at 1:28, covering common questions on pass@k, the takeaways, etc. See limit-of-RLVR.github.io
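
For context on the pass@k metric mentioned here, below is the standard unbiased estimator introduced with the Codex/HumanEval evaluation, which this kind of analysis typically relies on: with n samples per problem and c of them correct, it estimates the probability that at least one of k samples passes.

```python
# Standard unbiased pass@k estimator: 1 - C(n - c, k) / C(n, k).
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Probability that at least 1 of k samples is correct, given c of n samples passed."""
    if n - c < k:
        return 1.0  # too few failures to fill k samples without a success
    return 1.0 - comb(n - c, k) / comb(n, k)

# Example: 256 samples with 3 correct -> pass@1 is small, pass@64 is much larger.
print(pass_at_k(256, 3, 1), pass_at_k(256, 3, 64))
```

Comparing pass@k at large k between base and RL-tuned models is exactly the kind of question the video's Q&A addresses.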

Qwen (@alibaba_qwen) 's Twitter Profile Photo

Introducing Qwen3! 

We release and open the weights of Qwen3, our latest large language models, including 2 MoE models and 6 dense models, ranging from 0.6B to 235B. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general
Andrew Zhao (@andrewz45732491) 's Twitter Profile Photo

❄️Introducing Absolute Zero Reasoner: Our reasoner learns to both propose tasks that maximize learnability and improve reasoning by solving them, entirely through self-play—with no external data! It overall outperforms other "zero" models in math & coding domains. 🧵 1/

Shenzhi Wang🌟 (@shenzhiwang_thu) 's Twitter Profile Photo

🔥 Excited to introduce our work: Absolute Zero—training reasoning LLMs with NO DATA via RLVR! 🚀 A new “Absolute Zero” paradigm: models learn to propose and solve tasks, evolving through self‑play. 🏆 AZ Reasoner: SoTA overall performance in math & coding with no human data.
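
A highly simplified, hypothetical sketch of the propose-and-solve self-play loop described in this thread; every name here is illustrative, not the paper's actual code, and the real training procedure and rewards are in the paper.

```python
# Highly simplified, hypothetical sketch of the Absolute Zero-style self-play loop
# described above. All object and function names are illustrative placeholders.
def self_play_step(model, executor, rl_update):
    # 1) The same model proposes a task whose difficulty targets high learnability.
    task = model.propose_task()
    # 2) The model then attempts to solve its own proposed task.
    solution = model.solve(task)
    # 3) A verifiable environment (e.g., a code executor) supplies the reward signal,
    #    so no human-labeled data is needed (the RLVR part).
    solved = executor.verify(task, solution)
    # 4) Both roles are updated with RL: the solver toward correct solutions,
    #    the proposer toward tasks that are neither trivial nor impossible.
    rl_update(model, task, solution, solved)
```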

Shenzhi Wang🌟 (@shenzhiwang_thu) 's Twitter Profile Photo

🔥 Check out our new survey on scaffolded language models that learn beyond parametric updates! 🚀 Applications could be in software agents like Codex that continuously learn and adapt to a user's needs.