Weizhu Chen (@weizhuchen)'s Twitter Profile
Weizhu Chen

@weizhuchen

Microsoft

ID: 14328706

Link: https://www.microsoft.com/en-us/research/people/wzchen/ · Joined: 08-04-2008 02:07:46

161 Tweets

2.2K Followers

212 Following

Weizhu Chen (@weizhuchen):

We updated Phi-3 mini in our June release, with enhancements in instruction following, reasoning (MMLU 70.9 / GPQA 30.6), and better long-context handling. Share your feedback on the new models with us.
huggingface.co/microsoft/Phi-…
huggingface.co/microsoft/Phi-…
Weizhu Chen (@weizhuchen):

We released Phi-4-mini (a 3.8B base LLM), a new SLM excelling in language, vision, and audio through a mixture-of-LoRA that unites three modalities in one model. I am very impressed with its new audio capability. I hope you can play with it and share your feedback with us. We also
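The tweet does not give implementation details, but the general mixture-of-LoRA idea can be sketched: a shared, frozen base weight plus one low-rank adapter per modality, selected by which modality the input comes from. The class and parameter names below are illustrative, not Phi-4-mini's actual architecture.

```python
import numpy as np

class MixtureOfLoRALinear:
    """Toy mixture-of-LoRA linear layer: one frozen base weight shared
    across modalities, plus a low-rank adapter (A, B) per modality
    whose update is added on top of the base projection."""

    def __init__(self, d_in, d_out, rank, modalities, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.standard_normal((d_out, d_in)) * 0.02  # frozen base weight
        self.adapters = {
            m: (
                rng.standard_normal((rank, d_in)) * 0.02,  # A: down-projection
                np.zeros((d_out, rank)),                   # B: up-projection, zero init
            )
            for m in modalities
        }

    def forward(self, x, modality):
        A, B = self.adapters[modality]
        # base projection + the low-rank update selected by modality
        return self.W @ x + B @ (A @ x)

layer = MixtureOfLoRALinear(d_in=8, d_out=4, rank=2,
                            modalities=["text", "vision", "audio"])
x = np.ones(8)
y_text = layer.forward(x, "text")
y_audio = layer.forward(x, "audio")
print(y_text.shape)  # (4,)
```

With the standard zero initialization of B, every adapter starts as a no-op, so all modalities initially share the base model's behavior; training then specializes each adapter without touching the shared weight.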
Weizhu Chen (@weizhuchen):

Glad to see the team used a 3.8B model (Phi-4-mini-reasoning) to achieve 94.6 on MATH-500 and 57.5 on AIME-24.
arXiv: arxiv.org/pdf/2504.21233
HF: huggingface.co/microsoft/Phi-…
Azure: aka.ms/phi4-mini-reas…
Weizhu Chen (@weizhuchen):

Synthesizing challenging problems on which the current model performs poorly is an important area in RL. Another thing that interests me is self-evolving learning: synthesizing questions/problems from which the model can learn continuously.
You may check our work here: mastervito.github.io/MasterVito.SwS…
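The idea of targeting a model's weaknesses can be sketched as a simple loop: synthesize candidate problems, estimate the current model's success rate on each, and keep the ones in a "hard but learnable" band for the next training round. This is a schematic toy, not the linked SwS method; all names and thresholds below are made up for illustration.

```python
import random

def synthesize_by_weakness(model_solve, make_problem, rounds=3, batch=50,
                           keep_band=(0.1, 0.7), tries=8, seed=0):
    """Schematic self-evolving loop: generate candidate problems and
    retain those the current model solves only sometimes, i.e. the
    ones most informative to train on next."""
    rng = random.Random(seed)
    curriculum = []
    for _ in range(rounds):
        for _ in range(batch):
            problem, answer = make_problem(rng)
            # empirical success rate of the current model on this problem
            rate = sum(model_solve(problem) == answer for _ in range(tries)) / tries
            if keep_band[0] <= rate <= keep_band[1]:
                curriculum.append((problem, answer))
        # (a real system would train / RL-update the model on `curriculum` here)
    return curriculum

# toy demo: a "model" that guesses sums, with error probability
# growing as the operands get larger
def make_problem(rng):
    a, b = rng.randint(1, 20), rng.randint(1, 20)
    return (a, b), a + b

def model_solve(problem):
    a, b = problem
    return a + b + random.choice([0] * max(1, 30 - a - b) + [1, -1])

hard_set = synthesize_by_weakness(model_solve, make_problem)
print(len(hard_set))
```

The band filter discards both trivially easy problems (rate near 1, nothing to learn) and hopeless ones (rate near 0, no reward signal), which is the usual difficulty-targeting heuristic in RL data synthesis.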
Weizhu Chen (@weizhuchen):

You may check our work on Phi-4-mini-flash-Reasoning. What I like most is the Gated Memory Unit (GMU) design, which can be applied in future model designs to achieve both quality and long context, as well as µP++. Liliang Ren ✈️ ICML 2025
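The exact GMU formulation is in the Phi-4-mini-flash report; as a rough illustrative reading (an assumption, not the paper's definition), a gated memory unit lets a cheap layer reuse a memory state produced by an earlier layer through elementwise gating, instead of recomputing a full attention or SSM pass:

```python
import numpy as np

def silu(z):
    # SiLU activation: z * sigmoid(z)
    return z / (1.0 + np.exp(-z))

def gated_memory_unit(x, memory, W_in, W_out):
    """Illustrative gated-memory-unit sketch: project the current
    token's representation x, gate it elementwise with a memory state
    shared from an earlier layer, then project back out."""
    gate = silu(W_in @ x)    # cheap projection of the current input
    mixed = memory * gate    # elementwise gating by the shared memory
    return W_out @ mixed

rng = np.random.default_rng(0)
d, h = 8, 16
x = rng.standard_normal(d)
memory = rng.standard_normal(h)  # state produced by an earlier layer
W_in = rng.standard_normal((h, d))
W_out = rng.standard_normal((d, h))
y = gated_memory_unit(x, memory, W_in, W_out)
print(y.shape)  # (8,)
```

The appeal for long context is that the expensive sequence-mixing work is done once and its state is cheaply reused downstream, trading a full layer's cost for two projections and an elementwise product.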

Weizhu Chen (@weizhuchen):

See our work at the workshop today. If you are looking for opportunities to work on efficient model architectures, or anything that makes training or inference run much faster on thousands of GPUs or more, please come talk to us or DM me. We are hiring.