Dinghuai Zhang 张鼎怀 (@zdhnarsil) 's Twitter Profile
Dinghuai Zhang 张鼎怀

@zdhnarsil

Researcher at @MSFTResearch. Prev: PhD at @Mila_Quebec, intern at @Apple MLR and FAIR Labs @MetaAI, math undergraduate at @PKU1898.

ID: 2489747113

Link: http://zdhnarsil.github.io · Joined: 11-05-2014 11:55:14

652 Tweets

3.3K Followers

1.1K Following

Leo Dianbo Liu (@dianboliu) 's Twitter Profile Photo

We're pleased to have Prof. Yoshua Bengio (Professor of Computer Science, Université de Montréal) as a distinguished speaker at the National University of Singapore (NUS) 120 Distinguished Speaker Series!

Registration page: lnkd.in/gcPfwZ3T
Yuanqi Du (@yuanqid) 's Twitter Profile Photo

Scientific Knowledge Emerges in LLMs and YOU CAN Access It (via sampling)! 

🔥🔥🔥New blog to summarize what we have learned from evaluating LLMs for several optimization, decision-making, and planning problems in science with truly impressive performances!
Jiatao Gu (@thoma_gu) 's Twitter Profile Photo

I will be attending #ICLR2025 in person during Apr 24-28, and presenting our research: DART: Denoising Autoregressive Transformer
📌 Fri 25 Apr, 3:00-5:30 p.m. (UTC+8)

This is my first time visiting Singapore, and I am looking forward to chatting with old and new friends!

YCY (@yoyolicoris) 's Twitter Profile Photo

github.com/pytorch/audio/…
Torchaudio just announced it will be pure Python again. That means dropping efficient kernels like filtering, RNN-Transducer, etc. It's an unwise and disruptive decision tbh. 

If your work will be affected by this, please leave a comment there... 
<a href="/PyTorch/">PyTorch</a>
Carles Domingo-Enrich (@cdomingoenrich) 's Twitter Profile Photo

🚀Excited to open-source the code for Adjoint Matching, as part of a new repo centered around reward fine-tuning via stochastic optimal control! github.com/microsoft/soc-…
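For context, reward fine-tuning via stochastic optimal control is typically posed in the following textbook form (a generic sketch, not necessarily the exact objective used in this repo):

```latex
\min_u \; \mathbb{E}\!\left[ \int_0^1 \tfrac{1}{2}\,\lVert u(X_t, t)\rVert^2 \, dt \;-\; \lambda\, r(X_1) \right]
\quad \text{s.t.} \quad
dX_t = \bigl(b(X_t, t) + \sigma(t)\, u(X_t, t)\bigr)\, dt + \sigma(t)\, dW_t ,
```

where $b$ is the pretrained (base) drift, $u$ is the learned control, and $r$ is the reward on terminal samples.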

Qwen (@alibaba_qwen) 's Twitter Profile Photo

Introducing Qwen3! 

We release Qwen3, our latest large language models, with open weights, including 2 MoE models and 6 dense models ranging from 0.6B to 235B. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general
Zhihong Shao (@zhs05232838) 's Twitter Profile Photo

We just released DeepSeek-Prover V2.
- Solves nearly 90% of miniF2F problems
- Significantly improves the SoTA performance on the PutnamBench
- Achieves a non-trivial pass rate on AIME 24 & 25 problems in their formal version

GitHub: github.com/deepseek-ai/De…
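As a toy illustration of what "formal version" means here: benchmark problems are stated and proved in a proof assistant such as Lean. A hypothetical miniF2F-style statement (assuming Lean 4 with Mathlib; this is not an actual benchmark problem) might look like:

```lean
-- A toy Lean 4 theorem in the style of formalized competition problems.
theorem toy_example (x : ℝ) (h : 2 * x + 3 = 7) : x = 2 := by
  linarith
```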
Yinuo Ren (@yinuo_ren) 's Twitter Profile Photo

We establish a fundamental link between the time-reversal of Markov processes and the generalized Doob’s h-transform. This connection enables the design of denoising generative models with an arbitrary generator.

Check our new paper: arxiv.org/abs/2504.01938
(1/4)
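For reference, the generalized Doob h-transform of a Markov generator $\mathcal{L}$ by a positive function $h$ takes the standard textbook form (a sketch of the usual definition, not the paper's exact statement):

```latex
\mathcal{L}^{h} f \;=\; \frac{1}{h}\Bigl[ \mathcal{L}(h f) \;-\; f\, \mathcal{L} h \Bigr],
```

and time reversal arises as a special case in which $h$ is taken to be the marginal density of the forward process.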
Sam Rodriques (@sgrodriques) 's Twitter Profile Photo

Chenghao Liu will work with FutureHouse and Nobel laureate <a href="/francesarnold/">Frances Arnold</a> at Caltech to develop closed-loop generative machine learning workflows for de novo enzyme discovery. Chenghao was co-founder of Dreamfold and known for combining physical organic chemistry
机器之心 JIQIZHIXIN (@synced_global) 's Twitter Profile Photo

Self-Evolving Curriculum for LLM Reasoning

This paper, from Mila – Quebec AI Institute and ServiceNow Research, tackles a key challenge in reinforcement learning (RL)-based fine-tuning of LLMs: how to choose which problems to train on, and in what order, for best learning and
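The problem-selection question is often framed as a multi-armed bandit over problem categories. A generic sketch of that framing (not this paper's exact algorithm; the class name and the learning signal are illustrative assumptions):

```python
import random

class BanditCurriculum:
    """Epsilon-greedy bandit over problem categories: track a moving-average
    estimate of how much learning each category yields, and mostly pick the
    current best while occasionally exploring."""

    def __init__(self, categories, eps=0.1, alpha=0.3):
        self.q = {c: 0.0 for c in categories}  # per-category value estimates
        self.eps, self.alpha = eps, alpha

    def select(self):
        # Explore with probability eps, otherwise exploit the best estimate.
        if random.random() < self.eps:
            return random.choice(list(self.q))
        return max(self.q, key=self.q.get)

    def update(self, category, learning_signal):
        # learning_signal could be, e.g., a change in pass rate after training
        # on this category; move the estimate toward it.
        self.q[category] += self.alpha * (learning_signal - self.q[category])
```

The curriculum then emerges online: categories that currently produce the largest learning signal get sampled more often, and the estimates decay as those categories are mastered.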
Chongxuan Li (@lichongxuan) 's Twitter Profile Photo

🚀 Excited to share our latest work: "Scaling Diffusion Transformers Efficiently via μP"! Diffusion Transformers are essential in visual generative models, but hyperparameter tuning for scaling remains challenging. We adapt μP, proving it also applies to diffusion Transformers!
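For intuition, the most common μP rule for matrix-like (hidden) parameters under Adam scales the learning rate inversely with width relative to a tuned base model. A simplified sketch of that heuristic (the function name is illustrative, and this is the generic μP rule rather than the paper's full parametrization):

```python
def mup_hidden_lr(base_lr: float, base_width: int, width: int) -> float:
    """muP heuristic for Adam: the hidden-layer learning rate scales like
    1/width, so a rate tuned at base_width transfers to a wider model."""
    return base_lr * base_width / width
```

In practice this is what makes hyperparameters "transfer": you tune once at a small width and reuse the scaled values at the target width.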

Zhengyang Geng (@zhengyanggeng) 's Twitter Profile Photo

Excited to share our work with my amazing collaborators, <a href="/Goodeat258/">Goodeat</a>, <a href="/SimulatedAnneal/">Xingjian Bai</a>, <a href="/zicokolter/">Zico Kolter</a>, and Kaiming.

In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,
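The relation between average and instantaneous velocity can be sketched as follows (my reconstruction of the standard relation, not a quotation of the paper). Define the average velocity over $[r, t]$ as

```latex
u(z_t, r, t) \;=\; \frac{1}{t - r} \int_r^t v(z_\tau, \tau)\, d\tau ;
```

differentiating $(t - r)\,u$ with respect to $t$ then gives the identity

```latex
v(z_t, t) \;=\; u(z_t, r, t) \;+\; (t - r)\, \frac{d}{dt}\, u(z_t, r, t),
```

which relates the two velocities without requiring an integral at training time.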
宝玉 (@dotey) 's Twitter Profile Photo

Wow, Rick Rubin's "The Timeless Art of Vibe Coding" blew my mind: it explains vibe coding through the Dao De Jing, and it was written by a Westerner no less! The piece draws an analogy between the Dao and code: the "Dao" is the nameless, "code" is the formed. The Dao De Jing opens: "The Dao that can be told is not the eternal Dao. The name that can be named is not the eternal name. The nameless is the origin of heaven and earth; the named is the mother of all things." Rubin adapts this as: "The code that

Weijie Su (@weijie444) 's Twitter Profile Photo

Happy to share that our paper "The ICML 2023 Ranking Experiment: Examining Author Self-Assessment in ML/AI Peer Review" will appear in JASA as a Discussion Paper: arxiv.org/abs/2408.13430 It's a privilege to work with such a wonderful team: Buxin, Jiayao, Natalie Collina,

Harry Zhao (@theharryzhao) 's Twitter Profile Photo

Our paper on rejecting hallucinated planning targets is now accepted at <a href="/icmlconf/">ICML Conference</a> 2025!
📜: arxiv.org/abs/2410.07096
💿: github.com/mila-iqia/delu…

"Rejecting Hallucinated State Targets during Planning"
- Authors: <a href="/TheHarryZhao/">Harry Zhao</a>, <a href="/TiSU32/">Tristan</a>, <a href="/LarocheRomain/">Romain Laroche</a>, Doina Precup, <a href="/Yoshua_Bengio/">Yoshua Bengio</a>
Tianyuan Zhang (@tianyuanzhang99) 's Twitter Profile Photo

Bored of linear recurrent memories (e.g., linear attention) and want a scalable, nonlinear alternative? Our new paper "Test-Time Training Done Right" proposes LaCT (Large Chunk Test-Time Training), a highly efficient, massively scalable nonlinear memory with: 💡 Pure PyTorch
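For intuition, test-time training treats the memory as fast weights updated by gradient descent on each incoming chunk. A conceptual numpy sketch of one large-chunk update on a *linear* memory (the paper's memory is nonlinear and its optimizer differs; the function name and loss are illustrative assumptions):

```python
import numpy as np

def large_chunk_ttt_step(W, K, V, lr=0.1):
    """One chunk-level test-time-training update.

    W: (d_k, d_v) fast-weight memory, K: (chunk, d_k) keys,
    V: (chunk, d_v) values. Take a single gradient step on the chunk's
    mean squared reconstruction error, then read the updated memory
    back out for the whole chunk.
    """
    grad = K.T @ (K @ W - V) / len(K)  # d/dW of 0.5 * mean squared error
    W = W - lr * grad                  # update on the entire chunk at once
    return W, K @ W                    # new memory, chunk readout
```

Updating on a large chunk at once, rather than token by token, turns the update into dense matmuls, which is what makes the approach hardware-friendly.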

Jiatao Gu (@thoma_gu) 's Twitter Profile Photo

I will be attending #CVPR2025 and presenting our latest research at Apple MLR! Specifically, I will present our highlight poster, world-consistent video diffusion (cvpr.thecvf.com/virtual/2025/p…), and give three invited workshop talks, including our recent preprint ★STARFlow★! (0/n)

Yichen Li (@antheayli) 's Twitter Profile Photo

How can we equip robots with superhuman sensory capabilities? Come join us at the RSS 2025 workshop on Multimodal Robotics with Multisensory Capabilities, June 21, to learn more. Featuring speakers: Jitendra Malik, Katherine J. Kuchenbecker, Kristen Grauman, Yunzhu Li, Boyi Li