Tim Xiao (@timzxiao)'s Twitter Profile
Tim Xiao

@timzxiao

PhD student in Machine Learning @ University of Tübingen · IMPRS-IS scholar

ID: 618657764

Link: http://timx.me · Joined: 26-06-2012 02:44:37

248 Tweets

231 Followers

316 Following

Weiyang Liu (@besteuler)'s Twitter Profile Photo

📢Glad to introduce FormalMATH, a large-scale Lean4 benchmark comprising 5,560 formally verified problems. 

📖The benchmark spans from high-school Olympiad challenges to undergraduate-level theorems across diverse domains.

The best LLM prover only achieved 16.46% accuracy.

1/4
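
For a sense of the format: a FormalMATH-style entry pairs an informal problem with a machine-checkable Lean 4 statement that a prover must then close with a proof. The toy theorem below is a hypothetical illustration of that shape, not an actual benchmark item.

```lean
import Mathlib

-- Hypothetical toy entry in the FormalMATH spirit: the informal problem
-- "show that n² + n is even for every natural number n" restated as a
-- Lean 4 theorem; an LLM prover's job is to produce the proof below.
theorem n_sq_add_n_even (n : ℕ) : Even (n ^ 2 + n) := by
  have h : n ^ 2 + n = n * (n + 1) := by ring
  rw [h]
  exact Nat.even_mul_succ_self n
```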
Katrin Renz (@katrinrenz)'s Twitter Profile Photo

📣 Excited to share our #CVPR2025 Spotlight paper and my internship project at Wayve: SimLingo, a Vision-Language-Action (VLA) model that achieves state-of-the-art driving performance with language capabilities. Code: github.com/RenzKa/simlingo · Paper: arxiv.org/abs/2503.09594

Zhen Liu (@itsthezhen)'s Twitter Profile Photo

I was surprised when I first saw that the black magic of prompt engineering can marry classical ML methods in such a natural way: simply asking an LLM to do rejection sampling makes it a more rational agent. I cannot wait to see how we might similarly design better "LLM algorithms".
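
A minimal sketch of that marriage, assuming a generic black-box call `llm(prompt)` (the function, prompts, and retry budget are placeholders, not the paper's actual protocol): the LLM proposes a draw, verbalizes an acceptance probability for it under the target, and a classical accept/reject step does the rest.

```python
import random

def llm(prompt: str) -> str:
    """Placeholder for a black-box LLM call (e.g., a chat-completion API)."""
    raise NotImplementedError

def llm_rejection_sample(target: str, max_tries: int = 20) -> str:
    """Hypothetical sketch: the LLM serves as both proposal and accept/reject judge."""
    candidate = llm(f"Draw one sample from: {target}. Reply with the sample only.")
    for _ in range(max_tries):
        # Ask the LLM to verbalize an acceptance probability for the candidate.
        reply = llm(
            f"Target: {target}. Candidate: {candidate}. "
            "Reply with a single number in [0, 1]: the probability of accepting "
            "this candidate so that accepted samples follow the target."
        )
        accept_prob = max(0.0, min(1.0, float(reply)))
        # Classical rejection step: keep the candidate with the stated probability.
        if random.random() < accept_prob:
            return candidate
        candidate = llm(f"Draw one sample from: {target}. Reply with the sample only.")
    return candidate  # budget exhausted; fall back to the last proposal
```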

Robert Bamler (@robamler)'s Twitter Profile Photo

Great paper by my students Tim Xiao and Johannes Zenn and collaborators: it applies ideas from Monte Carlo sampling to (black-box) LLM execution to turn LLMs into better-calibrated stochastic samplers.

Weiyang Liu (@besteuler)'s Twitter Profile Photo

Verbalized machine learning treats LLMs with prompts as function approximators. Building on this, Tim Xiao came up with the idea of studying whether LLMs can act as samplers. It turns out they’re often biased, even when they appear to understand the target distribution.
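
One hypothetical way to see that bias (not the paper's experimental setup): ask the model many times for a draw from a simple, fully specified distribution and compare the empirical frequencies against the target.

```python
from collections import Counter

def llm(prompt: str) -> str:
    """Placeholder for a black-box LLM call."""
    raise NotImplementedError

def bernoulli_bias_check(n_draws: int = 1000) -> dict:
    """Compare the LLM's empirical sampling frequency against Bernoulli(0.7)."""
    prompt = "Sample once from Bernoulli(p=0.7). Reply with exactly '1' or '0'."
    counts = Counter(llm(prompt).strip() for _ in range(n_draws))
    freq_one = counts["1"] / n_draws
    # An unbiased sampler would land near 0.7 up to Monte Carlo noise; the
    # claim above is that LLM samplers often miss the target systematically.
    return {"target": 0.7, "empirical": freq_one, "gap": freq_one - 0.7}
```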

Weiyang Liu (@besteuler)'s Twitter Profile Photo

Muon is gaining attention for its use of orthogonalization, making it a natural point of comparison with POET. We computed singular value entropy over training steps and found that POET consistently maintains high entropy. A recent study (arxiv.org/abs/2502.16982) suggests that this is a
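
Singular value entropy is easy to reproduce. Below is a minimal NumPy sketch of one common reading of the metric (Shannon entropy of the normalized singular value spectrum); the authors' exact normalization may differ.

```python
import numpy as np

def singular_value_entropy(W: np.ndarray, eps: float = 1e-12) -> float:
    """Shannon entropy of the normalized singular values of W.

    High entropy means the spectrum stays spread out (no few directions
    dominate); a collapsing spectrum drives the entropy toward zero.
    """
    s = np.linalg.svd(W, compute_uv=False)
    p = s / (s.sum() + eps)              # normalize spectrum to a distribution
    return float(-(p * np.log(p + eps)).sum())

# Example: a random Gaussian weight matrix has a well-spread spectrum.
rng = np.random.default_rng(0)
print(singular_value_entropy(rng.standard_normal((256, 256))))
```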

Weiyang Liu (@besteuler)'s Twitter Profile Photo

We have added some new experiments and analyses to the new version of our paper. Check it out here: arxiv.org/abs/2506.08001. We discovered that despite being generalized to spectrum-preserving training, POET can still preserve minimum hyperspherical energy. This property only
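
For reference, hyperspherical energy (in the sense of Liu et al.'s minimum-hyperspherical-energy line of work) measures how uniformly a layer's neuron directions spread over the unit sphere; lower energy means more uniform spread. Below is a minimal NumPy sketch with a Riesz kernel at s = 1; the paper's exact kernel and normalization may differ.

```python
import numpy as np

def hyperspherical_energy(W: np.ndarray, s: float = 1.0, eps: float = 1e-8) -> float:
    """Riesz-s energy of the rows of W projected onto the unit sphere.

    Spectrum-preserving training that keeps this quantity at its (low)
    initialization value would preserve minimum hyperspherical energy.
    """
    V = W / (np.linalg.norm(W, axis=1, keepdims=True) + eps)  # unit directions
    dist = np.linalg.norm(V[:, None, :] - V[None, :, :], axis=-1)
    iu = np.triu_indices(len(V), k=1)                          # each pair once
    return float(((dist[iu] + eps) ** (-s)).sum())
```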
