Zhengyao Jiang (@zhengyaojiang) 's Twitter Profile
Zhengyao Jiang

@zhengyaojiang

Cofounder @WecoAI. Building AI agents that build AI. PhD in Machine Learning, UCL @UCL_DARK @ai_ucl. (Zheng=j-uhng, j as in job; yao=y-aoww)

ID: 4074787285

Link: http://zhengyaojiang.github.io · Joined: 31-10-2015 01:52:12

328 Tweets

3.3K Followers

352 Following

Chen Sun 🤖🧠🇨🇦 (@chensun92) 's Twitter Profile Photo

Our team at Google DeepMind is looking to hire a talented new Research Scientist!

Our group (under Ed H. Chi) aims to push the frontier of AI-human interactions through personalization of LLMs and deeply understanding the open-ended nature of user intentions.

Beneath this lies
Jakob Foerster (@j_foerst) 's Twitter Profile Photo

🚨job alert🚨Foerster Lab for AI Research is looking for a postdoc, deadline is 1st of September (my.corehr.com/pls/uoxrecruit…) By many reports we are on the global Pareto frontier of talent density, agency and ambition. Join us!

Edward Grefenstette (@egrefen) 's Twitter Profile Photo

🚨 Research Scientist Hiring alert! 🚨 Applications close THIS FRIDAY for Google DeepMind research scientist roles to work on autonomous assistants and human-facing agentic capabilities, self-improvement, and open-endedness.

Minqi Jiang (@minqijiang) 's Twitter Profile Photo

Weco is one of the most under-the-radar teams building frontier agents. Very excited to see they now have the resources to take their vision for recursively-improving ASI to the next level. Congrats Zhengyao Jiang, Yuxiang (Jimmy) Wu, Dhruv Srikanth, and co!

Jack Parker-Holder (@jparkerholder) 's Twitter Profile Photo

Genie 3 feels like a watershed moment for world models 🌐: we can now generate multi-minute, real-time interactive simulations of any imaginable world. This could be the key missing piece for embodied AGI… and it can also create beautiful beaches with my dog, playable real time

Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

Harder, Better, Faster, Stronger, Real-time! We are excited to reveal Genie 3, our most capable real-time foundational world model. Fantastic cross-team effort led by Jack Parker-Holder and Shlomi Fruchter. Below are some interactive worlds and capabilities that were highlights for me

Zhengyao Jiang (@zhengyaojiang) 's Twitter Profile Photo

Honestly pretty blown away. You control a realistic avatar with your keyboard in real-time, all powered by a single neural network. The mind-blowing bit: the real world's computation happens at the atomic level and is unimaginably expensive to simulate fully (just imagine storing

Zhengyao Jiang (@zhengyaojiang) 's Twitter Profile Photo

Quick takes on GPT-5 on MLE-Bench:

- It's not based on the full set or the lite set but rather the "30 most interesting" competitions.
    - Not very good scientific practice.
- Medal rates for all models are quite low (below 10%). It seems they didn't apply any agentic
Tom Johnson (@tomjohndesign) 's Twitter Profile Photo

This is how I feel about vibe coding.

Any project I try that has any kind of complication has this immediate burst of progress. Things are amazing and it feels like a superpower. Then... as I add more complexity, things crash to a halt.

The only projects that I think I can
Zhengyao Jiang (@zhengyaojiang) 's Twitter Profile Photo

Not surprised that CoT fails when problem depth grows. RL post-training will lengthen chains, but it won’t make infinite-horizon reasoners. Humans get there by caching conclusions in persistent memory (notes but primarily synapses). Memory is the frontier.
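The caching idea in the tweet above has a direct programming analogue: memoization, where each intermediate conclusion is stored once and reused instead of being re-derived from scratch. A toy sketch (Fibonacci stands in for a deep reasoning chain; without the cache the naive recursion would redo exponentially many steps):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def fib(n: int) -> int:
    # Each cached value is a "conclusion in persistent memory":
    # later calls look it up instead of replaying the whole chain.
    return n if n < 2 else fib(n - 1) + fib(n - 2)

print(fib(10))  # 55
```

With the cache, `fib(n)` runs in linear time; the uncached version is infeasible well before the chain gets deep, which is the rough analogy to a reasoner that cannot persist intermediate results.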

elvis (@omarsar0) 's Twitter Profile Photo

M3-Agent: A Multimodal Agent with Long-Term Memory Impressive application of multimodal agents. Lots of great insights throughout the paper. Here are my notes with key insights:

M3-Agent: A Multimodal Agent with Long-Term Memory

Impressive application of multimodal agents. 

Lots of great insights throughout the paper.

Here are my notes with key insights:
Zhengyao Jiang (@zhengyaojiang) 's Twitter Profile Photo

People used Polymarket to get a prior on which frontier lab would ship the best model next.

Turns out we can just ask an LLM now.

For many unresolved events, querying an AI already gives you a forecast that’s as informative as prediction markets.
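One way to make "as informative as prediction markets" testable is to score both forecasters on events that have since resolved, e.g. with the Brier score (mean squared error between stated probabilities and 0/1 outcomes; lower is better). A minimal sketch, where the outcomes and probabilities are entirely made up for illustration:

```python
def brier_score(forecasts, outcomes):
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    assert len(forecasts) == len(outcomes)
    return sum((p - o) ** 2 for p, o in zip(forecasts, outcomes)) / len(forecasts)

# Hypothetical resolved events: 1 = happened, 0 = did not.
outcomes = [1, 0, 1, 1]

# Made-up probabilities from a prediction market vs. an LLM asked the same questions.
market_probs = [0.70, 0.20, 0.60, 0.80]
llm_probs = [0.65, 0.25, 0.55, 0.85]

print(round(brier_score(market_probs, outcomes), 4))  # 0.0825
print(round(brier_score(llm_probs, outcomes), 4))     # 0.1025
```

Run over enough resolved questions, comparable Brier scores would be the quantitative version of the claim; the numbers above are placeholders, not real forecasts.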
Zhengyao Jiang (@zhengyaojiang) 's Twitter Profile Photo

A simple inference-time heuristic pushes GPT-OSS-120B to 99.9% on AIME’25 (GPT-5-pro level). My interpretation: pretraining + SFT + KL divergence heavily anchor the policy to “autocomplete style” reasoning; the model struggles to abandon unproductive rollouts once it has started. So