Yiding Jiang (@yidingjiang)'s Twitter Profile
Yiding Jiang

@yidingjiang

PhD student @mldcmu @SCSatCMU. Formerly intern @MetaAI, AI resident @GoogleAI. BS from @Berkeley_EECS. Trying to understand stuff.

ID: 4515396858

Link: http://yidingjiang.github.io · Joined: 10-12-2015 07:06:19

301 Tweets

1.1K Followers

559 Following

𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8)

Looking beyond the next token

TRELAWNEY inserts future tokens <T>...</T> during training to teach models to plan ahead—boosting reasoning, coherence, and control.

Highlights:
- NO ARCHITECTURE CHANGES. JUST SMARTER DATA.
- works with standard decoding
- enables controllable
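
For readers who want the gist in code: the trick is purely a data transformation, so a toy sketch looks like the following (the delimiter strings, lookahead distance, and span length here are illustrative placeholders, not the paper's exact recipe).

```python
import random

# Hypothetical delimiter tokens standing in for the paper's <T>...</T> markers.
T_OPEN, T_CLOSE = "<T>", "</T>"

def insert_future_span(tokens, lookahead=8, span_len=4, rng=random):
    """Splice a delimited copy of tokens that occur further ahead into the
    sequence, so a standard next-token model is trained to 'announce' part of
    its future continuation before producing the intervening text."""
    if len(tokens) < lookahead + span_len + 2:
        return tokens  # too short to augment; leave unchanged
    i = rng.randrange(1, len(tokens) - lookahead - span_len)
    future = tokens[i + lookahead : i + lookahead + span_len]
    return tokens[:i] + [T_OPEN] + future + [T_CLOSE] + tokens[i:]

# The augmented sequence still trains with the ordinary next-token loss,
# which is why no architecture changes are needed -- only the data changes.
example = "the cat sat on the mat and then quietly fell asleep".split()
print(insert_future_span(example, lookahead=4, span_len=2, rng=random.Random(0)))
```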
Christina Baek (@_christinabaek)

Are current reasoning models optimal for test-time scaling? 🌠
No! Models make the same incorrect guess over and over again.

We show that you can fix this problem w/o any crazy tricks 💫 – just do weight ensembling (WiSE-FT) for big gains on math!

1/N
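
For context, WiSE-FT itself is just a linear interpolation of two checkpoints in weight space; here is a minimal sketch of the merge step (the mixing coefficient and the handling of non-float buffers are my own choices, not the thread's).

```python
import torch

def wise_ft_merge(base_state, tuned_state, alpha=0.5):
    """Weight-space ensembling (WiSE-FT): linearly interpolate two checkpoints
    of the same architecture. alpha=0 recovers the base model, alpha=1 the
    fine-tuned one; intermediate values trade off between the two."""
    merged = {}
    for name, base_param in base_state.items():
        tuned_param = tuned_state[name]
        if base_param.is_floating_point():
            merged[name] = (1.0 - alpha) * base_param + alpha * tuned_param
        else:
            # Integer buffers (e.g. step counters) are copied rather than mixed.
            merged[name] = tuned_param.clone()
    return merged

# Usage sketch: merge the state dicts, then load the result into the model.
# merged = wise_ft_merge(base_model.state_dict(), tuned_model.state_dict(), alpha=0.7)
# model.load_state_dict(merged)
```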
Allan Zhou (@allanzhou17)

Excited to be presenting ADO next week at #ICLR2025! Check out a new blogpost we wrote that summarizes the key ideas and results (link below):

Sadhika Malladi (@sadhikamalladi)

Check out our online data selection alg ADO at ICLR 2025! And take a look at this blog post by Yiding Jiang and Allan Zhou summarizing the key ideas: bland.website/notes/ado/

Yutong (Kelly) He (@electronickale)

✨ Love 4o-style image generation but prefer to use Midjourney? Tired of manual prompt crafting from inspo images? PRISM to the rescue! 🖼️→📝→🖼️ We automate black-box prompt engineering—no training, no embeddings, just accurate, readable prompts from your inspo images! 1/🧵

Yiding Jiang (@yidingjiang)

Data selection and curriculum learning can be formally viewed as a compression protocol via prequential coding. New blog (with Allan Zhou ) about this neat idea that motivated ADO but didn’t make it into the paper. yidingjiang.github.io/blog/post/curr…

Allan Zhou (@allanzhou17)

How should we order training examples? In a new blogpost (w/ Yiding Jiang), we explore a compression-based perspective: order your dataset to minimize its prequential codelength.
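
Concretely, the prequential codelength of an ordering is the total number of bits needed to encode each chunk with a model trained only on the chunks that came before it. A schematic version (the fit/neg_log_prob hooks and the chunking are placeholders, not taken from the blog post):

```python
import math

def prequential_codelength(ordered_data, init_model, fit, neg_log_prob, chunk_size=1):
    """Predict-then-update codelength of `ordered_data` under a given ordering:
    each chunk is encoded with the model fit only on preceding chunks, then the
    model is updated on that chunk. Orderings that make this total small are,
    in the compression view, good curricula."""
    model = init_model
    total_bits = 0.0
    for start in range(0, len(ordered_data), chunk_size):
        chunk = ordered_data[start : start + chunk_size]
        # Cost of encoding the chunk under the current model, converted to bits
        # (assuming neg_log_prob returns a negative log-likelihood in nats).
        total_bits += sum(neg_log_prob(model, x) for x in chunk) / math.log(2)
        # Only after paying that cost does the model get to train on the chunk.
        model = fit(model, chunk)
    return total_bits
```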

Yiding Jiang (@yidingjiang)

A mental model I find useful: all data acquisition (web scrapes, synthetic data, RL rollouts, etc.) is really an exploration problem 🔍. This perspective has some interesting implications for where AI is heading. Wrote down some thoughts: yidingjiang.github.io/blog/post/expl…

Minqi Jiang (@minqijiang)

Recently, there has been a lot of talk of LLM agents automating ML research itself. If Llama 5 can create Llama 6, then surely the singularity is just around the corner. 

How can we get a pulse check on whether current LLMs are capable of driving this kind of total
Jean de Nyandwi (@jeande_d)

Good blog on "era of exploration" - Data scarcity is the new bottleneck. LLMs consume data far faster than humans can produce it. We're running out of high-quality training data. - Pretraining solved exploration by accident. Pretraining effectively pays a massive, upfront

Good blog on "era of exploration"

- Data scarcity is the new bottleneck. LLMs consume data far faster than humans can produce it. We're running out of high-quality training data.

- Pretraining solved exploration by accident. Pretraining effectively pays a massive, upfront
Aya Somai (@aya_somai_)

My favorite reading of the week by Yiding Jiang: The next era is not about learning from data but about deciding what data to learn from. yidingjiang.github.io/blog/post/expl…

Alex Robey (@alexrobey23)

On Monday, I'll be presenting a tutorial on jailbreaking LLMs + the security of AI agents with Hamed Hassani and Amin Karbasi at ICML. I'll be in Vancouver all week -- send me a DM if you'd like to chat about jailbreaking, AI agents, robots, distillation, or anything else!

Vaishnavh Nagarajan (@_vaishnavh)

Today Chen Wu and I will be presenting our #ICML work on creativity in the Oral 3A Reasoning session (West Exhibition Hall C) 10 - 11 am PT

Or please stop by our poster right after @ East Exhibition Hall A-B #E-2505 11am-1:30pm. (Hope you enjoy some silly human drawings!)