Jinyan Su (on job market) (@sujinyan6) 's Twitter Profile
Jinyan Su (on job market)

@sujinyan6

PhD @Cornell; LLM personalization; RAG; LLM reasoning

ID: 1418105627352584193

Website: http://jinyansu1.github.io · Joined: 22-07-2021 07:08:34

13 Tweets

187 Followers

169 Following

Chenghao Yang (@chrome1996) 's Twitter Profile Photo

Happy Thanksgiving! Inspired by many great bloggers like Sasha Rush and Yao Fu, I made a tutorial about the "inference-time compute" techniques showcased by O1. It incorporates insights from Sasha's great talk and ongoing O1 replications. Video: youtu.be/_Bw5o55SRL8. Feedback welcome!

Samuel Marks (@saprmarks) 's Twitter Profile Photo

What can AI researchers do *today* that AI developers will find useful for ensuring the safety of future advanced AI systems? To ring in the new year, the Anthropic Alignment Science team is sharing some thoughts on research directions we think are important.

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxestex) 's Twitter Profile Photo

If you can only read one DeepSeek paper in your life, read DeepSeek Math. Everything else is either ≈obvious in hindsight or clever optimization. DeepSeek Math is a tour de force of data engineering, general DL/LLM methodology, and RL, and it's just beautiful. Just 22 pages.

Niklas Muennighoff (@muennighoff) 's Twitter Profile Photo

Last week we released s1 - our simple recipe for sample-efficient reasoning & test-time scaling. We’re releasing 𝐬𝟏.𝟏 trained on the 𝐬𝐚𝐦𝐞 𝟏𝐊 𝐪𝐮𝐞𝐬𝐭𝐢𝐨𝐧𝐬 but performing much better by using r1 instead of Gemini traces. 60% on AIME25 I. Details in 🧵1/9

Infini-AI-Lab (@infiniailab) 's Twitter Profile Photo

🚀 RAG vs. Long-Context LLMs: The Real Battle ⚔️ 🤯Turns out, simple-to-build RAG can match million-dollar long-context LLMs (LC LLMs) on most existing benchmarks. 🤡So, do we even need long-context models? YES. Because today’s benchmarks are flawed: ⛳ Too Simple –

Hao AI Lab (@haoailab) 's Twitter Profile Photo

Reasoning models often waste tokens self-doubting. Dynasor saves you up to 81% of tokens while still arriving at the correct answer! 🧠✂️ - Probe the model halfway through to get its certainty - Use that certainty to stop reasoning early - 100% training-free, plug-and-play 🎮 Demo: hao-ai-lab.github.io/demo/dynasor-c…
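
A rough, self-contained sketch of the certainty-based early-exit idea described in the tweet: decode the reasoning in chunks, probe the model mid-trace for a provisional answer, and stop once the probes look certain. The helper names and the agreement-based certainty proxy below are illustrative placeholders, not Dynasor's actual implementation.

```python
import random

def generate_reasoning_chunk(trace: str) -> str:
    """Stand-in for decoding one chunk of reasoning tokens from an LLM."""
    return trace + " ...one more chunk of reasoning..."

def probe_answer(trace: str) -> str:
    """Stand-in for pausing generation and asking for a provisional answer."""
    return random.choice(["42", "42", "7"])  # noisy early on; a real model converges

def reason_with_early_exit(prompt: str, max_chunks: int = 8, probes: int = 3) -> str:
    """Generate reasoning chunk by chunk; stop early once repeated probes agree."""
    trace = prompt
    for _ in range(max_chunks):
        trace = generate_reasoning_chunk(trace)
        answers = [probe_answer(trace) for _ in range(probes)]
        if len(set(answers)) == 1:  # probes agree -> treat the answer as certain
            return answers[0]
    return probe_answer(trace)      # budget exhausted: fall back to the last probe

print(reason_with_early_exit("What is 6 * 7?"))
```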

Xuandong Zhao (@xuandongzhao) 's Twitter Profile Photo

🚀 Highly recommend checking out The Future of Language Models and Transformers workshops hosted by the Simons Institute for the Theory of Computing at UC Berkeley! This is an incredible opportunity to learn about cutting-edge LLM research directly from some of the most renowned experts in the field.

Lilian Weng (@lilianweng) 's Twitter Profile Photo

Giving your models more time to think before prediction, e.g., via smart decoding, chain-of-thought reasoning, latent thoughts, etc., turns out to be quite effective for unblocking the next level of intelligence. New post is here :) "Why we think": lilianweng.github.io/posts/2025-05-…
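
For the chain-of-thought flavor of test-time thinking mentioned above, a minimal illustration of the prompting difference (prompt strings only; no particular model API is assumed, and the question is a made-up example):

```python
question = "A train travels 60 km in 45 minutes. What is its speed in km/h?"

# Direct prompting: the model must commit to an answer immediately.
direct_prompt = f"Q: {question}\nA:"

# Chain-of-thought prompting: the model is nudged to spend output tokens on
# intermediate reasoning steps before giving the final answer.
cot_prompt = f"Q: {question}\nA: Let's think step by step."

print(direct_prompt)
print(cot_prompt)
```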