Chulin Xie (@chulinxie) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Is an LLM’s reasoning ability solely based on its powerful memorization skills? We conducted an in-depth empirical study to explore this question and uncovered some fascinating findings. Check out Chulin Xie’s threads for more details!

thumb_up_off_alt24

chat_bubble_outline0

repeat4

shareShare

Yangsibo Huang

@yangsibohuang

9 months ago

Probing results are my fav in our paper (Sec 4.2)!! 1. LLMs clearly develop reasoning skills through direct DT (i.e., w/o CoT). 2. Harder tasks demand more internal computation to solve. 3. Probing accuracy peaks in the middle layers—not the final layer.

thumb_up_off_alt71

chat_bubble_outline0

repeat7

shareShare

Victor Reis

@vetohaze

9 months ago

Our Algorithms group at Microsoft Research is hiring interns in differential privacy, reasoning abilities of LLMs, and theory: jobs.careers.microsoft.com/global/en/job/… jobs.careers.microsoft.com/global/en/job/… jobs.careers.microsoft.com/global/en/job/…

thumb_up_off_alt103

chat_bubble_outline2

repeat19

shareShare

Tian Li

@litian0331

8 months ago

I am taking new Ph.D. students from UChicagoCS and Data Science Institute in the 2024-2025 cycle! If you are interested in distributed optimization, data sharing, and trustworthy ML, please feel free to apply! More info on our research: litian96.github.io

I am taking new Ph.D. students from <a href="/UChicagoCS/">UChicagoCS</a> and <a href="/DSI_UChicago/">Data Science Institute</a> in the 2024-2025 cycle! If you are interested in distributed optimization, data sharing, and trustworthy ML, please feel free to apply! More info on our research: litian96.github.io

thumb_up_off_alt322

chat_bubble_outline2

repeat90

shareShare

Chulin Xie

@chulinxie

8 months ago

Exciting internship opportunity on privacy & foundation models with the amazing Zinan Lin at MSR! Zinan is an incredibly insightful and supportive mentor!

thumb_up_off_alt23

chat_bubble_outline1

repeat0

shareShare

Maya Varma

@mayavarma23

8 months ago

(1/4) Excited to share RaVL, which is appearing this week at #NeurIPS2024! RaVL discovers and mitigates spurious correlations in fine-tuned vision-language models (VLMs). 📄 Paper: arxiv.org/abs/2411.04097 💻 GitHub: github.com/Stanford-AIMI/…

thumb_up_off_alt18

chat_bubble_outline1

repeat5

shareShare

Dawn Song

@dawnsongtweets

8 months ago

🎉 Deeply honored that our paper "Decoding Trust: Comprehensive Assessment of Trustworthiness in GPT Models” which was awarded Outstanding Paper at NeurIPS 2023, has just been awarded Best Scientific Cybersecurity Paper of 2024, in collaboration with Bo Li Sanmi Koyejo

thumb_up_off_alt194

chat_bubble_outline23

repeat31

shareShare

Chulin Xie

@chulinxie

8 months ago

💻 Are Code Agents Safe? #NeurIPS2024 In RedCode, we evaluate the risks of code execution and generation in 19 code agents within real system environments. 🗓️ Thu 12 Dec | 4:30 PM – 7:30 PM PST 📍: West Ballroom A-D #5300 🔗: redcode-agent.github.io Stop by the RedCode poster

thumb_up_off_alt11

chat_bubble_outline0

repeat1

shareShare

Zinan Lin

@lin_zinan

7 months ago

🚀 Image AR models (𝗩𝗔𝗥 & 𝗟𝗹𝗮𝗺𝗮𝗚𝗲𝗻) can be distilled to 𝗢𝗡𝗘 step (up to 𝟮𝟭𝟴𝘅 𝗳𝗮𝘀𝘁𝗲𝗿) for the first time! See 𝑫𝒊𝒔𝒕𝒊𝒍𝒍𝒆𝒅 𝑫𝒆𝒄𝒐𝒅𝒊𝒏𝒈 ↓ 𝗪𝗲𝗯𝘀𝗶𝘁𝗲: imagination-research.github.io/distilled-deco… 𝗣𝗮𝗽𝗲𝗿: arxiv.org/abs/2412.17153 huggingface.co/papers/2412.17… (1/n)

thumb_up_off_alt34

chat_bubble_outline2

repeat8

shareShare

Xiang Yue@ICLR2025🇸🇬

@xiangyue96

6 months ago

Demystifying Long CoT Reasoning in LLMs arxiv.org/pdf/2502.03373 Reasoning models like R1 / O1 / O3 have gained massive attention, but their training dynamics remain a mystery. We're taking a first deep dive into understanding long CoT reasoning in LLMs! 11 Major

thumb_up_off_alt946

chat_bubble_outline12

repeat225

shareShare

Hejie Cui

@hennyjiecc

5 months ago

We build 𝗠𝗲𝗱𝗛𝗘𝗟𝗠✨: a comprehensive benchmark evaluating AI on realistic clinical tasks that healthcare professionals perform daily instead of just medical exams.👩‍⚕️⚕️ • Stanford HAI Blog: hai.stanford.edu/news/holistic-… • Leaderboard: crfm.stanford.edu/helm/medhelm/l…

thumb_up_off_alt33

chat_bubble_outline0

repeat5

shareShare

Virtue AI

@virtueai_co

4 months ago

We’ve raised $30M in Seed + Series A funding led by Lightspeed and Walden Catalyst Ventures, with participation from Prosperity7 Ventures, Factory, Osage University Partners (OUP), Lip-Bu Tan, Chris Re, and more. Virtue AI is the first unified platform for securing AI across

thumb_up_off_alt46

chat_bubble_outline3

repeat11

shareShare

Prateek Mittal

@prateekmittal_

3 months ago

Last week, I shared two #ICLR2025 papers that were recognized by their Award committee. Reflecting on the outcome, I thought it might be interesting to share that both papers were previously rejected by #NeurIPS2024. I found the dramatic difference in reviewer perception of

thumb_up_off_alt202

chat_bubble_outline4

repeat24

shareShare

Chulin Xie

Gate.io

Bill Yuchen Lin

Yangsibo Huang

Victor Reis

Tian Li

Chulin Xie

Maya Varma

Dawn Song

Chulin Xie

Zinan Lin

Xiang Yue@ICLR2025🇸🇬

Hejie Cui

Virtue AI

Prateek Mittal