Yuxiang Wei (@yuxiangwei9) 's Twitter Profile
Yuxiang Wei

@yuxiangwei9

PhD candidate @IllinoisCDS | Researcher @AIatMeta (Meta FAIR). Code LLM training.

ID: 1449574367076110338

linkhttps://yuxiang.cs.illinois.edu calendar_today17-10-2021 03:14:18

75 Tweet

620 Followers

253 Following

Lingming Zhang (@lingmingzhang) 's Twitter Profile Photo

“On SWE-bench Verified, GPT-4o resolves 33.2% of samples, with the best performing open-source scaffold, Agentless.” Wow, our Agentless😺 is used by OpenAI as the default technique to evaluate the performance of GPT-4o on SWE-bench! 🧑‍💻 github.com/OpenAutoCoder/…

Yuxiang Wei (@yuxiangwei9) 's Twitter Profile Photo

Thanks Tanishq Abraham is at ICML for sharing our work! It's been an amazing summer interning at SnowflakeDB AI Research team. I started with no pretraining experience but learned so much. Special thanks to the Arctic team for their invaluable support!! Hojae Han Rajhans Samdani

LLM4Code (@llm4code) 's Twitter Profile Photo

📢Announcing the 2nd workshop on #LLM4Code, co-located with @ICSEConf 2025 in Ottawa, Canada 🇨🇦! 🎯We are calling for submissions: 🌟Website: llm4code.github.io 🌟Deadline: Nov 18, 2024 🌟8-page research paper / 4-page position paper (including references) 🚀Calling for

📢Announcing the 2nd workshop on #LLM4Code, co-located with @ICSEConf 2025 in Ottawa, Canada 🇨🇦!

🎯We are calling for submissions:

🌟Website: llm4code.github.io 
🌟Deadline: Nov 18, 2024
🌟8-page research paper / 4-page position paper (including references)

🚀Calling for
Chenyuan (@cy1yang) 's Twitter Profile Photo

How to leverage the white-box info (i.e. source code) for fuzzing compilers? Check out our work “WhiteFox 🦊: White-Box Compiler Fuzzing Empowered by Large Language Models” at OOPSLA 2024! w/ Yinlin Deng, Runyu Lu, JIayi Yao, Jiawei Liu, Reyhan, and Lingming Zhang (1/N)

How to leverage the white-box info (i.e. source code) for fuzzing compilers?

Check out our work “WhiteFox 🦊: White-Box Compiler Fuzzing Empowered by Large Language Models” at OOPSLA 2024!

w/ <a href="/yinlin_deng/">Yinlin Deng</a>, <a href="/lry89757/">Runyu Lu</a>, JIayi Yao, <a href="/JiaweiLiu_/">Jiawei Liu</a>, <a href="/Reyhaneh/">Reyhan</a>, and <a href="/LingmingZhang/">Lingming Zhang</a> (1/N)
Jiawei Liu (@jiaweiliu_) 's Twitter Profile Photo

Human preference has been commonly used for RLHF and eval — but: 🤔How reliable is that for reasoning-heavy areas such as codegen? 🤔Can models effectively predict code preference? We present techniques, evals, & studies to demystify code preference: llm-code-preference.github.io 🧵

Human preference has been commonly used for RLHF and eval — but:

🤔How reliable is that for reasoning-heavy areas such as codegen?
🤔Can models effectively predict code preference?

We present techniques, evals, &amp; studies to demystify code preference:
llm-code-preference.github.io

🧵
Yiling Lou (@yiling__lou) 's Twitter Profile Photo

🚨Just one week left to submit to #LLM4Code25!(Co-located with ICSE) We also offer a non-archival option, if selected, your camera-ready will be shared only on our workshop website, not the proceedings (no need to worry about double submission if you have other submission plan)

Yuxiang Wei (@yuxiangwei9) 's Twitter Profile Photo

Interested in code generation and instruction tuning without distillation? Please drop by our #NeurIPS poster (East Axhibit # 2502) this afternoon (Friday / Dec 13)! Sorry to miss NeurIPS this year 🥺 but Jiawei Liu will be there to present the poster. Talk to him!

Interested in code generation and instruction tuning without distillation? Please drop by our #NeurIPS poster (East Axhibit # 2502) this afternoon (Friday / Dec 13)!

Sorry to miss NeurIPS this year 🥺 but <a href="/JiaweiLiu_/">Jiawei Liu</a> will be there to present the poster. Talk to him!
Terry Yue Zhuo (@terryyuezhuo) 's Twitter Profile Photo

Happy to release SWE Arena, your vibe coding platform! SWE Arena supports real-time code execution and rendering, covering various frontier LLMs & VLMs! We actually had this idea two years ago inside BigCode with Arjun Guha and Daniel Fried. However, there wasn't much tech

AK (@_akhaliq) 's Twitter Profile Photo

Meta just dropped SWE-RL Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Trained on top of Llama 3, our resulting reasoning model, Llama3-SWE-RL-70B, achieves a 41.0% solve rate on SWE-bench Verified -- a human-verified collection of real-world

Meta just dropped SWE-RL

Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Trained on top of Llama 3, our resulting reasoning model, Llama3-SWE-RL-70B, achieves a 41.0% solve rate on SWE-bench Verified -- a human-verified collection of real-world
Gabriel Synnaeve (@syhw) 's Twitter Profile Photo

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution arxiv.org/abs/2502.18449 by Yuxiang Wei Sida Wang and the whole team! Get started with your favorite model here github.com/facebookresear…

Alex Gu @ iclr (@minimario1729) 's Twitter Profile Photo

📢 Excited to share our new paper: Challenges and Paths Towards AI for SWE We discuss: 🛠️ 6 sub-tasks needed for SWE 🤖 9 challenges of today's AI in SWE 🔮 9 future directions to address the challenges w/ collaborators from MIT, Berkeley, Cornell, Stanford, and UPenn ⬇️ (1/n)

📢 Excited to share our new paper: Challenges and Paths Towards AI for SWE

We discuss:
🛠️ 6 sub-tasks needed for SWE
🤖 9 challenges of today's AI in SWE
🔮 9 future directions to address the challenges

w/ collaborators from MIT, Berkeley, Cornell, Stanford, and UPenn

⬇️ (1/n)