Yangyi Chen (on job market) (@yangyichen6666) Twitter Tweets • TwiCopy

Yangyi Chen (on job market)

@yangyichen6666

+ Follow

CS Ph.D. student at UIUC @IllinoisCS, focusing on scalable foundation models. I’m on the industry job market, seeking full-time research scientist positions!

ID: 1430592529322287108

linkhttps://yangyi-chen.github.io/ calendar_today25-08-2021 18:07:09

397 Tweet

922 Followers

289 Following

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Thrilled to share my first project at NVIDIA! ✨ Today’s language models are pre-trained on vast and chaotic Internet texts, but these texts are unstructured and poorly understood. We propose CLIMB — Clustering-based Iterative Data Mixture Bootstrapping — a fully automated

thumb_up_off_alt312

chat_bubble_outline17

repeat55

shareShare

Shizhe Diao

@shizhediao

3 months ago

[Approach] ➤ Embeds and clusters web-scale data semantically. ➤ Searches, iteratively and efficiently, for optimal data mixtures using a lightweight proxy model + predictor loop. ➤ Learns how different domains interact, and how the right mix can unlock downstream performance

thumb_up_off_alt16

chat_bubble_outline1

repeat4

shareShare

Dylan

@dylan_works_

3 months ago

Decision: Tweet Comment: Okay, here is the summary of this Summary: Summary: Besides this picture, this message hallucinates the full name of our approach based on the acronym, which includes 2 words that appeared ZERO times in the entire paper. ICML Conference

thumb_up_off_alt27

chat_bubble_outline3

repeat5

shareShare

Xiusi Chen

@xiusi_chen

3 months ago

🚀 Can we cast reward modeling as a reasoning task? 📖 Introducing our new paper: RM-R1: Reward Modeling as Reasoning 📑 Paper: arxiv.org/pdf/2505.02387 💻 Code: github.com/RM-R1-UIUC/RM-… Inspired by recent advances of long chain-of-thought (CoT) on reasoning-intensive tasks, we

thumb_up_off_alt182

chat_bubble_outline3

repeat41

shareShare

Heng Ji

@hengjinlp

2 months ago

We are extremely excited to announce mCLM, a Modular Chemical Language Model that is friendly to automatable block-based chemistry and mimics bilingual speakers by “code-switching” between functional molecular modules and natural language descriptions of the functions. 1/2

thumb_up_off_alt95

chat_bubble_outline1

repeat27

shareShare

Yangyi Chen (on job market)

@yangyichen6666

2 months ago

Soft Soft Soft 🍰

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Mistral AI

@mistralai

2 months ago

Meet Devstral, our SOTA open model designed specifically for coding agents and developed with All Hands AI mistral.ai/news/devstral

Meet Devstral, our SOTA open model designed specifically for coding agents and developed with <a href="/allhands_ai/">All Hands AI</a>

mistral.ai/news/devstral

thumb_up_off_alt3,3K

chat_bubble_outline102

repeat431

shareShare

Hyeonjeong Ha

@hyeonjeong_ai

2 months ago

Thrilled to share that our paper has been accepted to #ACL2025 Main 🇦🇹 Huge thanks to my amazing collaborators and my advisor Heng Ji 🙃 📄arxiv.org/abs/2502.17793 Happy to chat about our work as well as MLLM research projects 🙌

Thrilled to share that our paper has been accepted to #ACL2025 Main 🇦🇹

Huge thanks to my amazing collaborators and my advisor <a href="/hengjinlp/">Heng Ji</a> 🙃
📄arxiv.org/abs/2502.17793

Happy to chat about our work as well as MLLM research projects 🙌

thumb_up_off_alt44

chat_bubble_outline0

repeat8

shareShare

Wei Ping

@_weiping

2 months ago

Introducing AceReason-Nemotron: Advancing math and code reasoning through reinforcement learning (RL) We propose conducting RL on math-only prompts first, then on code-only prompts. Our key findings include: - Math-only RL significantly boosts both math and code benchmarks! -

thumb_up_off_alt144

chat_bubble_outline1

repeat23

shareShare

Shizhe Diao

@shizhediao

2 months ago

Does RL truly expand a model’s reasoning🧠capabilities? Contrary to recent claims, the answer is yes—if you push RL training long enough! Introducing ProRL 😎, a novel training recipe that scales RL to >2k steps, empowering the world’s leading 1.5B reasoning model💥and offering

thumb_up_off_alt382

chat_bubble_outline17

repeat64

shareShare

Yang Chen

@ychennlp

a month ago

📢We conduct a systematic study to demystify the synergy between SFT and RL for reasoning models. The result? We trained a 7B model - AceReason-Nemotron-1.1, significantly improved from version 1.0 on math and coding benchmarks. ✅AIME2025 (math): 53.6% -> 64.8% ✅LiveCodeBench

thumb_up_off_alt120

chat_bubble_outline4

repeat28

shareShare

Martin Ziqiao Ma

@ziqiao_ma

a month ago

Can we scale 4D pretraining to learn general space-time representations that reconstruct an object from a few views at any time to any view at any other time? Introducing 4D-LRM: a Large Space-Time Reconstruction Model that ... 🔹 Predicts 4D Gaussian primitives directly from

thumb_up_off_alt99

chat_bubble_outline1

repeat39

shareShare

May Fung

@may_f1_

24 days ago

🧠 How can AI evolve from statically 𝘵𝘩𝘪𝘯𝘬𝘪𝘯𝘨 𝘢𝘣𝘰𝘶𝘵 𝘪𝘮𝘢𝘨𝘦𝘴 → dynamically 𝘵𝘩𝘪𝘯𝘬𝘪𝘯𝘨 𝘸𝘪𝘵𝘩 𝘪𝘮𝘢𝘨𝘦𝘴 as cognitive workspaces, similar to the human mental sketchpad? 🔍 What’s the 𝗿𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗿𝗼𝗮𝗱𝗺𝗮𝗽 from tool-use → programmatic

thumb_up_off_alt12

chat_bubble_outline0

repeat1

shareShare

Yangyi Chen (on job market)

@yangyichen6666

24 days ago

😂

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Zhenhailong Wang

@zhenhailongw

16 days ago

Learning to perceive while learning to reason! We introduce PAPO: Perception-Aware Policy Optimization, a direct upgrade to GRPO for multimodal reasoning. PAPO relies on internal supervision signals. No extra annotations, reward models, or teacher models needed. 🧵1/3

thumb_up_off_alt33

chat_bubble_outline1

repeat14

shareShare