Yi Lin Sung (on job market) (@yilin_sung)'s Twitter Profile
Yi Lin Sung (on job market)

@yilin_sung

On the Industry Job Market | PhD at @unccs | Prev: @Google @MetaAI @MSFTResearch | Multimodality, Efficiency, LLM Adaptation

ID: 1282584932

Link: https://ylsung.github.io/ | Joined: 20-03-2013 07:14:28

326 Tweets

674 Followers

811 Following

fly51fly (@fly51fly):

[LG] RSQ: Learning from Important Tokens Leads to Better Quantized LLMs
Y Sung, P Yadav, J Li, J Yoon... [UNC at Chapel Hill] (2025)
arxiv.org/abs/2503.01820
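
The core idea, upweighting important tokens in the quantization objective, can be shown with a minimal sketch. This is not the paper's implementation; the loss form, tensor shapes, and the importance scores are placeholder assumptions:

```python
import torch

def token_weighted_recon_loss(fp_out, q_out, token_importance):
    """Layer-wise reconstruction loss where each token's error is scaled
    by an externally supplied importance score (hypothetical here)."""
    # fp_out, q_out: (batch, seq, hidden) outputs of the full-precision
    # and quantized layer; token_importance: (batch, seq), non-negative
    per_token_err = (fp_out - q_out).pow(2).mean(dim=-1)  # (batch, seq)
    weights = token_importance / token_importance.sum()   # normalize
    return (weights * per_token_err).sum()
```
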
Justin Chih-Yao Chen (@cyjustinchen):

🚨 We introduce ✨ Symbolic-MoE ✨ which uses skill-based instance-level recruiting to dynamically combine LLMs, allowing three 7-8B LLMs to beat GPT4o-mini and Llama3.3 70B across challenging + diverse reasoning tasks (MMLU-Pro, AIME, GPQA, MedMCQA) while running on 1 GPU!

Key
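
A rough sketch of what skill-based, instance-level recruiting could look like; the skill profiles, model names, and scoring rule below are invented for illustration and are not the paper's actual procedure:

```python
# Hypothetical skill profiles: how well each expert model handles each skill.
EXPERT_SKILLS = {
    "expert-math-7b":   {"math": 0.9, "code": 0.6, "biomed": 0.3},
    "expert-code-8b":   {"math": 0.5, "code": 0.8, "biomed": 0.4},
    "expert-biomed-7b": {"math": 0.2, "code": 0.3, "biomed": 0.9},
}

def recruit(question_skills, k=2):
    """Pick the k experts whose profiles best cover this instance's skills."""
    def coverage(model):
        return sum(EXPERT_SKILLS[model].get(s, 0.0) for s in question_skills)
    return sorted(EXPERT_SKILLS, key=coverage, reverse=True)[:k]

# A medical-math question recruits the math and biomed specialists.
print(recruit({"math", "biomed"}))  # ['expert-math-7b', 'expert-biomed-7b']
```
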
Shoubin Yu✈️ICLR 2025🇸🇬 (@shoubin621):

🚨 Introducing VEGGIE 🥦—a unified, end-to-end, and versatile instructional video generative model.

Current video editing methods struggle with:
1. Understanding direct user instructions
2. Handling diverse editing skills in one model
3. Balancing multiple training
Eli Chien (@chien_eli):

Life Update: I am happy to share the news that I will be an Assistant Professor at the National Taiwan University EE department! I am very grateful for this opportunity to be back in my home country, especially at the university where I was an undergrad! 1/4

Elias Stengel-Eskin (on the faculty job market) (@eliaseskin):

🚨Announcing TaCQ 🚨 a new mixed-precision quantization method that identifies critical weights to preserve. We integrate key ideas from circuit discovery, model editing, and input attribution to improve low-bit quant., w/ 96% 16-bit acc. at 3.1 avg bits (~6x compression)
Hanqi Xiao (@hanqi_xiao):

Excited to share my first paper as first author: “Task-Circuit Quantization” 🎉 I led this work to explore how interpretability insights can drive smarter model compression. Big thank you to Elias Stengel-Eskin, Yi Lin Sung (on job market), and Mohit Bansal for mentorship and collaboration. More to come!

Yi Lin Sung (on job market) (@yilin_sung):

Introducing TaCQ: a new mixed-precision quantization method that preserves performance at low bit-widths (2–3 bits).
1⃣ Keeps task-critical weights in 16-bit and quantizes the rest
2⃣ Uses a novel saliency metric inspired by model editing and interpretability
3⃣ Beats the strongest
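
A minimal sketch of the mixed-precision step in 1⃣ (protect the most salient weights in 16-bit, uniformly quantize the rest). The saliency scores are taken as an input here, since TaCQ's own metric (2⃣) is the paper's contribution; names and defaults are illustrative:

```python
import torch

def mixed_precision_quantize(weight, saliency, keep_frac=0.01, n_bits=3):
    """Keep the top `keep_frac` most salient weights in 16-bit precision
    and apply uniform symmetric quantization to everything else."""
    flat = saliency.flatten()
    k = max(1, int(keep_frac * flat.numel()))
    thresh = flat.topk(k).values.min()
    keep_mask = saliency >= thresh  # weights protected from quantization

    # Uniform symmetric quantization of the remaining weights.
    qmax = 2 ** (n_bits - 1) - 1
    scale = weight.abs().max() / qmax
    q = torch.clamp(torch.round(weight / scale), -qmax - 1, qmax) * scale

    # Protected weights are stored at fp16 precision (cast back here only
    # for dtype compatibility in this sketch).
    return torch.where(keep_mask, weight.half().float(), q)
```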

Jialu Li (@jialuli96):

🚀New paper out - We present Video-MSG (Multimodal Sketch Guidance), a novel planning-based training-free guidance method for T2V models, improving control of spatial layout and object trajectories.

🔧 Key idea:
• Generate a Video Sketch — a spatio-temporal plan with

Han Wang (@hanwang98):

🚨Real-world retrieval is messy: queries can be ambiguous, or documents may conflict/have incorrect/irrelevant info.
How can we jointly address all these problems?

We introduce:
➡️ RAMDocs, a challenging dataset with ambiguity, misinformation, and noise.
➡️ MADAM-RAG, a
Vaidehi Patil (@vaidehi_patil_):

🚨 Introducing our Transactions on Machine Learning Research paper “Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation”

We present UnLOK-VQA, a benchmark to evaluate unlearning in vision-and-language models—where both images and text may encode sensitive or private
Jaemin Cho (on faculty job market) (@jmin__cho):

Sharing some personal updates 🥳:
- I've completed my PhD at UNC Computer Science! 🎓
- Starting Fall 2026, I'll be joining the Computer Science dept. at Johns Hopkins University (JHU Computer Science) as an Assistant Professor 💙
- Currently exploring options + finalizing the plan for my gap year (Aug
Jaehong Yoon (on the faculty job market) (@jaeh0ng_yoon):

Thrilled to share that I’ll be joining the College of Computing and Data Science at Nanyang Technological University (NTU Singapore) as an Assistant Professor, starting in August 2025 🇸🇬🥳

I’ll continue my research on building trustworthy and continually adaptable multimodal AI,
Daeun Lee (@danadaeun):

Excited to share Video-Skill-CoT🎬🛠️– a new framework for domain-adaptive video reasoning with skill-aware Chain-of-Thought (CoT) supervision!

⚡️Key Highlights:
➡️ Automatically extracts domain-specific reasoning skills from questions and organizes them into a unified taxonomy,

Han Guo (@hanguo97):

We know Attention and its linear-time variants, such as linear attention and State Space Models. But what lies in between?

Introducing Log-Linear Attention with:

- Log-linear time training
- Log-time inference (in both time and memory)
- Hardware-efficient Triton kernels
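
As a rough intuition for log-time decoding, here is a toy step in the spirit of linear attention where the prefix is covered by O(log t) power-of-two chunks (Fenwick-tree style), each reduced to one cached summary with its own weight. This only illustrates the complexity argument; it is not the paper's algorithm or its Triton kernels:

```python
import torch

def log_linear_attn_step(q, level_summaries, level_weights):
    """One decoding step over O(log t) cached chunk summaries.

    level_summaries: list of (kTv, kT1) pairs, one per active power-of-two
    chunk of the prefix, with kTv = sum_i k_i v_i^T of shape (d, d) and
    kT1 = sum_i k_i of shape (d,). Per-level weights let each scale
    contribute differently (equal weights would collapse the levels into
    a single sum, i.e. plain linear attention).
    """
    num = torch.zeros_like(q)   # (d,)
    den = q.new_zeros(())       # scalar
    for lam, (kTv, kT1) in zip(level_weights, level_summaries):
        num = num + lam * (q @ kTv)
        den = den + lam * (q @ kT1)
    return num / (den + 1e-6)
```
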
Hanqi Xiao (@hanqi_xiao):

🎉 Excited to share that TaCQ (Task-Circuit Quantization), our work on knowledge-informed mixed-precision quantization, has been accepted to #COLM2025 (Conference on Language Modeling)! Happy to see that TaCQ was recognized with high scores and a nice shoutout from the AC – big thanks to Elias Stengel-Eskin

Elias Stengel-Eskin (on the faculty job market) (@eliaseskin):

🎉 Very excited to see TaCQ — our work on task-conditioned mixed-precision quantization that draws on interpretability methods — accepted to the Conference on Language Modeling (#COLM2025) with strong scores and a nice shoutout from the AC! Kudos to Hanqi on leading this effort!

Ziyang Wang (@ziyangw00):

🚨Introducing Video-RTS: Resource-Efficient RL for Video Reasoning with Adaptive Video TTS!

While RL-based video reasoning with LLMs has advanced, the reliance on large-scale SFT with extensive video data and long CoT annotations remains a major bottleneck.

Video-RTS tackles
Han Lin (@hanlin_hl):

🤔 Can we bridge MLLMs and diffusion models more natively and efficiently, by having MLLMs produce patch-level CLIP latents already aligned with their visual encoders, while fully preserving MLLM's visual reasoning capabilities?

Introducing Bifrost-1: 🌈

> High-Fidelity
Jaemin Cho (on faculty job market) (@jmin__cho):

📢 Introducing RotBench, which tests whether SoTA MLLMs (e.g., GPT-5, GPT-4o, o3, Gemini-2.5-pro) can identify the rotation of input images (0°, 90°, 180°, and 270°). Even frontier MLLMs struggle at this spatial reasoning task that humans solve with >98% Acc.

➡️ Models struggle
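
A sketch of what a RotBench-style evaluation loop might look like; `model`, its prompt interface, and the rotation convention are placeholders, and the benchmark's actual prompts and scoring live in the paper:

```python
from PIL import Image

ROTATIONS = [0, 90, 180, 270]  # candidate rotations tested per image

def eval_rotation_id(model, image_paths):
    """Rotate each image by each candidate angle, ask the (hypothetical)
    model to identify the rotation, and report overall accuracy."""
    correct = total = 0
    for path in image_paths:
        img = Image.open(path)
        for angle in ROTATIONS:
            # PIL rotates counter-clockwise; negate for a clockwise
            # convention (an assumption in this sketch).
            rotated = img.rotate(-angle, expand=True)
            pred = model(rotated, "By how many degrees is this image rotated?")
            correct += int(pred == angle)
            total += 1
    return correct / total
```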