Xueguang Ma (@xueguang_ma) Twitter Tweets • TwiCopy

Xueguang Ma

@xueguang_ma

+ Follow

PhD student at @uwaterloo. Working on encoding the world into vectors. Prev. intern at @Meta, @MSFTResearch, @amazon

ID: 969033784548052992

linkhttps://mxueguang.github.io/ calendar_today01-03-2018 02:17:12

164 Tweet

590 Followers

432 Following

SIGIR-AP 2025

@acmsigir_ap

5 months ago

Only 33 days left to submit your paper to #SIGIRAP2025! Don't miss the abstract deadline.

thumb_up_off_alt5

chat_bubble_outline0

repeat3

shareShare

Let's talk about Mixture-of-Transformers (MoT) and heterogeneous omni-model training. 1. Inspired by prior architectures consisting of modality-specific parameters—such as Flamingo, CogVLM, BEIT-3, and MoMA—MoT (arxiv.org/abs/2411.04996) pushes this idea further by using

thumb_up_off_alt129

chat_bubble_outline1

repeat4

shareShare

jack morris

@jxmnop

5 months ago

hello twittersphere! i am planning to graduate in a few months, so i am officially ✨ Looking For A Job ✨ if you know of a role that'd be a good fit, or just want to chat, please reach out! here are some projects i've worked on that i'm most proud of 👇

thumb_up_off_alt841

chat_bubble_outline33

repeat51

shareShare

Benjamin Clavié

@bclavie

5 months ago

Multimodal RAG: Just use ColPali/DSE then pass your screenshots to the LLM This is the dream, but how well do LLMs read text contained in images? We wanted to know, so we tried a simple thing: do results change on evals when using screenshots rather than text as input? Yes.

thumb_up_off_alt438

chat_bubble_outline16

repeat81

shareShare

Luyu Gao

@luyu_gao

5 months ago

Among papers I wrote, GradCache is one of my favorite. Very glad to see it still being useful🚀

thumb_up_off_alt26

chat_bubble_outline1

repeat1

shareShare

Rulin Shao

@rulinshao

5 months ago

🎉Our Spurious Rewards is available on ArXiv! We added experiments on - More prompts/steps/models/analysis... - Spurious Prompts! Surprisingly, we obtained 19.4% gains when replacing prompts with LaTex placeholder text (\lipsum) 😶‍🌫️ Check out our 2nd blog: tinyurl.com/spurious-prompt

$🎉Our Spurious Rewards is available on ArXiv! We added experiments on - More prompts/steps/models/analysis... - Spurious Prompts! Surprisingly, we obtained 19.4% gains when replacing prompts with LaTex placeholder text (\lipsum) 😶‍🌫️ Check out our 2nd blog: tinyurl.com/spurious-prompt$

thumb_up_off_alt219

chat_bubble_outline4

repeat40

shareShare

Xueguang Ma

@xueguang_ma

5 months ago

Congrats on the release!

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Jimmy Lin

@lintool

4 months ago

It’s been ~4 weeks since we launched Yupp – a consumer-first approach to robust & trustworthy AI evaluation. We’re still early but have already gathered 2M+ high-quality human preference feedback datapoints on 500+ models across diverse use cases. 🧵 x.com/pankaj/status/…

thumb_up_off_alt88

chat_bubble_outline8

repeat29

shareShare

Jimmy Lin

@lintool

4 months ago

It’s been 36 hours since Grok 4 launched and we have an early verdict based on 6K+ preferences of Yupp users globally on real use cases. ‼️ Grok 4 is worse than other leading models: OpenAI o3, Claude Opus 4, and Gemini 2.5 Pro. Grok 4 is liked even less than Grok 3. 🧵

It’s been 36 hours since Grok 4 launched and we have an early verdict based on 6K+ preferences of <a href="/yupp_ai/">Yupp</a> users globally on real use cases.

‼️ Grok 4 is worse than other leading models: OpenAI o3, Claude Opus 4, and Gemini 2.5 Pro. Grok 4 is liked even less than Grok 3. 🧵

thumb_up_off_alt1,1K

chat_bubble_outline113

repeat173

shareShare

Xueguang Ma

@xueguang_ma

4 months ago

Got my visa approved right before departure 😂. Heading to Padova for #SIGIR2025, looking forward to chatting about multi-modality/reasoning for information retrieval.

thumb_up_off_alt22

chat_bubble_outline0

repeat0

shareShare

AK

@_akhaliq

4 months ago

NeuralOS Towards Simulating Operating Systems via Neural Generative Models a generative OS that predicts screen images from user inputs, combining an RNN for computer state modeling and a diffusion model for rendering

thumb_up_off_alt860

chat_bubble_outline108

repeat119

shareShare

Yuntian Deng

@yuntiandeng

4 months ago

Can we build an operating system entirely powered by neural networks? Introducing NeuralOS: towards a generative OS that directly predicts screen images from user inputs. Try it live: neural-os.com Paper: huggingface.co/papers/2507.08… Inspired by Andrej Karpathy's vision. 1/5

thumb_up_off_alt159

chat_bubble_outline6

repeat34

shareShare

Michael Bendersky

@bemikelive

4 months ago

This is a good opportunity to announce that I recently joined the research team at Databricks where I will be working alongside Jonathan Frankle Rishabh Singh Matei Zaharia Erich Elsen, and many others on the hardest problems at the intersection of information retrieval and AI.

thumb_up_off_alt35

chat_bubble_outline1

repeat6

shareShare

Xueguang Ma

@xueguang_ma

4 months ago

ScholarCopilot (led by Yubo Wang) is now accepted at COLM 2025! Lots of great work has emerged over the past half year on improving interleaved search and reasoning. Looking forward to seeing them apply to the scientific writing tasks.

thumb_up_off_alt13

chat_bubble_outline0

repeat2

shareShare

Xueguang Ma

SIGIR-AP 2025

Victoria X Lin

jack morris

Benjamin Clavié

Luyu Gao

Rulin Shao

Xueguang Ma

Jimmy Lin

Jimmy Lin

Xueguang Ma

AK

Yuntian Deng

Michael Bendersky

Xueguang Ma