
Valerie Chen
@valeriechen_
phd student @mldcmu @SCSatCMU + visitor @NYUDataScience | building @CopilotArena | previously @MSFTResearch @yale @CMU_Robotics @IBMResearch
ID: 1374055043230535685
https://valeriechen.github.io/ 22-03-2021 17:47:10
279 Tweet
1,1K Followers
480 Following



Can we use LLMs to generate high-quality *and* original text for creative tasks? We explore where existing models fall on these two axes and try to understand what techniques can push the frontier of novel LLM outputs. Check out Vishakh Padmakumar's thread for more details 👇




Got interviewed by The Wall Street Journal about coding and OpenAI ... then this drops 😲 w/ Valerie Chen

Who is winning the race to claim the LLMs for SWE market? We share our thoughts based on our Copilot Arena work. See article below for current sentiments and what lies ahead 👇

Excited to release my first lead project Magentic-UI at Microsoft Research, an OS web agent application designed for efficient human-agent interaction. CUA agents are cool but they're not so useful yet, Magentic-UI helps us study how to get value from them. github.com/microsoft/mage…


CDS PhD student Vishakh Padmakumar, with co-authors John (Yueh-Han) Chen, Jane Pan, Valerie Chen, and CDS Associate Professor He He, has published new research on the trade-off between originality and quality in LLM outputs. Read more: nyudatascience.medium.com/in-ai-generate…



Exciting new work led by Aditya Soni showing how a few tools can enable agents to solve diverse tasks — from software engineering 🧑💻 to information seeking 🔍. Even more exciting to see some of these contributions integrated into OpenHands👐! Check out 🧵for more details✨

The paper about this versatile agent, OpenHands-Versa, was lead by Aditya Soni at CMU, and you can read much more about the methodology: - His summary: x.com/Aditya_Soni_8/… - The paper: arxiv.org/abs/2506.03011 - Our blog: all-hands.dev/blog/building-…

Huge shout-out to Aditya Soni at CMU, who's amazing work on his paper laid the foundation for accuracy improvements on many tasks: x.com/Aditya_Soni_8/… And Juan at All Hands AI, who set up VersaBench to do such a diverse variety of benchmarking.

