Gagan Bansal (@bansalg_) 's Twitter Profile
Gagan Bansal

@bansalg_

Building @pyautogen | Human-Agent Interaction at Microsoft Research | Previously University of Washington, IITD

ID: 981050629

linkhttps://github.com/gagb calendar_today30-11-2012 19:08:05

760 Tweet

2,2K Followers

495 Following

Ahmed Awadallah (@ahmedhawadallah) 's Twitter Profile Photo

Introducing Phi-4-reasoning, adding reasoning models to the Phi family of SLMs. The model is trained with both supervised finetuning (using a carefully curated dataset of reasoning demonstration) and Reinforcement Learning. 📌Competitive results on reasoning benchmarks with

Introducing Phi-4-reasoning, adding reasoning models to the Phi family of SLMs.

The model is trained with both supervised finetuning (using a carefully curated dataset of reasoning demonstration) and Reinforcement Learning.

📌Competitive results on reasoning benchmarks with
Ece Kamar (@ecekamar) 's Twitter Profile Photo

Excited to share our latest Phi model, Phi4-reasoning, a small but powerful model that match the performance of much larger reasoning models up to DeepSeek R1. Here is the report for new insights into training reasoning models and evaluating them: lnkd.in/g_Pz5JQA

Ahmed Awadallah (@ahmedhawadallah) 's Twitter Profile Photo

Two colleagues recently used our 14-billion parameters Phi-4-reasoning model to ace graduate-level Linear Algebra and Calculus BC tests—scoring 100% and 69/70 respectively. Thanks to the amazing work of our Windows + Devices colleagues, this model now runs on-device on

Peter Lee (@peteratmsr) 's Twitter Profile Photo

In the realm of AI agents for doing complex tasks that require multi-step planning and browser use, Magentic-UI from Microsoft Research is at the cutting edge. I think you'll find it surprisingly useful. Now available in open source on Azure Foundry.

Ahmed Awadallah (@ahmedhawadallah) 's Twitter Profile Photo

A few months back, our team released Magentic-one -- showing how we can build multi-agent systems with AutoGen for complex web task completion. But how should humans interact with such systems? Magentic-UI shows how to build an agentic user experience, prioritizing

Saleema Amershi (@saleemaamershi) 's Twitter Profile Photo

🤖AI agents are getting better. 🙋‍♀️Human agency should too. Today, we're open-sourcing Magentic-UI, an experimental app built on #AutoGen to accelerate research in transparency, control, and human oversight of agentic systems. More 👇

Eric Horvitz (@erichorvitz) 's Twitter Profile Photo

We're pursuing a long-term vision for how AI can amplify human intellect and elevate decision-making in some of the most challenging problems in patient care: questions & directions that arise in tumor board meetings. More in my LinkedIn article: aka.ms/AAwau32

Harsha Nori (@harshanori) 's Twitter Profile Photo

Was a fantastic collaboration on bringing guidance to the full family of OpenAI models. The most comprehensive structured outputs meet the world's best models 🫶 github.com/guidance-ai Shoutout Michal Moskal Andrew Braunstein cc Michelle Pokrass Nikunj Handa Eric Horvitz Kevin Scott

Hussein Mozannar (@hsseinmzannar) 's Twitter Profile Photo

Excited to release my first lead project Magentic-UI at Microsoft Research, an OS web agent application designed for efficient human-agent interaction. CUA agents are cool but they're not so useful yet, Magentic-UI helps us study how to get value from them. github.com/microsoft/mage…

Eric Horvitz (@erichorvitz) 's Twitter Profile Photo

It’s been a pleasure collaborating with Shrey Jain & colleagues at Microsoft’s Health Care & Life Sciences and Microsoft Research on directions ahead for leveraging AI advances to help with cancer care. Stanford Health Care ARPA-H

AutoGen (@pyautogen) 's Twitter Profile Photo

1K on Magentic-UI ⭐️ Thank you! 🙏 star-history.com/#microsoft/mag… github.com/microsoft/mage… #starhistory #GitHub #OpenSource via Star History

Gagan Bansal (@bansalg_) 's Twitter Profile Photo

Magentic-UI + Ollama We are slowly adding more support for local models in our new open-source, human-centered browser use agent.

Wayne Chi (@iamwaynechi) 's Twitter Profile Photo

Crazy that it's been almost a decade since my last internship... Super excited to be at Microsoft Research this summer! Will hopefully build an awesome new agentic system with Gagan Bansal and Hussein Mozannar

Crazy that it's been almost a decade since my last internship...

Super excited to be at <a href="/MSFTResearch/">Microsoft Research</a> this summer! Will hopefully build an awesome new agentic system with <a href="/bansalg_/">Gagan Bansal</a> and <a href="/HsseinMzannar/">Hussein Mozannar</a>
Gagan Bansal (@bansalg_) 's Twitter Profile Photo

My recent talk on challenges in developing human-centered agents is now available online! It provides an HCI perspective of our learning from developing AutoGen youtube.com/watch?v=O5jSX8…

Omar Shaikh (@oshaikh13) 's Twitter Profile Photo

What if LLMs could learn your habits and preferences well enough (across any context!) to anticipate your needs? In a new paper, we present the General User Model (GUM): a model of you built from just your everyday computer use. 🧵

Gagan Bansal (@bansalg_) 's Twitter Profile Photo

The challenge of achieving complementary performance strikes again! h/t Adam Fourney Here's the same exact problem we talked about in the context of XAI a few years. and even with LLM, the same pattern continues to repeat!

The challenge of achieving complementary performance strikes again! 
h/t <a href="/adamfourney/">Adam Fourney</a> 

Here's the same exact problem we talked about in the context of XAI a few years. and even with LLM, the same  pattern continues to repeat!