Shreyas Bhat (@shreyasbhat23)'s Twitter Profile
Shreyas Bhat

@shreyasbhat23

CS PhD @unc | Prev. research @qtimlab @harvardmed | @BITSPilaniGoa | @SforAiDL | AI, medical imaging, drug discovery

ID: 3317852388

Link: http://shreyas.xyz/
Joined: 17-08-2015 16:02:04

382 Tweets

317 Followers

923 Following

Caiming Xiong (@caimingxiong)

Introducing 🔥SFR-RAG 🔥, an LLM specialized in RAG use cases, optimized to fully leverage contextual content for accurate and faithful response generation.
We also introduce ContextualBench - A compilation of 7 popular contextual QA tasks measured in different settings. On
Caiming Xiong (@caimingxiong)

Can your LLM Stay Faithful to Context, Even If "The Moon 🌕 is Made of Marshmallows 🍡"?

We Introduce FaithEval, a new and comprehensive benchmark dedicated to evaluating contextual faithfulness for LLMs with 4.9K high-quality question-context pairs across 3 challenging tasks:
Mass General Brigham (@massgenbrigham)

MassGeneral News and Harvard Medical School Investigator Gary Ruvkun, PhD, has won the 2024 Nobel Prize in Physiology or Medicine. He is honored for his discovery of small regulatory RNAs. Congratulations! Learn more: massgeneralbrigham.org/en/about/newsr…

Rajaswa Patil (@rajaswapatil)

A junior from BITS Pilani, Goa shared today's mid-sem exam paper for their 'Generative AI' course. Coincidentally, one of the questions (Q4) was on Hopfield Networks—right as John Hopfield was awarded the Nobel Prize in Physics for it today. So cool!

Prateek Yadav (@prateeky2806)

I'm on the job market! Please reach out if you are looking to hire someone to work on:
- RLHF
- Efficiency
- MoE/Modular models
- Synthetic Data
- Test time compute
- other phases of pre/post-training

If you are not hiring then I would appreciate a retweet! More details👇

Deedy (@deedydas)

This deep tech startup is solving air pollution in India.

They make affordable, powerful, low maintenance (filterless), high-volume air purification systems for both factories (MK II) and homes (Hive).

It's called Praan, and here's their incredible story:

1/5
Sharut Gupta (@sharut_gupta)

Grateful to MIT CSAIL Alliances for having me on their podcast! It was a joy sharing how our recent work trains machines to self-adapt to new tasks and scenarios! Paper: lnkd.in/g2g-RcDb Podcast: bit.ly/40IVZ7v

Shrey Pandit (@shreypandit2001)

🚨 New Research Alert! 🚨
Happy to share my latest work -  
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

📄Arxiv - arxiv.org/abs/2502.14302
🤗HuggingFace Dataset - huggingface.co/datasets/UTAus…
🌐Website - medhallu.github.io
Shubham Gandhi (@shubhamrgandhi)

🚨New preprint🚨 I’m super excited to share our work: An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation 📜: arxiv.org/abs/2505.20182 w/ Atharva Naik, Yiqing Xie and Carolyn Rose 🧵

jack morris (@jxmnop)

most foundational concept in deep learning that no one understands is probably the Neural Tangent Kernel (NTK)

this line of work studies neural networks of *infinite width*, which explain a lot about normal finite-width NNs

and there is exactly one Very Good blog post on them:
Shrey Pandit (@shreypandit2001)

Excited to share that our paper, “MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models,” was accepted to EMNLP 2025 (Main)! Huge thanks to my co-authors Jiawei Xu, Junyuan "Jason" Hong, Atlas Wang, Tianlong Chen, Kaidi Xu, and Ying Ding!

Caiming Xiong (@caimingxiong)

Meet SFR-DeepResearch (SFR-DR) 🤖: our RL-trained autonomous agents that can reason, search, and code their way through deep research tasks.

🚀SFR-DR-20B achieves 28.7% on Humanity's Last Exam (text-only) using only web search 🔍, browsing 🌐, and Python interpreter 🐍,
Caiming Xiong (@caimingxiong)

📊Excited to introduce our Hard2Verify from Salesforce AI Research, the benchmark for measuring the ability of verifiers to provide step-level correctness labels for model-generated responses to largely open-ended, frontier-level math problems📚 The current paradigm of RLVR requires

Caiming Xiong (@caimingxiong)

One of the key challenges for building web-based “deep research” agents is to construct sufficiently difficult long-horizon agentic data. At Salesforce AI Research, we introduce ProgSearch, a controlled data synthesis pipeline that builds tasks of increasing complexity until a frontier
