Shreyas Bhat (@shreyasbhat23) Twitter Tweets • TwiCopy

translated cats

@translatedcats

a year ago

First time coming to the sea.

thumb_up_off_alt158,158K

chat_bubble_outline190

repeat12,12K

shareShare

Introducing 🔥SFR-RAG 🔥, an LLM specialized in RAG use cases, optimized to fully leverage contextual content for accurate and faithful response generation. We also introduce ContextualBench - A compilation of 7 popular contextual QA tasks measured in different settings. On

thumb_up_off_alt161

chat_bubble_outline2

repeat31

shareShare

QTIM Lab

@qtimlab

a year ago

big ups to Albert for an inspiring First Look talk at #WMIF2024. pls check it out! youtube.com/watch?v=axxTJM…… Mass General Cancer Center Damon Runyon Cancer Research Foundation MGH Martinos Center

big ups to Albert for an inspiring First Look talk at #WMIF2024. pls check it out! youtube.com/watch?v=axxTJM……
<a href="/MGHCancerCenter/">Mass General Cancer Center</a>
<a href="/DamonRunyon/">Damon Runyon Cancer Research Foundation</a>
<a href="/MGHMartinos/">MGH Martinos Center</a>

thumb_up_off_alt10

chat_bubble_outline1

repeat4

shareShare

Caiming Xiong

@caimingxiong

a year ago

Can your LLM Stay Faithful to Context, Even If "The Moon 🌕 is Made of Marshmallows 🍡"? We Introduce FaithEval, a new and comprehensive benchmark dedicated to evaluating contextual faithfulness for LLMs with 4.9K high-quality question-context pairs across 3 challenging tasks:

thumb_up_off_alt89

chat_bubble_outline3

repeat20

shareShare

Mass General Brigham

@massgenbrigham

a year ago

.MassGeneral News and Harvard Medical School school Investigator Gary Ruvkun, PhD, has won the 2024 The Nobel Prize in Physiology or Medicine. He is honored for his discovery of small regulatory RNAs. Congratulations! Learn more: massgeneralbrigham.org/en/about/newsr…

.<a href="/MassGeneralNews/">MassGeneral News</a> and <a href="/harvardmed/">Harvard Medical School</a> school Investigator Gary Ruvkun, PhD, has won the 2024 <a href="/NobelPrize/">The Nobel Prize</a> in Physiology or Medicine. He is honored for his discovery of small regulatory RNAs. Congratulations! Learn more:

massgeneralbrigham.org/en/about/newsr…

thumb_up_off_alt97

chat_bubble_outline1

repeat28

shareShare

Rajaswa Patil

@rajaswapatil

a year ago

A junior from BITS Pilani, Goa shared today's mid-sem exam paper for their 'Generative AI' course. Coincidentally, one of the questions (Q4) was on Hopfield Networks—right as John Hopfield was awarded the Nobel Prize in Physics for it today. So cool!

A junior from <a href="/BITSPilaniGoa/">BITS Pilani, Goa</a> shared today's mid-sem exam paper for their 'Generative AI' course.

Coincidentally, one of the questions (Q4) was on Hopfield Networks—right as John Hopfield was awarded the Nobel Prize in Physics for it today. So cool!

thumb_up_off_alt72

chat_bubble_outline6

repeat1

shareShare

Prateek Yadav

@prateeky2806

a year ago

I'm on the job market! Please reach out if you are looking to hire someone to work on - RLHF - Efficiency - MoE/Modular models - Synthetic Data - Test time compute - other phases of pre/post-training. If you are not hiring then I would appreciate a retweet! More details👇

thumb_up_off_alt213

chat_bubble_outline8

repeat60

shareShare

jack morris

@jxmnop

a year ago

no AI here, just the coolest paper i've seen in a while

thumb_up_off_alt25,25K

chat_bubble_outline114

repeat1,1K

shareShare

Deedy

@deedydas

a year ago

This deep tech startup is solving air pollution in India. They make affordable, powerful, low maintenance (filterless), high-volume air purification systems for both factories (MK II) and homes (Hive). It's called Praan, and here's their incredible story: 1/5

thumb_up_off_alt1,1K

chat_bubble_outline42

repeat144

shareShare

Sharut Gupta

@sharut_gupta

10 months ago

Grateful to MIT CSAIL Alliances for having me on their podcast! It was a joy sharing how our recent work trains machines to self-adapt to new tasks and scenarios! Paper: lnkd.in/g2g-RcDb Podcast: bit.ly/40IVZ7v

thumb_up_off_alt29

chat_bubble_outline2

repeat2

shareShare

Shrey Pandit

@shreypandit2001

10 months ago

🚨 New Research Alert! 🚨 Happy to share my latest work - MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models 📄Arxiv - arxiv.org/abs/2502.14302 🤗HuggingFace Dataset - huggingface.co/datasets/UTAus… 🌐Website - medhallu.github.io

thumb_up_off_alt15

chat_bubble_outline2

repeat5

shareShare

carmen

@carmguti

10 months ago

how it feels to do anything with claude

thumb_up_off_alt3,3K

chat_bubble_outline45

repeat193

shareShare

Harj Taggar

@harjtaggar

9 months ago

Autists had a great run, the AI future belongs to ADHD

thumb_up_off_alt9,9K

chat_bubble_outline395

repeat811

shareShare

Arul Murugan

@_arulm_

9 months ago

thumb_up_off_alt142

chat_bubble_outline5

repeat13

shareShare

Shubham Gandhi

@shubhamrgandhi

7 months ago

🚨New preprint🚨 I’m super excited to share our work: An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation 📜: arxiv.org/abs/2505.20182 w/ Atharva Naik , Yiqing Xie and Carolyn Rose 🧵

thumb_up_off_alt10

chat_bubble_outline1

repeat5

shareShare

jack morris

@jxmnop

6 months ago

most foundational concept in deep learning that no one understands is probably the Neural Tangent Kernel (NTK) this line of work studies neural networks of *infinite width*, which explain a lot about normal finite-width NNs and there is exactly one Very Good blog post on them:

thumb_up_off_alt1,1K

chat_bubble_outline25

repeat122

shareShare

Shrey Pandit

@shreypandit2001

4 months ago

Excited to share that our paper, “MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models,” was accepted to EMNLP 2025 (Main)! Huge thanks to my co-authors Jiawei Xu, Junyuan "Jason" Hong, Atlas Wang, Tianlong Chen, Kaidi Xu, and Ying Ding!

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Caiming Xiong

@caimingxiong

3 months ago

Meet SFR-DeepResearch (SFR-DR) 🤖: our RL-trained autonomous agents that can reason, search, and code their way through deep research tasks. 🚀SFR-DR-20B achieves 28.7% on Humanity's Last Exam (text-only) using only web search 🔍, browsing 🌐, and Python interpreter 🐍,

thumb_up_off_alt956

chat_bubble_outline28

repeat146

shareShare

Caiming Xiong

@caimingxiong

2 months ago

📊Excited to introduce our Hard2Verify from Salesforce AI Research , the benchmark for measuring the ability of verifiers to provide step-level correctness labels for model-generated responses to largely open-ended, frontier-level math problems📚 The current paradigm of RLVR requires

📊Excited to introduce our Hard2Verify from <a href="/SFResearch/">Salesforce AI Research</a> , the benchmark for measuring the ability of verifiers to provide step-level correctness labels for model-generated responses to largely open-ended, frontier-level math problems📚

The current paradigm of RLVR requires

thumb_up_off_alt134

chat_bubble_outline2

repeat35

shareShare

Caiming Xiong

@caimingxiong

2 months ago

One of the key challenges for building web-based “deep research” agents is to construct sufficiently difficult long-horizon agentic data. At Salesforce AI Research, We introduce ProgSearch, a controlled data synthesis pipeline that builds tasks of increasing complexity until a frontier

One of the key challenges for building web-based “deep research” agents is to construct sufficiently difficult long-horizon agentic data.
At <a href="/SFResearch/">Salesforce AI Research</a>, We introduce ProgSearch, a controlled data synthesis pipeline that builds tasks of increasing complexity until a frontier

thumb_up_off_alt113

chat_bubble_outline3

repeat27

shareShare