Davis Liang (@liangdavis)'s Twitter Profile
Davis Liang

@liangdavis

ML @AbridgeHQ. Prev: Research Scientist (@MetaAI), Applied Scientist (@awscloud).

ID: 306922539

Link: https://www.davisliang.com
Joined: 28-05-2011 17:47:35

168 Tweets

368 Followers

279 Following

Sam Bowman (@sleepinyourhat)

I gave a talk! You can watch it!

Covering: Scalable oversight, AI-AI debate, hard QA datasets, and getting truthful answers out of AI systems in domains we don't know much about.
Cecilia Ziniti (@ceciliazin)

🧵 The historic NYT v. OpenAI lawsuit filed this morning, as broken down by me, an IP and AI lawyer, general counsel, and longtime tech person and enthusiast.

Tl;dr - It's the best case yet alleging that generative AI is copyright infringement. Thread. 👇
Zachary Lipton (@zacharylipton)

Had the privilege of picking the brain of the most inspiring technology leader alive. Thanks Jensen, NVIDIA, and Lightspeed for giving Shiv Rao, MD & I this opportunity to share our vision for Abridge & for making my Taiwanese family proud. Love you Annie Hui-Hsin Hsieh 謝蕙馨, flying
Abridge (@abridgehq)

📣 We are thrilled to announce a research collaboration and investment from NVIDIA to help us scale our multilingual clinical conversation platform across the entire U.S. healthcare system.

Learn more here: bit.ly/3TlXsgh
Sebastian Ruder (@seb_ruder)

Ahia et al. (2023; aclanthology.org/2023.emnlp-mai…) observed that the same is true for current LLMs such as ChatGPT: They segment text in non-English languages into many more tokens and are thus much more costly to use in such languages.

They call this “double unfairness”: higher prices
Abridge (@abridgehq)

We’re thrilled to be featured on the Forbes AI 50, alongside companies that inspire us such as OpenAI, Anthropic, Databricks, and others.

It’s a special privilege to represent the impact of AI in healthcare, improving the care delivery experience at scale for both clinicians
Alexandr Wang (@alexandr_wang)

How overfit are popular LLMs on public benchmarks?

New research out of @scale_ai SEAL to answer this:

- produced a new eval GSM1k
- evaluated public LLMs for overfitting on GSM8k

VERDICT: Mistral & Phi are overfitting benchmarks, while GPT, Claude, Gemini, and Llama are not.
Abridge (@abridgehq)

🌐 We loved presenting at the Out-Of-Pocket Gen AI x Healthcare Ops Hackathon. Our very own Davis Liang demoed Abridge and spoke about the importance of multilinguality in health tech.

🗣️ Did you know?

•Over 350 languages are spoken in the United States.
•20% of Americans
Lucas Bandarkar (@lucasbandarkar)

We presented Belebele at ACL 2024 this week! (Thx to Davis Liang and Satya Narayan Shukla) A year on from its release, it’s been really cool to see the diversity of research projects that have used it. The field is in dire need of more multilingual benchmarks !

Nikhil Krishnan (@nikillinit)

new post - we took a look at behind the technical curtain some of the interesting engineering challenges behind a company (Abridge) training their own LLMs.

-dealing with multiple languages
-handling model drift
-generalist models vs. healthcare specific ones

and more
Zachary Lipton (@zacharylipton)

Evaluation of ambient scribes is a formidable task due to the free-form nature of generated text. Rigor requires automated metrics, strong benchmarks, clinician-in-loop trials, and in vivo testing. Learn how we're tackling these challenges at Abridge: abridge.com/ai/science-ai-…

Shiv Rao, MD (@shivdevrao)

1. Healthcare systems need enterprise-grade AI solutions they can trust. Read our latest whitepaper to learn more about what enterprise-grade AI for healthcare looks like: abridge.com/ai/science-ai-…

Pranav (@pranavmani30)

Does adapting general-domain models to medical-domain actually help w med-domain tasks? Stop by at Tuttle Hall, 230p EST, Nov 14 EMNLP 2025 to catch the amazing Daniel P Jeong present his 🚀oral 🚀talk. Super glad to be part of this work w Daniel P Jeong Saurabh Garg

Davis Liang (@liangdavis)

Interested in the intersection of healthcare and cutting edge applied science (LLMs, multilinguality, multimodal models, speech recognition, etc.)? We’re hiring machine learning scientists at Abridge 🚀 DM me or apply here: abridge.com/life-at-abridge

Abridge (@abridgehq)

❤️ At Abridge, feedback is our oxygen—and the most heartfelt feedback we get finds its way into a special internal channel we call "𝗟𝗼𝘃𝗲 𝗦𝘁𝗼𝗿𝗶𝗲𝘀." This holiday season, we asked Abridgers to share their favorite 𝗟𝗼𝘃𝗲 𝗦𝘁𝗼𝗿𝗶𝗲𝘀. 𝗪𝗮𝘁𝗰𝗵 𝘁𝗵𝗲