Pascal Biese (@xaiguydotagi)'s Twitter Profile
Pascal Biese

@xaiguydotagi

Weekly LLM Research Nuggets in my newsletter📲 AI/ML Engineer 🧑‍💻 The time to build is now 🦾 xaiguy.agi on 🧵

ID: 878228318977503233

Link: https://llmwatch.com

Joined: 23-06-2017 12:28:42

341 Tweets

163 Followers

63 Following

Jeremy Howard (@jeremyphoward)

“Nobody called LLMs” totally misunderstands how AI development happens. LLMs didn’t spontaneously appear for no reason. I knew they would work, I demonstrated them working in 2017, and I said at that time I think they’ll be a big deal.

Pascal Biese (@xaiguydotagi)

Supercharge Your Home Rig with Data Center-Level LLM Inference Speeds 🔥 💻 Ever dreamt of running data center-grade Large Language Model (LLM) inference on your personal gaming GPU? PowerInfer now makes this a reality. arxiv.org/abs/2312.12456

Pascal Biese (@xaiguydotagi)

LLMs and High-Stakes Decisions: Finally Becoming More Reliable? 🤔 A new approach rooted in the principles of social choice theory leverages a novel application of the Partial Borda Count (a method to merge ranked choices), which could dramatically improve the reliability of

Pascal Biese (@xaiguydotagi)

Turbulences Ahead: Are LLMs for Coding Robust or RUSTy? ✈ 🌪 Turbulence is a new benchmark tailored to test the integrity of instruction-tuned Large Language Models (LLMs) in code generation. Rather than just gauging if an AI can code, it zeroes in on the nuances: Can it

Pascal Biese (@xaiguydotagi)

Thou Shall Not Lie: LLMs might be getting a reality check ✔ Hallucinations in AI-generated content often spark intense discussions about the trustworthiness of large language models (LLMs). Now, a new approach called Truth Forest (TrFr) is aiming to significantly enhance
