Robert Stojnic (@rbstojnic) 's Twitter Profile

Building something new. ⌛Past: Llama 2 and Llama 3 technical leadership at Meta AI, Papers with Code co-creator.

ID: 841370824544251904

Joined: 13-03-2017 19:30:11

280 Tweets

2.2K Followers

542 Following

elvis (@omarsar0) 's Twitter Profile Photo

Mitigating Hallucination in LLMs

This paper summarizes 32 techniques to mitigate hallucination in LLMs. 

Introduces a taxonomy categorizing methods like RAG, Knowledge Retrieval, CoVe, and more. 

Provides tips on how to apply these methods and highlights the challenges and
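The retrieval-augmented generation (RAG) technique mentioned above can be sketched minimally: ground the model's answer in retrieved passages rather than parametric memory alone. The toy corpus, the word-overlap scorer, and the function names below are illustrative assumptions, not taken from the paper; the actual LLM call is omitted.

```python
# Minimal RAG sketch: retrieve relevant passages, then build a
# grounded prompt for the model. Scoring by word overlap is a toy
# stand-in for a real embedding-based retriever.

def retrieve(query, corpus, k=2):
    """Rank passages by word overlap with the query (toy scorer)."""
    q_words = set(query.lower().split())
    scored = sorted(
        corpus,
        key=lambda p: len(q_words & set(p.lower().split())),
        reverse=True,
    )
    return scored[:k]

def build_prompt(query, passages):
    """Assemble a prompt that instructs the model to stay grounded."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )

corpus = [
    "Llama 2 was released by Meta in July 2023.",
    "Papers with Code tracks machine learning papers and benchmarks.",
    "RAG conditions generation on retrieved documents.",
]
query = "What does RAG condition generation on?"
prompt = build_prompt(query, retrieve(query, corpus))
```

Real systems replace the overlap scorer with dense retrieval over an indexed corpus, but the shape of the pipeline — retrieve, then condition generation on the results — is the same.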
Matt Clifford (@matthewclifford) 's Twitter Profile Photo

Excellent thoughts from Nabeel S. Qureshi on making the most of Twitter (which is still one of the very best things about the internet) nabeelqu.co/twitter

Yann LeCun (@ylecun) 's Twitter Profile Photo

Like Andrew Ng, I have observed a definite shift in the prevalent discourse about AI at Davos:
- Few people still talk about existential risk, and few people believe that current technology, even scaled up, will present an existential risk.
- Everyone agrees that open source AI

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Today we’re releasing Code Llama 70B: a new, more performant version of our LLM for code generation — available under the same license as previous Code Llama models.

Download the models ➡️ bit.ly/3Oil6bQ
• CodeLlama-70B
• CodeLlama-70B-Python
• CodeLlama-70B-Instruct
Grégoire Mialon (@mialon_gregoire) 's Twitter Profile Photo

I am recruiting a PhD intern to work with me at AI at Meta (GenAI) on LLMs, tool use, and/or the GAIA benchmark. Ideal profile: near the end of your PhD, willing to be based in Paris, starting in May/June 2024. DM if you’re interested!

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3

Robert McHardy 🏖️ (@robert_mchardy) 's Twitter Profile Photo

🚨 Are We Done with MMLU? 

In our new paper "Are We Done with MMLU?" we identify errors in MMLU and find that some subsets are riddled with errors. We propose MMLU-Redux with 3,000 re-annotated questions across 30 subjects.

📜 arxiv.org/abs/2406.04127

(1/n)
Joelle Pineau (@jpineau1) 's Twitter Profile Photo

I’m excited to share a few things we’re releasing today at Meta FAIR. These new AI model and dataset releases are part of our longstanding commitment to open science and I look forward to sharing even more work like this from the brilliant minds at FAIR! ai.meta.com/blog/meta-fair…

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

The team worked really hard to make history, voila finally the Llama-3.1 herd of models...have fun with it!
  * open 405B, insane 70B
  * 128K context length, improved reasoning & coding capabilities
  * detailed paper ai.meta.com/research/publi…
hardmaru (@hardmaru) 's Twitter Profile Photo

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 👩‍🔬 arxiv.org/abs/2408.06292 It’s common for AI researchers to joke amongst themselves that “now all we need to do is figure out how to make AI write the papers for us!” but I think we’re now getting there!

Jesse Dodge (@jessedodge) 's Twitter Profile Photo

Congrats to our team for winning two paper awards at #ACL2024!

OLMo won the Best Theme Paper award, and Dolma won a Best Resource Paper award!

All the credit goes to the whole team for the massive group effort 🎉🎉
Kevin Stone (@kevinleestone) 's Twitter Profile Photo

Proud to release o1-preview to the world.

Now that we have started to crack the challenge of getting models to “think” we are able to get large improvements on complex tasks by just letting them think harder.
Gabriel Synnaeve (@syhw) 's Twitter Profile Photo

Reinforcement learning with execution feedback (RLEF). Lots of sweat went into this one, but what works in principle works in practice: for code generation we can turn compute into training data: arxiv.org/abs/2410.02089 This works for LLMs, but will lead to world models.
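The "turn compute into training data" idea above can be sketched as follows: sample candidate programs, execute them against tests, and keep the passing ones as reward signal or new training examples. The hard-coded candidates below stand in for model samples, and the RL update itself is omitted; none of this is the paper's actual implementation.

```python
# Sketch of execution feedback for code generation: run each
# candidate program against unit tests; passing candidates earn
# reward 1.0 and become training data.

def passes_tests(src, tests):
    """Execute candidate source defining f(), then check (args, expected) pairs."""
    ns = {}
    try:
        exec(src, ns)
        return all(ns["f"](*args) == expected for args, expected in tests)
    except Exception:
        return False  # crashes and wrong definitions both score zero

# Stand-ins for sampled model outputs for the spec "f(x) = x squared".
candidates = [
    "def f(x): return x * 2",   # wrong
    "def f(x): return x ** 2",  # correct
]
tests = [((3,), 9), ((4,), 16)]

# Passing candidates are kept; in RLEF the pass/fail signal would
# drive a reinforcement-learning update instead.
training_data = [src for src in candidates if passes_tests(src, tests)]
```

The key property is that generating and executing more candidates costs only compute, yet every verified-correct sample is new supervised data — which is what lets compute be traded for training signal.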

Ross Taylor (@rosstaylor90) 's Twitter Profile Photo

(🎥45m) The Hitchhiker's Guide to Reasoning

A talk about LLM reasoning, covering various methods, core problems, and future research directions!

Covering topics such as:

🔍 The missing intermediates problem: how pre-training models are hobbled by missing reasoning steps, and
Roberta Raileanu (@robertarail) 's Twitter Profile Photo

Super excited to share 🧠MLGym 🦾 – the first Gym environment for AI Research Agents 🤖🔬 We introduce MLGym and MLGym-Bench, a new framework and benchmark for evaluating and developing LLM agents on AI research tasks. The key contributions of our work are: 🕹️ Enables the

Ross Taylor (@rosstaylor90) 's Twitter Profile Photo

🎉 Excited to release General Reasoning: a new community resource for building open reasoning models. We’re looking to make personal, open reasoners a reality. Starting with a small step in that direction today! Read the thread in the quote tweet for details, or my personal

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Today is the start of a new era of natively multimodal AI innovation.

Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick — our most advanced models yet and the best in their class for multimodality.

Llama 4 Scout
• 17B-active-parameter model
Ana Lučić (@__alucic) 's Twitter Profile Photo

Very excited that Aurora has been published in Nature: nature.com/articles/s4158…. Paper, code and model weights are all open and ready to be built on🚀

Roberta Raileanu (@robertarail) 's Twitter Profile Photo

I’m building a new team at Google DeepMind to work on Open-Ended Discovery! We’re looking for strong Research Scientists and Research Engineers to help us push the frontier of autonomously discovering novel artifacts such as new knowledge, capabilities, or algorithms, in an

Julien Chaumond (@julien_c) 's Twitter Profile Photo

BREAKING:

we've partnered with AI at Meta and Papers with Code to build a successor to Papers with Code (which was sunsetted yesterday)

PWC, founded by Robert Stojnic and Ross Taylor, has been an invaluable resource for AI scientists and engineers over the years (and an inspiration