iislucas (Lucas Dixon) (@iislucas) Twitter Tweets • TwiCopy

Ian Tenney

@iftenney

2 years ago

🧵(1/6): Excited to announce the v1.0 release of the Google AI Learning Interpretability Tool (🔥LIT), an interactive platform to debug, validate, and understand ML model behavior. This release brings exciting new features and a simplified Python API. pair-code.github.io/lit

Likes: 161 · Replies: 4 · Reposts: 35

Peter Hase

@peterbhase

2 years ago

Happy to share that this paper was accepted with a Spotlight at #NeurIPS2023! We updated the arXiv with results showing the disconnect between knowledge localization and editing success across different neuron ablations, editing methods, editing metrics, models, and datasets.⬇️

Likes: 80 · Replies: 1 · Reposts: 19

Jeff Dean

@jeffdean

2 years ago

I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks,

Likes: 12,12K · Replies: 260 · Reposts: 2,2K

Dan Friedman

@danfriedman0

2 years ago

We often interpret neural nets by studying simplified representations (e.g. low-dim visualization). But how faithful are these simplifications to the original model? In our new preprint, we found some surprising "interpretability illusions"... 1/6

Likes: 283 · Replies: 3 · Reposts: 47

Asma Ghandeharioun

@ghandeharioun

2 years ago

🧵Can we “ask” an LLM to “translate” its own hidden representations into natural language? We propose 🩺Patchscopes, a new framework for decoding specific information from a representation by “patching” it into a separate inference pass, independently of its original context. 1/9

Likes: 765 · Replies: 15 · Reposts: 147

Geoffrey Cideron

@cdrgeo

a year ago

Happy to introduce our paper MusicRL, the first music generation system finetuned with human preferences. Paper link: arxiv.org/abs/2402.04229

Likes: 80 · Replies: 2 · Reposts: 29

Ian Tenney

@iftenney

a year ago

Super excited for the Gemma model release, and with it a new debugging tool we built on 🔥LIT - use gradient-based salience to debug and refine complex LLM prompts! ai.google.dev/responsible/mo…

Likes: 13 · Replies: 1 · Reposts: 4

Adam Roberts

@ada_rob

a year ago

I love music most when it’s live, in the moment, and expressing something personal. This is why I’m psyched about the new “DJ mode” we developed for MusicFX: aitestkitchen.withgoogle.com/tools/music-fx… It’s an infinite AI jam that you control 🎛️. Try mixing your unique 🌀 of instruments, genres,

Likes: 440 · Replies: 48 · Reposts: 103

Google AI

@googleai

a year ago

Being able to interpret an #ML model’s hidden representations is key to understanding its behavior. Today we introduce Patchscopes, an approach that trains #LLMs to provide natural language explanations of their own hidden representations. Learn more → goo.gle/4aS5epd

Likes: 1,1K · Replies: 32 · Reposts: 349

Armand Joulin

@armandjoulin

a year ago

Gemma 2 27B is now the best open model while being 2.5x smaller than alternatives! This validates the work done by the team and Gemini. This is just the beginning 💙♊️

Likes: 213 · Replies: 7 · Reposts: 33

kyutai

@kyutai_labs

a year ago

Join us live tomorrow at 2:30pm CET for some exciting updates on our research! youtube.com/live/hm2IJSKcY…

Likes: 246 · Replies: 13 · Reposts: 38

Google AI

@googleai

a year ago

Can large language models (LLMs) explain their internal mechanisms? Check out the latest AI Explorable on Patchscopes, an inspection framework that uses LLMs to explain the hidden representations of LLMs. Learn more → goo.gle/patchscopes

Likes: 576 · Replies: 18 · Reposts: 148

Google DeepMind

@googledeepmind

a year ago

We’re welcoming a new 2 billion parameter model to the Gemma 2 family. 🛠️ It offers best-in-class performance for its size and can run efficiently on a wide range of hardware. Developers can get started with 2B today → dpmd.ai/4d0MKEH

Likes: 1,1K · Replies: 34 · Reposts: 313

Asma Ghandeharioun

@ghandeharioun

a year ago

🧵Responses to adversarial queries can still remain latent in a safety-tuned model. Why are they revealed sometimes, but not others? And what are the mechanics of this latent misalignment? Does it matter *who* the user is? (1/n)

Likes: 61 · Replies: 1 · Reposts: 10

Adam Roberts

@ada_rob

9 months ago

I’m so proud of the updated version of #MusicFXDJ we developed in collaboration with Jacob Collier, available today at labs.google/musicfx. Over the past year I’ve spent countless hours experimenting with our real-time music models, and it feels like I’ve learned to play a

Likes: 192 · Replies: 7 · Reposts: 35

Sohee Yang

@soheeyang_

8 months ago

🚨 New Paper 🚨 Can LLMs perform latent multi-hop reasoning without exploiting shortcuts? We find the answer is yes – they can recall and compose facts not seen together in training or guessing the answer, but success greatly depends on the type of the bridge entity (80%+ for

Likes: 192 · Replies: 7 · Reposts: 46

Jeff Dean

@jeffdean

8 months ago

What a way to celebrate one year of incredible Gemini progress -- #1🥇across the board on overall ranking, as well as on hard prompts, coding, math, instruction following, and more, including with style control on. Thanks to the hard work of everyone in the Gemini team and

Likes: 1,1K · Replies: 90 · Reposts: 314

Tyler Chang

@tylerachang

7 months ago

We scaled training data attribution (TDA) methods ~1000x to find influential pretraining examples for thousands of queries in an 8B-parameter LLM over the entire 160B-token C4 corpus! medium.com/people-ai-rese…

Likes: 128 · Replies: 1 · Reposts: 20

Arthur Conmy

@arthurconmy

5 months ago

We are hiring Applied Interpretability researchers on the GDM Mech Interp Team!🧵 If interpretability is ever going to be useful, we need it to be applied at the frontier. Come work with Neel Nanda, the Google DeepMind AGI Safety team, and me: apply by 28th February as a

Likes: 283 · Replies: 2 · Reposts: 35

Alexander Chen

@alexanderchen

a month ago

Veo holograms 🦝⚡️ Visualizing animal superpowers! Just discovered Veo 3's amazing ability to render 3d holograms. Virtual interfaces within the simulated world. 🔊 Prompts in 🧵

Likes: 19 · Replies: 2 · Reposts: 4