Xiang Lisa Li (@xianglisali2)'s Twitter Profile
Xiang Lisa Li

@xianglisali2

PhD student at Stanford

ID: 1134226884818919425

Joined: 30-05-2019 22:35:37

40 Tweets

3.3K Followers

239 Following

Weijia Shi (@weijiashi2)'s Twitter Profile Photo

🙋‍♀️How to present the same text in diff. tasks/domains as diff. embeddings W/O training?

We introduce Instructor👨‍🏫, an instruction-finetuned embedder that can generate text embeddings tailored to any task given the task instruction➡️sota on 7⃣0⃣tasks👇!

instructor-embedding.github.io
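The real Instructor model ships with its own library; purely to illustrate the interface the tweet describes (an embedder conditioned on both a task instruction and the text), here is a toy stand-in. The hashing "embedder" is entirely hypothetical and has nothing to do with the actual model; it only shows that the same text yields different embeddings under different instructions.

```python
import hashlib
import math

def toy_embed(instruction: str, text: str, dim: int = 8) -> list[float]:
    """Toy stand-in for an instruction-conditioned embedder: the vector
    depends on BOTH the task instruction and the text, so the same text
    maps to a different embedding per task."""
    digest = hashlib.sha256(f"{instruction}||{text}".encode()).digest()
    vec = [b / 255.0 for b in digest[:dim]]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]  # unit-normalized, as embedders typically are

text = "transformers process tokens in parallel"
e_retrieval = toy_embed("Represent the sentence for retrieval:", text)
e_cluster = toy_embed("Represent the sentence for clustering:", text)

# Same text, different instructions -> different task-tailored embeddings.
assert e_retrieval != e_cluster
```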
Mina Lee (@minalee__)'s Twitter Profile Photo

Language models (LMs) are already deployed in many real-world applications and used to interact with users 👩‍🦰, but these models are primarily evaluated non-interactively.
How can we evaluate LMs interactively and why is it important? (1/8)
Omar Khattab (@lateinteraction)'s Twitter Profile Photo

Introducing Demonstrate–Search–Predict (𝗗𝗦𝗣), a framework for composing search and LMs w/ up to 120% gains over GPT-3.5.

No more prompt engineering.❌

Describe a high-level strategy as imperative code and let 𝗗𝗦𝗣 deal with prompts and queries.🧵

arxiv.org/abs/2212.14024
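As a rough sketch of the control flow the tweet describes, not the real DSP library, the snippet below writes the high-level strategy as ordinary imperative code; the corpus, retriever, and prompt-assembly step are all hypothetical stand-ins for the framework's components.

```python
# Toy Demonstrate-Search-Predict control flow: the "strategy" is plain
# imperative code, and stand-in functions play the framework's roles.

CORPUS = [
    "The Eiffel Tower is in Paris, France.",
    "Mount Fuji is the tallest mountain in Japan.",
]

def search(query: str) -> str:
    """Stand-in retriever: pick the passage sharing the most words with the query."""
    words = set(query.lower().split())
    return max(CORPUS, key=lambda p: len(words & set(p.lower().split())))

def answer(question: str, demos: list[tuple[str, str]]) -> str:
    """The strategy itself: Demonstrate -> Search -> Predict."""
    demo_block = "\n".join(f"Q: {q}\nA: {a}" for q, a in demos)  # Demonstrate
    passage = search(question)                                   # Search
    # Predict: in the real framework, an LM completes this assembled prompt.
    return f"{demo_block}\nContext: {passage}\nQ: {question}\nA:"

demos = [("Where is the Louvre?", "Paris, France.")]
prompt = answer("Where is the Eiffel Tower?", demos)
assert "Eiffel Tower is in Paris" in prompt
```

The point mirrored here is the division of labor: the programmer states *what* happens in which order, while prompt and query construction stay behind function boundaries.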
Jesse Mu (@jayelmnop)'s Twitter Profile Photo

Prompting is cool and all, but isn't it a waste of compute to encode a prompt over and over again? We learn to compress prompts up to 26x by using "gist tokens", saving memory+storage and speeding up LM inference: arxiv.org/abs/2304.08467 (w/ Xiang Lisa Li and noahdgoodman) 🧵

Kelvin Guu (@kelvin_guu)'s Twitter Profile Photo

New from Google DeepMind: When can you trust your LLM? We show that LLMs consistently overestimate their own accuracy on some topics (eg nutrition) while underestimating it on others (eg math). Our Few-shot Recalibrator fixes LLM over/under-confidence: arxiv.org/abs/2403.18286 🧵
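The paper's Few-shot Recalibrator is its own method; purely to make the over/under-confidence framing concrete, here is a hypothetical per-topic recalibration sketch: estimate each topic's average gap between stated confidence and observed accuracy from a few labeled examples, then subtract it from future confidences.

```python
# Toy per-topic recalibration (NOT the paper's method): learn an average
# confidence-accuracy gap per topic from a few examples, then correct it.
from collections import defaultdict

def fit_offsets(examples):
    """examples: list of (topic, stated_confidence, was_correct)."""
    sums = defaultdict(lambda: [0.0, 0])
    for topic, conf, correct in examples:
        gap = conf - (1.0 if correct else 0.0)  # + = overconfident, - = underconfident
        sums[topic][0] += gap
        sums[topic][1] += 1
    return {t: s / n for t, (s, n) in sums.items()}

def recalibrate(topic, conf, offsets):
    # Remove the topic's average over/under-confidence, then clamp to [0, 1].
    return min(1.0, max(0.0, conf - offsets.get(topic, 0.0)))

few_shot = [
    ("nutrition", 0.9, False), ("nutrition", 0.8, False),  # overconfident topic
    ("math", 0.6, True), ("math", 0.5, True),              # underconfident topic
]
offsets = fit_offsets(few_shot)
assert recalibrate("nutrition", 0.9, offsets) < 0.9  # confidence pulled down
assert recalibrate("math", 0.6, offsets) > 0.6       # confidence pushed up
```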

Chunting Zhou (@violet_zct)'s Twitter Profile Photo

Introducing *Transfusion* - a unified approach for training models that can generate both text and images. arxiv.org/pdf/2408.11039

Transfusion combines language modeling (next token prediction) with diffusion to train a single transformer over mixed-modality sequences. This
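To make the combination the tweet describes concrete, here is a shapes-only sketch of a mixed-modality training objective: cross-entropy at text positions plus a diffusion-style denoising loss at image-patch positions. The shapes, the balancing coefficient `lam`, and both loss stand-ins are illustrative assumptions, not the paper's actual setup.

```python
# Toy mixed-modality objective: next-token loss on text positions plus a
# denoising (MSE) loss on image-patch positions, summed into one loss.
import numpy as np

rng = np.random.default_rng(0)

def lm_loss(logits, targets):
    """Cross-entropy over text tokens (next-token prediction)."""
    probs = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)
    return -np.mean(np.log(probs[np.arange(len(targets)), targets]))

def diffusion_loss(predicted_noise, true_noise):
    """MSE between predicted and true noise on image patches."""
    return np.mean((predicted_noise - true_noise) ** 2)

# One mixed-modality "sequence": 5 text positions + 4 image-patch positions.
logits = rng.standard_normal((5, 10))      # model outputs at text positions
targets = rng.integers(0, 10, size=5)      # next-token targets
pred_noise = rng.standard_normal((4, 16))  # model outputs at image positions
true_noise = rng.standard_normal((4, 16))  # noise added during diffusion

lam = 5.0  # hypothetical balancing coefficient
total = lm_loss(logits, targets) + lam * diffusion_loss(pred_noise, true_noise)
assert total > 0
```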
Transluce (@transluceai)'s Twitter Profile Photo

Eliciting Language Model Behaviors with Investigator Agents

We train AI agents to help us understand the space of language model behaviors, discovering new jailbreaks and automatically surfacing a diverse set of hallucinations.

Full report: transluce.org/automated-elic…
Xiang Lisa Li (@xianglisali2)'s Twitter Profile Photo

Can we get language models to exhibit certain behaviors? We train investigator models to elicit target behaviors from LMs, which helps us proactively detect harmful responses and hallucination!

Percy Liang (@percyliang)'s Twitter Profile Photo

This year, I have 4 exceptional students on the academic job market, and they couldn’t be more different, with research spanning AI policy, robotics, NLP, and HCI. Here’s a brief summary of their research, along with one representative work each:

Percy Liang (@percyliang)'s Twitter Profile Photo

Lisa Li (Xiang Lisa Li) changes how people fine-tune (prefix tuning, the original PEFT), generate (diffusion LM, non-autoregressively), improve (GV consistency fine-tuning without supervision), and evaluate language models (using LMs). Prefix tuning: arxiv.org/abs/2101.00190
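Purely as a sketch of the prefix-tuning idea mentioned above (the core of the actual method, but not its implementation): the pretrained weights stay frozen, and the only trainable parameters are a few "prefix" key/value vectors prepended to each attention layer's keys and values. All dimensions below are toy choices.

```python
# Minimal numpy sketch of prefix tuning: frozen attention projections,
# trainable prefix keys/values prepended at attention time.
import numpy as np

rng = np.random.default_rng(0)
d, seq, n_prefix = 4, 3, 2

# Frozen pretrained projections (would not receive gradients).
W_q, W_k, W_v = (rng.standard_normal((d, d)) for _ in range(3))
# The ONLY trainable parameters: prefix keys and values.
prefix_k = rng.standard_normal((n_prefix, d))
prefix_v = rng.standard_normal((n_prefix, d))

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention_with_prefix(x):
    q = x @ W_q
    k = np.vstack([prefix_k, x @ W_k])  # prefix prepended to keys
    v = np.vstack([prefix_v, x @ W_v])  # ...and to values
    scores = softmax(q @ k.T / np.sqrt(d))
    return scores @ v

x = rng.standard_normal((seq, d))
out = attention_with_prefix(x)
assert out.shape == (seq, d)  # prefix steers attention without changing shapes
```

This is why the method is parameter-efficient: per layer, only `n_prefix * d * 2` numbers are tuned while the transformer itself is untouched.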

John Hewitt (@johnhewtt)'s Twitter Profile Photo

I’m hiring PhD students in computer science at Columbia! Our lab will tackle core challenges in understanding and controlling neural models that interact with language. For example:
- methods for LLM control
- discoveries of LLM properties
- pretraining for understanding

Percy Liang (@percyliang)'s Twitter Profile Photo

When Xiang Lisa Li built diffusion LMs in 2022 (arxiv.org/abs/2205.14217), we were interested in more powerful controllable generation (inference-time conditioning on an arbitrary reward), but inference was slow. Interestingly, the main advantage now is speed. Impressive to see

Percy Liang (@percyliang)'s Twitter Profile Photo

What would truly open-source AI look like? Not just open weights, open code/data, but *open development*, where the entire research and development process is public *and* anyone can contribute. We built Marin, an open lab, to fulfill this vision: