Yulin Chen (@yulinchen99)'s Twitter Profile
Yulin Chen

@yulinchen99

PhD Student at @nyuniversity @CILVRatNYU | Previously @TsinghuaNLP

ID: 1473167814131593221

Link: https://yulinchen99.github.io/ | Joined: 21-12-2021 05:46:16

16 Tweets

177 Followers

157 Following

Uri Alon (@urialon1)'s Twitter Profile Photo


A new preprint 📢
arxiv.org/pdf/2301.02828…

K-nearest neighbors language models (kNN-LMs; Urvashi Khandelwal (@ukhndlwl) et al., ICLR 2020) improve the perplexity of standard LMs, even when they retrieve examples from the *same training set that the base LM was trained on*.

but why?

(1/3)
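
For readers outside the thread: a minimal sketch of the kNN-LM interpolation the tweet refers to (Khandelwal et al., ICLR 2020). The base LM's next-token distribution is mixed with a distribution built from the k nearest stored training contexts. The function name and the L2-distance, softmax, k, and lambda choices below are illustrative assumptions, not the preprint's settings.

import numpy as np

def knn_lm_probs(p_lm, query, keys, values, vocab_size, k=8, lam=0.25):
    """Sketch of kNN-LM interpolation (illustrative defaults, not the paper's).

    p_lm:   base LM next-token distribution, shape (vocab_size,)
    query:  context representation for the current step, shape (d,)
    keys:   datastore of context representations, shape (N, d)
    values: next-token id stored with each key, shape (N,), integer dtype
    """
    # Retrieve the k closest datastore entries by L2 distance.
    dists = np.linalg.norm(keys - query, axis=1)
    nn = np.argsort(dists)[:k]

    # Softmax over negative distances gives a distribution over retrieved tokens.
    logits = -dists[nn]
    w = np.exp(logits - logits.max())
    w /= w.sum()

    p_knn = np.zeros(vocab_size)
    for weight, tok in zip(w, values[nn]):
        p_knn[tok] += weight

    # kNN-LM prediction: lambda * p_kNN + (1 - lambda) * p_LM.
    return lam * p_knn + (1.0 - lam) * p_lm

The question the preprint takes up is why this helps even when the keys and values come from the very training set the base LM was trained on.
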
Ning Ding (@stingning)'s Twitter Profile Photo

Good work! We also released a paper on UltraFuser and UltraChat 2 in a similar spirit: fusing highly specialized experts can be effective. See github.com/thunlp/UltraCh…. 🤗

Owain Evans (@owainevans_uk)'s Twitter Profile Photo


Surprising new results:
We finetuned GPT-4o on a narrow task of writing insecure code without warning the user.
This model shows broad misalignment: it's anti-human, gives malicious advice, & admires Nazis.

This is *emergent misalignment* & we cannot fully explain it 🧵

Yulin Chen (@yulinchen99)'s Twitter Profile Photo

We're excited to see such wide attention from the community. Thank you for your support! We have released the code, trained probes, and the generated CoT data 👇 github.com/AngelaZZZ-611/… Labeled answer data is on the way. Stay tuned!

John (Yueh-Han) Chen (@jcyhc_ai)'s Twitter Profile Photo


LLMs won’t tell you how to make fake IDs—but will reveal the layouts/materials of IDs and make realistic photos if asked separately.

💥Such decomposition attacks reach 87% success across QA, text-to-image, and agent settings!
🛡️Our monitoring method defends with 93% success! 🧵