Siyuan Wang (@siyuan___wang) 's Twitter Profile
Siyuan Wang

@siyuan___wang

Research @nlp_usc. Previously: MS & BS @FudanUni.

ID: 1204686751962292224

linkhttps://siyuanwangw.github.io/ calendar_today11-12-2019 08:58:04

2 Tweet

16 Followers

79 Following

Huihan Li 🛩️ ICLR 2025 (@huihan_li) 's Twitter Profile Photo

Feeling hard generating challenging evaluation data for LLMs? Check our work👇! Introducing LINK🔗, the first framework for systematically generating data in the long-tail distribution, guided by symbolic rules arxiv.org/abs/2311.07237 w/USC NLP MOSAIC 🧵⬇️ #NLProc [1/n]

Feeling hard generating challenging evaluation data for LLMs? Check our work👇!

Introducing LINK🔗, the first framework for systematically generating data in the long-tail distribution, guided by symbolic rules

arxiv.org/abs/2311.07237
w/<a href="/nlp_usc/">USC NLP</a> <a href="/ai2_mosaic/">MOSAIC</a> 🧵⬇️
#NLProc 

[1/n]
Zhiyuan Zeng (@zhiyuanzeng_) 's Twitter Profile Photo

Is a single accuracy number all we can get from model evals?🤔 🚨Does NOT tell where the model fails 🚨Does NOT tell how to improve it Introducing EvalTree🌳 🔍identifying LM weaknesses in natural language 🚀weaknesses serve as actionable guidance (paper&demo 🔗in🧵) [1/n]