
Ting-En Lin
@tnlin_tw
Research scientist at Alibaba Tongyi Lab, focusing on self-evolving LLMs aimed at AGI. @Tsinghua_Uni alum.
ID: 1331994524
https://tnlin.github.io/ 06-04-2013 16:35:26
186 Tweet
216 Followers
450 Following


Excited to share what I did Sierra with Noah Shinn pedram and Karthik Narasimhan ! 𝜏-bench evaluates critical agent capabilities omitted by current benchmarks: robustness, complex rule following, and human interaction skills. Try it out!






❗ New Paper❗ 📄 In '23, we proposed LLM-as-judge for NLP research 🤔 Any real-world applications? 💯 Now, we use LLM as an automatic assignment evaluator in a course with 1000+ students at National Taiwan University, led by Hung-yi Lee (李宏毅) with me as a TA 🔗 arxiv.org/abs/2407.05216









Recent updates on verl project (RL lib for LLMs): Engine: - Megatron qwen & GRPO support, v0.11 upgrade - vllm v0.7 integration with v1 mode - experimental sglang integration Algorithm & recipes: - vision language reasoning with qwen2.5-vl - PRIME, RLOO, remax, math-verify