Danny To Eun Kim (@teknology.bsky.social) (@teknologyy) Twitter Tweets • TwiCopy

Danny To Eun Kim (@teknology.bsky.social)

@teknologyy

+ Follow

PhD student @LTIatCMU working with @841io on NLP & IR | Prev: MEng @ai_ucl

ID: 1422776859037618176

linkhttps://kimdanny.github.io/ calendar_today04-08-2021 04:30:30

213 Tweet

488 Followers

1,1K Following

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

TODAY it is time for the Algorithmic Fairness Workshop #AFME2024 at #NeurIPS2024! 📍West Meeting 111-112! Excited for our 5 contributed spotlight talks today! Benjamin Laufer Danny To Eun Kim (@teknology.bsky.social) Alex Tamkin Prakhar Ganesh Natalie Mackraz

thumb_up_off_alt9

chat_bubble_outline0

repeat4

shareShare

AFME 2024 @ NeurIPS

@afciworkshop

7 months ago

In our second contributed talk, Danny To Eun Kim (@teknology.bsky.social) presents “Towards Fair RAG: The Impact of Fair Ranking in Retrieval-Augmented Generation.” #AFME2024 #NeurIPS2024

In our second contributed talk, <a href="/TEKnologyy/">Danny To Eun Kim (@teknology.bsky.social)</a> presents “Towards Fair RAG: The Impact of Fair Ranking in Retrieval-Augmented Generation.” #AFME2024 #NeurIPS2024

thumb_up_off_alt9

chat_bubble_outline0

repeat3

shareShare

So Yeon (Tiffany) Min on Industry Job Market

@soyeontiffmin

5 months ago

🚨🚨 Preprint Alert 🚨🚨 🚀🚀 As AI become agents 🤖, how can we reliably delegate tasks to them, if they cannot communicate their limitations😭 or ask for help or test-time compute 🧑‍🚒 when needed? We present our new pre-print **Self-Regulation and Requesting Interventions**

thumb_up_off_alt108

chat_bubble_outline1

repeat39

shareShare

Yiqing Xie

@yiqingxienlp

4 months ago

How to construct repo-level coding environments in a scalable way? Checkout RepoST: an automated framework to construct repo-level environments using Sandbox Testing (repost-code-gen.github.io) Models trained with RepoST data can generalize well to other datasets (e.g., RepoEval)

thumb_up_off_alt83

chat_bubble_outline3

repeat20

shareShare

Seungone Kim @ NAACL2025

@seungonekim

4 months ago

#NLProc New paper on "evaluation-time scaling", a new dimension to leverage test-time compute! We replicate the test-time scaling behaviors observed in generators (e.g., o1, r1, s1) with evaluators by enforcing to generate additional reasoning tokens. arxiv.org/abs/2503.19877

thumb_up_off_alt171

chat_bubble_outline2

repeat37

shareShare

Fernando Diaz

@841io

3 months ago

If you're interested in OpenAI including shopping results, you might also be interested in Danny To Eun Kim (@teknology.bsky.social)'s paper relating retrieval diversity/fairness and generation by downstream RAG models. This has implications for individuals selling products online. arxiv.org/abs/2409.11598

thumb_up_off_alt18

chat_bubble_outline0

repeat4

shareShare

Fernando Diaz

@841io

3 months ago

OpenAI x.com/841io/status/1…

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare

Athiya Deviyani

@athiyad

3 months ago

Ever trusted a metric that works great on average, only for it to fail in your specific use case? In our #NAACL2025 paper (w/ Fernando Diaz), we show why global evaluations are not enough and why context matters more than you think. 📄 aclanthology.org/2025.findings-… #NLP #Evaluation (🧵1/9)

Ever trusted a metric that works great on average, only for it to fail in your specific use case?

In our #NAACL2025 paper (w/ <a href="/841io/">Fernando Diaz</a>), we show why global evaluations are not enough and why context matters more than you think.
📄 aclanthology.org/2025.findings-…
#NLP #Evaluation
(🧵1/9)

thumb_up_off_alt61

chat_bubble_outline1

repeat11

shareShare

Bhaskar Mitra | ভাস্কর মিত্র

@underdoggeek

3 months ago

Please share the word and register to participate! #TREC2025ToT

thumb_up_off_alt9

chat_bubble_outline0

repeat3

shareShare

Shaily

@shaily99

2 months ago

🖋️ Curious how writing differs across (research) cultures? 🚩 Tired of “cultural” evals that don't consult people? We engaged with researchers to identify & measure ✨cultural norms✨in scientific writing, and show that❗LLMs flatten them❗ 📜 arxiv.org/abs/2506.00784 1/11

thumb_up_off_alt81

chat_bubble_outline2

repeat16

shareShare