
Venelin Kovatchev
@sintelion
Natural Language Processing and Computational Linguistics
Assist. Prof. @unibirmingham @uobcompsci
ID: 900332336407097344
http://vkovatchev.com/ 23-08-2017 12:22:11
282 Tweet
267 Followers
377 Following


Anubrata Das 🦋 @ NAACL 2025 giving encore talk on #acl2022nlp paper ProtoTex: Explaining Model Decisions w/ Prototype Tensors (utexas.box.com/v/das-acl-2022) @ iSchools European iSchools Doctoral Seminar on 16 Sept 15:00 CEST on zoom lnu-se.zoom.us/j/61188566292?…. Good Systems School of Information - UT Austin @engagingnews.bsky.social Venelin Kovatchev Jessy Li

New dataset coming COLING 2022 InferES - an NLI corpus for European Spanish feat. contrastive and adversarial examples based on negation and coreference Data and code (in progress): github.com/venelink/infer… Arxiv: arxiv.org/abs/2210.03068

Join our 1st annual Disinformation Day #disinfoday online 10/26/22 by our Good Systems 6-year project "Designing Responsible AI Technologies to Curb Disinformation": sites.google.com/view/ut-misinf…. Watch live and join Q&A or view recordings later. @engagingnews.bsky.social School of Information - UT Austin The Austin Forum

Anubrata Das 🦋 @ NAACL 2025 presenting "The Need for Human-centered Design in Fact-checking Research" (utexas.box.com/v/das-ipmc2022, with Venelin Kovatchev & Houjiang Liu) Oct 20th at 9am GMT @ the Info Processing & Management Conf. #IPMC2022 elsevier.com/events/confere… Good Systems School of Information - UT Austin @engagingnews.bsky.social


🚨Help NLP models process negation🚨 Introducing ℂ𝕆ℕ𝔻𝔸ℚ𝔸, a *contrastive* reading comprehension dataset that requires reasoning about negation w/ Matt Gardner & Ana Marasović AllenNLP, at #EMNLP2022 📝Paper arxiv.org/abs/2211.00295 🚀Data github.com/AbhilashaRavic… [1/8]
![Abhilasha Ravichander (@lasha_nlp) on Twitter photo 🚨Help NLP models process negation🚨
Introducing ℂ𝕆ℕ𝔻𝔸ℚ𝔸, a *contrastive* reading comprehension dataset that requires reasoning about negation
w/ <a href="/nlpmattg/">Matt Gardner</a> & <a href="/anmarasovic/">Ana Marasović</a> <a href="/ai2_allennlp/">AllenNLP</a>, at #EMNLP2022
📝Paper arxiv.org/abs/2211.00295
🚀Data github.com/AbhilashaRavic… [1/8] 🚨Help NLP models process negation🚨
Introducing ℂ𝕆ℕ𝔻𝔸ℚ𝔸, a *contrastive* reading comprehension dataset that requires reasoning about negation
w/ <a href="/nlpmattg/">Matt Gardner</a> & <a href="/anmarasovic/">Ana Marasović</a> <a href="/ai2_allennlp/">AllenNLP</a>, at #EMNLP2022
📝Paper arxiv.org/abs/2211.00295
🚀Data github.com/AbhilashaRavic… [1/8]](https://pbs.twimg.com/media/FhDJ74lVQAABUjB.jpg)


New lit review: Anubrata Das 🦋 @ NAACL 2025, Venelin Kovatchev, & Houjiang Liu call for human-centered NLP for fact-checking: "The state of human-centered NLP technology for fact-checking." Information Processing & Management, 60(2), 2023 utexas.box.com/shared/static/… School of Information - UT Austin @engagingnews.bsky.social Good Systems

Embeddings t-SNE vizualizations of paraphrase datasets show: There are big differences in semantic balance. Left: ETPC (human) by Venelin Kovatchev Right: MPC (machine) by Jan Philip Wahle Research still lacks evenly distributed paraphrase datasets by machines! arxiv.org/pdf/2303.13989…








How much does data impact the evaluation of NLP models? How can we measure data distribution in an efficient and multi-dimensional way? How to predict OOD generalizability? Check our paper with Matt Lease - "Benchmark Transparency", accepted at NAACL (arxiv.org/abs/2404.00748)



