Ashish Sabharwal (@ashish_s_ai) Twitter Tweets • TwiCopy

Wenhao Yu

2 years ago

📢 Introducing IfQA - the first large-scale open-domain question answering (ODQA) dataset centered around counterfactual reasoning. Together with Meng Jiang and the Aristo Team at AI2! Paper link: arxiv.org/abs/2305.14010

Wenhao Yu

@wyu_nd

2 years ago

📢 Introducing ReFeed: a novel plug-and-play approach to enhance the factuality of large language models via retrieval feedback! Together with Meng Jiang, Zhihan Zhang, Zhenwen Liang, and the Aristo Team at AI2. Read more: arxiv.org/abs/2305.14002

Ashish Sabharwal

@ashish_s_ai

2 years ago

Introducing 𝗥𝗘𝗙𝗟𝗘𝗫: What does my LLM believe? 🧐 We show that we can add a 𝗿𝗮𝘁𝗶𝗼𝗻𝗮𝗹 𝗹𝗮𝘆𝗲𝗿 to an LLM to materialize its "belief graph", repair inconsistencies, and produce reasoning chains drawn from a now-consistent system of beliefs! arxiv.org/abs/2305.14250 #NLProc

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years ago

New paper: "Attentiveness to Answer Choices Doesn’t Always Entail High QA Accuracy" 📊💬 sarahwie.github.io/attentiveness.… Something I've been thinking a lot about recently is the relationship between distributions over vocabularies produced by language models and the various ways... 1/5

Ashish Sabharwal

@ashish_s_ai

2 years ago

ICYMI: Ben Brubaker wrote an eloquent Quanta article ✍️ covering our ICLR-2024 paper (w/ William Merrill 🚂ACL) on how the expressive power of transformers changes with the length of CoT! Recently updated paper 📜 at arxiv.org/abs/2310.07923

Ashish Sabharwal

@ashish_s_ai

2 years ago

Turns out SSMs like S4 and S6 don't quite get the best of both worlds -- sequential and parallel -- and struggle to track state just like Transformers. Excited to share the "Illusion of State" paper w/ William Merrill 🚂ACL and Jackson Petty! arxiv.org/abs/2404.08819

Ashish Sabharwal

@ashish_s_ai

2 years ago

Happy to share that this work will appear at NAACL-2024! Check out our recently updated version on arXiv at arxiv.org/abs/2311.09519

Ashish Sabharwal

@ashish_s_ai

a year ago

Excited to share AppWorld, our challenging new interactive coding environment and benchmark to push AI agents further! Super easy to use (`pip install...`), reliable, reproducible, realistic. Congratulations to Harsh Trivedi for the huge effort!!

Ai2

@allen_ai

a year ago

🥳The BIGGEST congratulations for our teams' recognition at #ACL2024! OLMo received the Best Theme Paper, Dolma + AppWorld received the Best Resource Paper, and "Political Compass or Spinning Arrow?" was honored with an Outstanding Paper Award.