Ashish Sabharwal (@ashish_s_ai) 's Twitter Profile
Ashish Sabharwal

@ashish_s_ai

ID: 1618315782299226112

calendar_today25-01-2023 18:32:20

11 Tweet

155 Followers

12 Following

Wenhao Yu (@wyu_nd) 's Twitter Profile Photo

๐Ÿ“ข Introducing IfQA - the first large-scale open-domain question answering (ODQA) dataset centered around counterfactual reasoning. Together with Meng Jiang Aristo Team at AI2! Paper link: arxiv.org/abs/2305.14010

๐Ÿ“ข Introducing IfQA - the first large-scale open-domain question answering (ODQA) dataset centered around counterfactual reasoning. Together with <a href="/Meng_CS/">Meng Jiang</a> <a href="/ai2_aristo/">Aristo Team at AI2</a>!

Paper link: arxiv.org/abs/2305.14010
Wenhao Yu (@wyu_nd) 's Twitter Profile Photo

๐Ÿ“ข Introducing ReFeed: a novel plug-and-play approach to enhance the factuality of large language models via retrieval feedback! Together with Meng Jiang Zhihan Zhang Zhenwen Liang Aristo Team at AI2 Read more: arxiv.org/abs/2305.14002

๐Ÿ“ข Introducing ReFeed: a novel plug-and-play approach to enhance the factuality of large language models via retrieval feedback! Together with <a href="/Meng_CS/">Meng Jiang</a> <a href="/zhihz0535/">Zhihan Zhang</a> <a href="/LiangZhenwen/">Zhenwen Liang</a> <a href="/ai2_aristo/">Aristo Team at AI2</a> 

Read more: arxiv.org/abs/2305.14002
Ashish Sabharwal (@ashish_s_ai) 's Twitter Profile Photo

Introducing ๐—ฅ๐—˜๐—™๐—Ÿ๐—˜๐—ซ: What does my LLM believe?๐ŸงWe show that we can add a ๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐—ฎ๐—น ๐—น๐—ฎ๐˜†๐—ฒ๐—ฟ to an LLM to materialize its "belief graph", repair inconsistencies & produce reasoning chains drawn from a now-consistent system of beliefs!ย arxiv.org/abs/2305.14250 #NLProc

Introducing ๐—ฅ๐—˜๐—™๐—Ÿ๐—˜๐—ซ: What does my LLM believe?๐ŸงWe show that we can add a ๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐—ฎ๐—น ๐—น๐—ฎ๐˜†๐—ฒ๐—ฟ to an LLM to materialize its "belief graph", repair inconsistencies &amp; produce reasoning chains drawn from a now-consistent system of beliefs!ย arxiv.org/abs/2305.14250 #NLProc
Sarah Wiegreffe (on faculty job market!) (@sarahwiegreffe) 's Twitter Profile Photo

New paper: "Attentiveness to Answer Choices Doesnโ€™t Always Entail High QA Accuracy" ๐Ÿ“Š๐Ÿ’ฌ sarahwie.github.io/attentiveness.โ€ฆ Something I've been thinking a lot about recently is the relationship between distributions over vocabularies produced by language models and the various ways... 1/5

New paper: "Attentiveness to Answer Choices Doesnโ€™t Always Entail High QA Accuracy" ๐Ÿ“Š๐Ÿ’ฌ
sarahwie.github.io/attentiveness.โ€ฆ

Something I've been thinking a lot about recently is the relationship between distributions over vocabularies produced by language models and the various ways... 1/5
Ashish Sabharwal (@ashish_s_ai) 's Twitter Profile Photo

ICYMI Ben Brubaker wrote an eloquent Quanta articleโœ๏ธcovering our ICLR-2024 paper (w/ William Merrill ๐Ÿš‚ACL) on how the expressive power of transformers changes with the length of CoT! Recently updated paper๐Ÿ“œat arxiv.org/abs/2310.07923

Ashish Sabharwal (@ashish_s_ai) 's Twitter Profile Photo

Turns out SSMs like S4 and S6 don't quite get the best of both worlds -- sequential and parallel -- and struggle to track state just like Transformers. Excited to share the "Illusion of State" paper w/ William Merrill ๐Ÿš‚ACL, Jackson Petty ! arxiv.org/abs/2404.08819

Ashish Sabharwal (@ashish_s_ai) 's Twitter Profile Photo

Happy to share that this work will appear at NAACL-2024! Check out our recently updated version on arXiv at arxiv.org/abs/2311.09519

Ashish Sabharwal (@ashish_s_ai) 's Twitter Profile Photo

Excited to share AppWorld, our challenging new interactive coding environment and benchmark to push AI agents further! Super easy to use (`pip install...`), reliable, reproducible, realistic. Congratulations to Harsh Trivedi for the huge effort!!

Ai2 (@allen_ai) 's Twitter Profile Photo

๐ŸฅณThe BIGGEST congratulations for our teams' recognition at #ACL2024! OLMo received the Best Theme Paper, Dolma + AppWorld received the Best Resource Paper, and "Political Compass or Spinning Arrow?" was honored with an Outstanding Paper Award.

๐ŸฅณThe BIGGEST congratulations for our teams' recognition at #ACL2024! OLMo received the Best Theme Paper, Dolma + AppWorld received the Best Resource Paper, and "Political Compass or Spinning Arrow?" was honored with an Outstanding Paper Award.