Shikhar (@shikharmurty) Twitter Tweets • TwiCopy

👨‍💻Tokenization errors in LLMs have the same vibe as off-by-one errors in software engineering. We develop and make progress on LLMs that can consume *bytes* directly (no tokenization needed!)

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories We are releasing the first benchmark to evaluate how well automatic evaluators, such as LLM judges, can evaluate web agent trajectories. We find that rule-based evals underreport success rates, and

thumb_up_off_alt230

chat_bubble_outline4

repeat100

shareShare

Shikhar

@shikharmurty

3 months ago

Note #1 about TreeReg: At ICLR 2023, we showed that context-freeness of span vectors predicts compositional generalization in transformers (arxiv.org/abs/2211.01288). Pratyusha Sharma and I had over 10 poster attendees asking us about a regularizer based on this idea. It took some

thumb_up_off_alt14

chat_bubble_outline1

repeat2

shareShare

Shikhar

@shikharmurty

3 months ago

Note #2 about TreeReg (faster grokking): At ACL 2023, we introduced structural grokking — where extended training lets Transformers discover hierarchical structure and generalize OOD, even when shortcuts work in-domain: arxiv.org/abs/2305.18741 With TreeReg, this transition is

thumb_up_off_alt7

chat_bubble_outline2

repeat1

shareShare

Josh Cason

@thegrizztronic

3 months ago

Some of y'all gave up on your parser project too early. Classical NLP is back: x.com/ShikharMurty/s…

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

TEDAI San Francisco

@tedaisf

3 months ago

🐋 Can AI help us understand whales — and ourselves? 📷 New TED Talk recorded at TEDAI San Francisco is live! Massachusetts Institute of Technology (MIT) researcher Pratyusha Sharma explores how machine learning is decoding the language of sperm whales — opening new frontiers in AI, linguistics & nature. ted.com/talks/pratyush…

🐋 Can AI help us understand whales — and ourselves? 📷 New TED Talk recorded at <a href="/TEDAISF/">TEDAI San Francisco</a> is live!

<a href="/MIT/">Massachusetts Institute of Technology (MIT)</a> researcher <a href="/pratyusha_PS/">Pratyusha Sharma</a> explores how machine learning is decoding the language of sperm whales — opening new frontiers in AI, linguistics & nature.

ted.com/talks/pratyush…

thumb_up_off_alt16

chat_bubble_outline0

repeat5

shareShare

Kabir

@kabirahuja004

3 months ago

I will be presenting 👇work at #NAACL2025 tomorrow (May 2) from 12 pm in Ballroom A. Please stop by if curious about inductive biases in transformers, generalization, and applying Bayesian models of cognition for understanding language models.

thumb_up_off_alt41

chat_bubble_outline0

repeat6

shareShare

Rajdeep Sardesai

@sardesairajdeep

3 months ago

I hope the IMF which gave a $ 1 billion loan to Pakistan last night realises that it has BLOOD ON ITS HANDS. India abstained, but shouldn’t many others who speak of ‘zero tolerance to terror’ have joined us ?

thumb_up_off_alt12,12K

chat_bubble_outline850

repeat1,1K

shareShare

Shikhar

@shikharmurty

a month ago

Some life updates: 1. Defended my thesis, "Building the learning-from-interaction pipeline for LLMs," on LLM browser agents that learn autonomously on digital environments, and inductive biases for compositionality. 2. Moved to NYC to start at Google Deepmind Language, where I

thumb_up_off_alt218

chat_bubble_outline20

repeat8

shareShare

Shikhar

Gate.io

Shikhar

Brian Roemmele

Shikhar

Xing Han Lu

Shikhar

Shikhar

Josh Cason

TEDAI San Francisco

Kabir

Rajdeep Sardesai

Shikhar