
Bobby
@bobby_he
Machine Learning postdoc @ETH. PhD from @UniofOxford and former research intern @DeepMind/@samsungresearch
ID: 472186323
https://bobby-he.github.io/
23-01-2012 17:56:07
69 Tweets
759 Followers
264 Following

BPE is a greedy method for finding a tokeniser that maximises compression! Why don't we try to find properly optimal tokenisers instead? Well, it seems this is a very difficult (in fact, NP-complete) problem! 🤯 New paper with P. Whittington and Gregor Bachmann :) arxiv.org/abs/2412.15210
