
Chris Alberti
@chris_alberti
ID: 339399933
21-07-2011 01:45:32
1 Tweet
78 Followers
67 Following

Excited to share new work done at Google DeepMind: 🏔️ DOLOMITES: Domain-Specific Long-Form Methodical Tasks, a new long-form generation benchmark for evaluating language models on **realistic** domain-specific tasks. Website: dolomites-benchmark.github.io Paper: arxiv.org/abs/2405.05938
