Dennis Aumiller
@d_aumiller
Getting paid to complain about LLM evaluation @cohere. PhD on summarization from @UniHeidelberg. Previously: @AmazonScience, @sap. Find me on Stackoverflow!
ID: 964558325001211904
https://dennis-aumiller.de 16-02-2018 17:53:20
804 Tweet
684 Followers
722 Following
A year ago we released LBBP - a drop-in replacement of HumanEval that was more challenging and less leaked Internally we have been using the multilingual version of this for benchmarking, and as code is not only python we decided to release that as well huggingface.co/datasets/Coherβ¦
ππ¨π‘ππ«π ππ¦πππ π―π - πππππ-π¨π-ππ‘π-ππ«π πππ±π & π’π¦ππ π π«πππ«π’ππ―ππ₯ Today we are releasing Embed v4, unlocking so many cool new features for retrieval. πΊπ³ 100+ languages πΌοΈ Text & Image capabilities π 128k context length
It's my first time area chairing for the ACLRollingReview May cycle! And it will also be the first time asking for availability of emergency reviewers π If you (or somebody you know) has availability for reviews in the Resources and Languages track, I have two papers missing reviews.