Afra Amini (@afra_amini)'s Twitter Profile
Afra Amini

@afra_amini

Ph.D. student at ETH AI Center, ex-intern @GoogleDeepMind

ID: 1439874627434582017

Link: https://afraamini.github.io/ · Joined: 20-09-2021 08:50:47

40 Tweets

420 Followers

276 Following

Mike S. Schäfer (@mss7676)'s Twitter Profile Photo

How well do different #LargeLanguageModels perform in portraying #climatechange information❓

Paper w/ colleagues from Google DeepMind & ETH Zürich - accepted for ICML Conference, one of the world's leading #machinelearning conferences

Link (open access)➡️openreview.net/forum?id=ScIHQ…

Thread⬇️
Niklas Stoehr (@niklas_stoehr)'s Twitter Profile Photo

Our new mechanistic interpretability work "Activation Scaling for Steering and Interpreting Language Models" was accepted into Findings of EMNLP 2024! 🔴🔵

📄arxiv.org/pdf/2410.04962

Kevin Du, Vésteinn Snæbjarnarson, Bob West, Ryan Cotterell and Aaron Schein

thread 👇
ETH AI Center (@eth_ai_center)'s Twitter Profile Photo

The #ETHAICenter application is now open! 
Interested in doing research on interdisciplinary AI topics? 
Join our Fellowship programs. Apply by 19 November 2024: ai.ethz.ch/apply
#PhD #PhDProgram #MachineLearning #AI #BigData #DataScience #DeepLearning #PostDoc
Afra Amini (@afra_amini)'s Twitter Profile Photo

It is extremely sad to see ETH recommending the rejection of students based on four criteria, which in many cases translate into the student's country of origin!

ETH AI Center (@eth_ai_center)'s Twitter Profile Photo

🚨 Only 12 Days Left to Apply for the ETH AI Center Fellowship Programs! 🚨

Don’t miss your chance to be part of Europe’s top AI research hub!

⏰ Apply now!
ai.ethz.ch/apply
Deadline: 19 Nov, 2024
Theodora Kontogianni (@dorakontog)'s Twitter Profile Photo

🚨 Recruiting PhD students to join my team at DTU Visual Computing & Pioneer Centre for AI on 3D Vision! 🇩🇰 🚲🌊🏰

Copenhagen is an emerging hub for computer vision with a thriving community—and it’s an amazing city!

📢 Apply now: efzu.fa.em2.oraclecloud.com/hcmUI/Candidat… & please reach out with any questions!

Ziteng Sun (@sziteng)'s Twitter Profile Photo

Inference-time procedures (e.g. Best-of-N, CoT) have been instrumental to recent development of LLMs. The standard RLHF framework focuses only on improving the trained model. This creates a train/inference mismatch.

Can we align our model to better suit a given inference-time…
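For background on one such procedure: Best-of-N draws N candidates and keeps the one a reward function scores highest. A minimal sketch with toy stand-ins for the generator and the reward model (not the thread's method):

```python
import random

def best_of_n(generate, reward, n=8):
    """Sample n candidates and return the highest-scoring one."""
    candidates = [generate() for _ in range(n)]
    return max(candidates, key=reward)

# Toy stand-ins: the "generator" draws random integers and the
# "reward model" prefers values close to 50.
random.seed(0)
generate = lambda: random.randint(0, 100)
reward = lambda x: -abs(x - 50)

best = best_of_n(generate, reward, n=16)
```

The train/inference mismatch the tweet describes: standard RLHF optimizes `generate` alone, even though at deployment the model is wrapped in a selector like `best_of_n`.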
Alice Bizeul (@alicebizeul)'s Twitter Profile Photo

✨New Preprint ✨ Ever thought that reconstructing masked pixels for image representation learning seems sub-optimal?

In our new preprint, we show how masking principal components—rather than raw pixel patches—improves Masked Image Modelling (MIM).

Find out more below 🧵
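The core idea can be illustrated in a few lines. This is a hypothetical numpy sketch, not the authors' implementation: treat each flattened patch as a vector, compute principal components over the patch set, and mask by zeroing a random subset of PCA coefficients instead of zeroing pixel patches.

```python
import numpy as np

def pca_mask(patches, mask_ratio=0.5, rng=None):
    """Mask principal components of flattened patches.

    patches: (n, d) array, one flattened patch per row.
    Returns the reconstruction with a random subset of
    PCA coefficients zeroed out.
    """
    rng = np.random.default_rng(rng)
    mean = patches.mean(axis=0)
    centered = patches - mean
    # Principal directions via SVD of the centered data.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    coeffs = centered @ vt.T            # per-patch PCA coefficients
    k = int(mask_ratio * vt.shape[0])
    masked = rng.choice(vt.shape[0], size=k, replace=False)
    coeffs[:, masked] = 0.0             # mask components, not pixels
    return coeffs @ vt + mean           # back to pixel space

patches = np.random.default_rng(0).normal(size=(64, 16))
visible = pca_mask(patches, mask_ratio=0.5, rng=0)
```

With `mask_ratio=0.0` the round trip through PCA space is lossless, so the function reduces to the identity; increasing the ratio removes more global structure from the input.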
Aarash Feizi (@aarashfeizi)'s Twitter Profile Photo

🚨 Excited to introduce PairBench! 🚨

💡 TL;DR: VLM-judges can fail at data comparison!

✅ PairBench helps you pick the right one by testing alignment, symmetry, smoothness & controllability—ensuring reliable auto-evaluation.

📄Paper: arxiv.org/abs/2502.15210

🧵  Thread: 👇
Afra Amini (@afra_amini)'s Twitter Profile Photo

Excited to share that this paper has been accepted to ICLR 2025 🎉

We've added more experiments in the camera-ready version: arxiv.org/pdf/2407.06057

Code is available here: github.com/rycolab/vbon

Ben Lipkin (@ben_lipkin)'s Twitter Profile Photo

Many LM applications may be formulated as targeting some (Boolean) constraint. Generate a…

- Python program that passes a test suite
- PDDL plan that satisfies a goal
- CoT trajectory that yields a positive reward

The list goes on… How can we efficiently satisfy these? 🧵👇
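For context, the naive baseline for satisfying a Boolean constraint is rejection sampling: draw from the model until a verifier accepts. A toy sketch with stand-ins for the sampler and the constraint (not the thread's proposed method):

```python
import random

def rejection_sample(propose, accept, max_tries=1000):
    """Draw proposals until the Boolean constraint accepts one."""
    for _ in range(max_tries):
        candidate = propose()
        if accept(candidate):
            return candidate
    raise RuntimeError("no accepted sample within budget")

# Toy stand-ins: "generate a program" becomes "draw an int",
# and the test-suite verifier becomes a divisibility check.
random.seed(0)
sample = rejection_sample(lambda: random.randint(1, 100),
                          lambda x: x % 7 == 0)
```

The cost of this baseline grows with how rarely the constraint accepts, which is exactly why more efficient constrained-generation methods are of interest.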

Saumya Malik (@saumyamalik44)'s Twitter Profile Photo

I’m thrilled to share RewardBench 2 📊— We created a new multi-domain reward model evaluation that is substantially harder than RewardBench, we trained and released 70 reward models, and we gained insights about reward modeling benchmarks and downstream performance!

Valentina Pyatkin (@valentina__py)'s Twitter Profile Photo

💡Beyond math/code, instruction following with verifiable constraints is suitable to be learned with RLVR.
But the set of constraints and verifier functions is limited and most models overfit on IFEval.
We introduce IFBench to measure model generalization to unseen constraints.