Benjamin Feuer (@feuerbenjamin)'s Twitter Profile
Benjamin Feuer

@feuerbenjamin

PhD Candidate in Computer Science, NYU, Deep Learning

ID: 1526193518456320000

16-05-2022 13:31:35

159 Tweets

62 Followers

143 Following

Eugene Vinitsky 🍒🦋 (@eugenevinitsky)

In our new paper, we find that LLMs can efficiently do RLHF in-context! Our method, in-context preference learning (ICPL), iterates LLMs writing reward functions, training agents, and putting preferences into context. We see a 30x boost in query efficiency over baseline RLHF!

John P Dickerson (@johnpdickerson)

Micah's an incredible mentor and one of the most creative early-career AI/ML thinkers out there -- if you're looking for a potential PhD advisor, can't imagine a better choice!

Agronomy, Crop, and Soil Science Societies (@asa_cssa_sssa)

🐞 Check out this zero-shot AI model dataset with 6M images of important species that is vital to farming and environmental research! Learn more: ow.ly/Kzig50TRSaC #ZeroShotLearning #AIModels #AgriculturalTech #EnvironmentalResearch Chinmay Hegde Benjamin Feuer AIIRA - AI Institute for Resilient Agriculture

Benjamin Feuer (@feuerbenjamin)

After careful consideration, I have decided to leave X for BlueSky. I hope to see many of you there with me very soon! Benjamin.bsky.social

Manos Koukoumidis (@koukoumidis)

If AI isn’t truly open, it will fail us. We can’t close in a black box our greatest invention yet just so that a few can freely monetize. AI needs its Linux moment, and so we started working towards it. This can only succeed if we all work together! #oumi #opensource

Rohan Paul (@rohanpaul_ai)

Oumi:build state-of-the-art foundation models, end-to-end. Oumi is a fully open-source platform designed to train, evaluate, and deploy foundation models end-to-end. It supports models from 10M to 405B parameters, enabling fine-tuning using LoRA, QLoRA, DPO, and other

Thao Nguyen (@thao_nguyen26)

📢 Announcing our data-centric workshop at ICML 2025 on unifying data curation frameworks across domains! 📅 Deadline: May 24, AoE 🔗 Website: dataworldicml2025.github.io We have an amazing lineup of speakers + panelists from various institutions and application areas.

Mike A. Merrill (@mike_a_merrill)

Many agents (Claude Code, Codex CLI) interact with the terminal to do valuable tasks, but do they currently work well enough to deploy en masse? We’re excited to introduce Terminal-Bench: An evaluation environment and benchmark for AI agents on real-world terminal tasks. Tl;dr

Ryan Marten (@ryanmart3n)

Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-Distill-Qwen-7B by 33% on average over code, science, and math evals. We also release our dataset, OpenThoughts3-1.2M, which is the best open reasoning dataset across all data

Benjamin Feuer (@feuerbenjamin)

New research paper for you to read over your July 4th break (if you're US-based) -- Vision is a skeleton key! 🗝️ We convert a small VLM into an "everything classifier" by transforming data into visualizations that VLMs can naturally understand and reason about. We call it
