Josef Valvoda (@valvodajosef) 's Twitter Profile
Josef Valvoda

@valvodajosef

LLMs at Amazon

ID: 270967639

linkhttps://valvoda.github.io/ calendar_today23-03-2011 16:02:49

235 Tweet

719 Followers

1,1K Following

Afra Amini (@afra_amini) 's Twitter Profile Photo

Best-of-N is a straightforward LLM alignment algorithm: return the highest reward sample of N attempts 👍simple & effective 👎generation throughput decreases by a factor of N. 🤔Can we keep the 👍 while eliminating the 👎? 🧵 w/@ryandcotterell Tim Vieira arxiv.org/pdf/2407.06057…

Best-of-N is a straightforward LLM alignment algorithm: return the highest reward sample of N attempts
👍simple & effective
👎generation throughput decreases by a factor of N.
🤔Can we keep the 👍 while eliminating the 👎? 🧵

w/@ryandcotterell <a href="/xtimv/">Tim Vieira</a>
arxiv.org/pdf/2407.06057…
C♥️LM Workshop 2024 (@calm_workshop) 's Twitter Profile Photo

We are happy 😁 to announce 📢 the First Workshop on Causality and Large Models (C♥️LM) at #NeurIPS2024 📜 Submission deadline: September 06 (4-6 pages) 💻 Website: calm-workshop-2024.github.io

We are happy 😁 to announce 📢 the First Workshop on Causality and Large Models (C♥️LM) at #NeurIPS2024 

📜 Submission deadline: September 06 (4-6 pages)
💻 Website: calm-workshop-2024.github.io
C♥️LM Workshop 2024 (@calm_workshop) 's Twitter Profile Photo

We have extended the submission deadline to September 23, 2024, AoE. You can visit our website to submit a paper or sign up as a reviewer: calm-workshop-2024.github.io

NLLP Workshop (@nllpworkshop) 's Twitter Profile Photo

📬As EMNLP 2025 notifications roll in, a friendly reminder: We accept papers 📝 from ARR (w/ reviews & meta-review✅) until 27 Sept (AoE) 📅 Notifications drop Oct 8! Brace yourselves 💪 & commit 🔄 one more time! CfP: nllpw.org/workshop/call/ #nllp #legaltech #EMNLP2024

Luca Soldaini ✈️ ICLR 25 (@soldni) 's Twitter Profile Photo

Olmo goes multimodal! We are launching Molmo, a open family of multimodal models that rival the best closed VLMs out there 🤯 We spent the last 9 months meticulously curating PixMo, a dataset of (a) high-quality image-caption pairs and (b) multimodal instruction data.

Olmo goes multimodal!

We are launching Molmo, a open family of multimodal models that rival the best closed VLMs out there 🤯

We spent the last 9 months meticulously curating PixMo, a dataset of (a) high-quality image-caption pairs and (b) multimodal instruction data.
Adina Williams (@adinamwilliams) 's Twitter Profile Photo

Our responsible AI team is hiring 3 research scientist interns this cycle (2 in Montreal, one in NYC). We're seeking enrolled PhD students who are excited to spend their summer figuring out how to ensure vision and/or language models work for everyone! metacareers.com/jobs/532549086…

Samuel Müller (@samuelmullr) 's Twitter Profile Photo

Transformers perform remarkable generalizations in the in-context learning setting. E.g. when trained only on step functions, the model generalizes to smooth predictions when given a smooth input. (1/n, a paper thread)

Transformers perform remarkable generalizations in the in-context learning setting.
E.g. when trained only on step functions, the model generalizes to smooth predictions when given a smooth input.
(1/n, a paper thread)
Valentina Pyatkin (@valentina__py) 's Twitter Profile Photo

📣Research internship hiring at Ai2 is now open! We're hiring research interns at Ai2 to help at every stage of our Open Language Model (OLMo) workstream and we want people who can contribute meaningfully to our best models! Preferred deadline for consideration: Nov. 3rd, 2024

📣Research internship hiring at Ai2 is now open! 

We're hiring research interns at Ai2 to help at every stage of our Open Language Model (OLMo) workstream and we want people who can contribute meaningfully to our best models!

Preferred deadline for consideration: Nov. 3rd, 2024
Ethan Gotlieb Wilcox (@wegotlieb) 's Twitter Profile Photo

If you are interested in topics at the intersection of NLP, Cognitive Science, and Linguistics, please consider applying to the Georgetown PhD program. It's fully funded, and we have a lively 💃🪩🕺 interdisciplinary community on campus!

Alex Warstadt (@a_stadt) 's Twitter Profile Photo

I'm excited to announce my new lab: UCSD's Learning Meaning and Natural Language Lab. a.k.a. LeM🍋N Lab! And 📢WE ARE RECRUITING📢 PhD students to join us in sunny San Diego in either Linguistics OR Data Science. Apply by Dec 4: connect.grad.ucsd.edu/apply/ More about the lab👇

I'm excited to announce my new lab: UCSD's Learning Meaning and Natural Language Lab.
     a.k.a. LeM🍋N Lab!

And 📢WE ARE RECRUITING📢 PhD students to join us in sunny San Diego in either Linguistics OR Data Science. Apply by Dec 4: connect.grad.ucsd.edu/apply/

More about the lab👇
NLLP Workshop (@nllpworkshop) 's Twitter Profile Photo

🥁 Drum roll... 🥁 🏆 The Best Presentation Award goes to ... 🎉 Ieva Raminta @ievaraminta.bsky.social for presenting 📄"Comparative Study of Explainability Methods for Legal Outcome Prediction" 🤖⚖️ 🎊 Congrats! Award comes with 💵500 sponsored by Tech At Bloomberg 💻✨

🥁 Drum roll... 🥁

🏆 The Best Presentation Award goes to ... 🎉 

<a href="/RamintaIeva/">Ieva Raminta @ievaraminta.bsky.social</a> for presenting 📄"Comparative Study of Explainability Methods for Legal Outcome Prediction" 🤖⚖️

🎊 Congrats! Award comes with 💵500 sponsored by <a href="/TechAtBloomberg/">Tech At Bloomberg</a> 💻✨
Maria Antoniak (@maria_antoniak) 's Twitter Profile Photo

I'm recruiting 1-2 PhD students to work with me at the University of Colorado Boulder! Looking for creative students with interests in NLP and Cultural Analytics. Boulder is a lovely college town 30 min from Denver and 1 hr from Rocky Mountain National Park 😎 Apply by Dec 15!

I'm recruiting 1-2 PhD students to work with me at the University of Colorado Boulder! Looking for creative students with interests in NLP and Cultural Analytics.

Boulder is a lovely college town 30 min from Denver and 1 hr from Rocky Mountain National Park 😎

Apply by Dec 15!
Tiago Pimentel (@tpimentelms) 's Twitter Profile Photo

Hey :) I'm looking for 3 emergency reviewers for ARR submissions🚨📷 they are all in LM interpretability! Should be submitted within the next 36 hours 🙃 If you are interested, please DM me! #NLProc #NLP

Peter West (@peterwesttm) 's Twitter Profile Photo

I have multiple MSc/PhD openings in my lab at UBC Computer Science! Come discover the hidden capabilities/limits of LLMs, e.g. how to learn from, guide, and understand the outputs of models. See my website (bio) for more details. cs.ubc.ca/students/grad/… Apply by December 15th! Also...

I have multiple MSc/PhD openings in my lab at <a href="/UBC_CS/">UBC Computer Science</a>! Come discover the hidden capabilities/limits of LLMs, e.g. how to learn from, guide, and understand the outputs of models. See my website (bio) for more details. 
cs.ubc.ca/students/grad/…
Apply by December 15th! Also...
Niloofar (on faculty job market!) (@niloofar_mire) 's Twitter Profile Photo

I'm on the faculty market and at #NeurIPS!👩‍🏫 homes.cs.washington.edu/~niloofar/ I work on privacy, memorization, and emerging challenges in data use for AI. Privacy isn't about PII removal but about controlling the flow of information contextually, & LLMs are still really bad at this!

I'm on the faculty market and at #NeurIPS!👩‍🏫
homes.cs.washington.edu/~niloofar/

I work on privacy, memorization, and emerging challenges in data use for AI.

Privacy isn't about PII removal but about controlling the flow of information contextually, &amp; LLMs are still really bad at this!
Butoi Alexandra (@butoialexandra) 's Twitter Profile Photo

To appear at ICLR 2025, "Training Neural Networks as Recognizers of Formal Languages", and new benchmark, 🔥FLaRe🔥, for Formal Language Recognition! With Ghazal Khalighinejad, Anej Svete, Josef Valvoda, Ryan Cotterell, and Brian DuSell. See thread below.

Sian Gooding (@siangooding) 's Twitter Profile Photo

New paper alert from Google DeepMind! 🚨 We've put LLMs to the test as writing co-pilots – how good are they really at helping us write? LLMs are increasingly used for open-ended tasks like writing assistance, but how do we assess their effectiveness? 🤔 arxiv.org/abs/2503.19711

Yanai Elazar (@yanaiela) 's Twitter Profile Photo

💡 New ICLR paper! 💡 "On Linear Representations and Pretraining Data Frequency in Language Models": We provide an explanation for when & why linear representations form in large (or small) language models. Led by Jack Merullo , w/ Noah A. Smith & Sarah Wiegreffe

💡 New ICLR paper! 💡
"On Linear Representations and Pretraining Data Frequency in Language Models":

We provide an explanation for when &amp; why linear representations form in large (or small) language models.

Led by <a href="/jack_merullo_/">Jack Merullo</a> , w/ <a href="/nlpnoah/">Noah A. Smith</a> &amp; <a href="/sarahwiegreffe/">Sarah Wiegreffe</a>
Peter Henderson (@peterhndrsn) 's Twitter Profile Photo

Hallucinated-Citation Watch (May 27 – Jun 2 2025) 📊🧵 7️⃣ Seven separate decisions flagged fake authorities for the week that I've found so far, bringing the total in our tracker to 172 cases. ⚖️ 4 U.S. federal district courts, 1 U.S. state-level tax court, 1 Canadian superior

Hallucinated-Citation Watch (May 27 – Jun 2 2025) 📊🧵

7️⃣ Seven separate decisions flagged fake authorities for the week that I've found so far, bringing the total in our tracker to 172 cases.
⚖️ 4 U.S. federal district courts, 1 U.S. state-level tax court, 1 Canadian superior