Alexis Ross (@alexisjross)'s Twitter Profile
Alexis Ross

@alexisjross

phd-ing @MIT_CSAIL, working on machine teaching | formerly @allen_ai, @harvard ‘20

ID: 1013082985241960454

https://alexisjihyeross.github.io/ | 30-06-2018 15:33:00

421 Tweets

3.3K Followers

935 Following

Lucy Li (@lucy3_li)

Hi friends, colleagues, followers. I am on the faculty job market! I am a PhD student at Berkeley School of Information + Berkeley AI Research. I work on NLP, and I believe all language, whether AI- or human-generated, is ✨social and cultural data✨. My work includes: 🧵

Jennifer Hu (@_jennhu)

I'm recruiting PhD students + postdocs for my lab, coming to Johns Hopkins University in Fall 2025! Our brand new lab is at the intersection of cognitive science and AI, using computational + behavioral methods to understand how language works in minds and machines. Details below! (1/4)

Najoung Kim 🫠 (@najoungkim)

tinlab at Boston University (with a new logo! 🪄) is recruiting PhD students for F25 and/or a postdoc! Our interests include meaning, generalization, evaluation design, and the nature of computation/representation underlying language and cognition, in both humans and machines. ⬇️

Zhaofeng Wu @ ICLR (@zhaofeng_wu)

I'll be presenting this paper next week at EMNLP. If you are interested in reward model generalizability and/or multilingual/cross-lingual alignment (or any random stuff), I'd be happy to chat!

Zhaofeng Wu @ ICLR (@zhaofeng_wu)

💡We find that models “think” 💭 in English (or in general, their dominant language) when processing distinct non-English or even non-language data types 🤯 like texts in other languages, arithmetic expressions, code, visual inputs, & audio inputs ‼️ 🧵⬇️arxiv.org/abs/2411.04986

Ekin Akyürek (@akyurekekin)

Why do we treat train and test times so differently? Why is one “training” and the other “in-context learning”? Just take a few gradients during test-time — a simple way to increase test time compute — and get a SoTA in ARC public validation set 61%=avg. human score! ARC Prize

Lucy Li (@lucy3_li)

If you like➕🔢💭 or you like 🐟, visit Kyle Lo Wed morning (todayyy) at #EMNLP2024 🌴. There will be a poster on this paper! (real 🐟 not included)

Rose (@rose_e_wang)

Everyone talks about AI tutors replacing humans...But can AI even follow a real tutoring session? Introducing POSR: The first framework & dataset spanning over 24,300 mins of real human tutoring to evaluate if AI can truly understand structure in conversations! 🎯

Tal Linzen (@tallinzen)

Reminder that I'm recruiting a couple of PhD students! The deadline is Dec 6 (data science) or Dec 18 (linguistics). Keywords: cognitively plausible LMs, generalization from small data, linguistics-adjacent LM evaluation and analysis, human sentence processing...

John Hewitt (@johnhewtt)

I’m hiring PhD students in computer science at Columbia! Our lab will tackle core challenges in understanding and controlling neural models that interact with language. for example,
- methods for LLM control
- discoveries of LLM properties
- pretraining for understanding

Sasha Rush (@srush_nlp)

This year, I have an exceptional student on the academic market. Wenting Zhao (@wzhao_nlp) builds systems that reason in natural settings. She combines AI & NLP to study newly emerging problems. She recently released WildChat (wildchat.allen.ai) and Commit-0

Isha Puri (@ishapuri101)

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint MIT CSAIL / Red Hat AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io

Ġabe Ġrand (@gabe_grand)

Tackling complex problems with LMs requires search/planning, but how should test-time compute be structured? Introducing Self-Steering, a new meta-reasoning framework where LMs coordinate their own inference procedures by writing code!

Hanna Wallach (@hannawallach.bsky.social) (@hannawallach)

Exciting news: the Fairness, Accountability, Transparency and Ethics (FATE) group at Microsoft Research NYC is hiring a predoctoral fellow!!! 🎉 microsoft.com/en-us/research…

jack morris (@jxmnop)

here are three awesome researchers everyone should follow:
- Songlin (Songlin Yang / phd at MIT),
- Will William Merrill (phd at NYU / gonna be a prof soon)
- Rulin (Rulin Shao / phd at UW)!

and here is why:
1. if i had to bet on one person to develop an architecture that