Navid Madani (@namadvid)'s Twitter Profile
Navid Madani

@namadvid

NLP researcher @ University at Buffalo

ID: 1582579773468975105

Joined: 19-10-2022 03:50:03

34 Tweets

11 Followers

112 Following

Masoud (@linguistmasoud)

I’m quite used to the cruelty students can face when they apply for a US visa but this one broke me. We offered admission to a stellar, talented & hardworking student. After months of work and hundreds of dollars, an embassy officer saw him for 5 mins & said no. why? …

Sergey Levine (@svlevine)

My recent talk at UCSD hosted by Jingbo Shang, covers an updated version of the RL with data material, including some new results on offline RL with LLMs for interactive dialogue agents (coming soon)! youtu.be/Iu_Uux0R0BI

Navid Madani (@namadvid)

There is a lot to learn from this short writing by Sutton. Although, as a PhD student in NLP today, it feels hard to accept. Maybe because it's hard to compete with large corporations with lots of computational power. incompleteideas.net/IncIdeas/Bitte…

Sergey Levine (@svlevine)

How do we get LLMs to interact with humans intelligently? Ask clarifying questions and reason about dialogue outcomes, vs. just single responses? Key idea: get LLMs to "simulate" human dialogues, then use offline RL on simulated data to learn optimal dialogue agents! A thread 👇

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr)

Language Model Inversion abs: arxiv.org/abs/2311.13647 Given the logits of the language model output (such as those returned by an API), a trained inversion model is able to reconstruct the original prompt. With Llama-2-7b-chat, their method is able to achieve a token-level F1

Carlos E. Perez (@intuitmachine)

1/n Was December 8th, 2023, the day when we've come to realize that AGI technology has been democratized? That it cannot be confined to the few and the GPU-rich? Let me explain to you what happened yesterday.

elvis (@omarsar0)

Really enjoyed NeurIPS! After attending great sessions around LLMs, I documented a huge list of interesting LLM papers that were either presented or mentioned. Here is a list of some of my favorite papers in no particular order. I have included papers that won awards and are

Andrej Karpathy (@karpathy)

The most unknown most common shortcut I use on my MacBook is:
- Command+Option+Shift+4 to select a small part of the screen and copy it into clipboard as an image
- Command+Shift+4 to do the same, but save it as a file on Desktop as png

Life-changing.

Andrej Karpathy (@karpathy)

# on technical accessibility

One interesting observation I think back to often:
- when I first published the micrograd repo, it got some traction on GitHub but then somewhat stagnated and it didn't seem that people cared much.
- then I made the video building it from scratch,

Sakana AI (@sakanaailabs)

Introducing Evolutionary Model Merge: A new approach bringing us closer to automating foundation model development. We use evolution to find great ways of combining open-source models, building new powerful foundation models with user-specified abilities! sakana.ai/evolutionary-m…

Cameron R. Wolfe, Ph.D. (@cwolferesearch)

LLM-as-a-Judge is one of the most widely-used techniques for evaluating LLM outputs, but how exactly should we implement LLM-as-a-Judge? To answer this question, let’s look at a few widely-cited papers / blogs / tutorials, study their exact implementation of LLM-as-a-Judge, and

Aran Komatsuzaki (@arankomatsuzaki)

Meta presents UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling - Scaling offers little benefit for reasoning or relations - Best VLMs struggle on simple digit recognition and counting tasks, e.g. MNIST repo: github.com/facebookresear… abs:

Andrej Karpathy (@karpathy)

I feel like a large amount of GDP is locked up because it is difficult for person A to very conveniently pay 5 cents to person B. Current high fixed costs per transaction force each of them to be of high enough amounts, which results in business models with purchase bundles,

elvis (@omarsar0)

Open Source LLM Tools If you are looking for useful open-source LLM tools, this is a really useful resource. It includes different categories like tutorials, AI engineering, and applications, among others. You can also see the # of GitHub stars.

Noam Brown (@polynoamial)

Today, I’m excited to share with you all the fruit of our effort at OpenAI to create AI models capable of truly general reasoning: OpenAI's new o1 model series! (aka 🍓) Let me explain 🧵 1/

Prakash (Ate-a-Pi) (@8teapi)

o1 personal testing megathread 🧵 Bookmark if you need to, just keeping track of reactions since a lot of us have held out personal test sets

Kenny Joseph (@_kenny_joseph)

Our* paper on embedding social media bios into dimensions of social meaning was recently accepted ICWSM. We think bios are a unique place of self-expression worth studying/using! * Navid Madani, Rabiraj Banerjee, Stefan McCabe, Michael Miller Yoder, Briony Swire-Thompson arxiv.org/abs/2305.09548

Raj Dabre (@prajdabre1)

As a senior researcher who has both published in A* venues and has written toolkits himself and overseen the development of several others, it is humiliating to have to do leetcode grinding for a research scientist role in big tech. I have written more code than most people and I