Navid Madani (@namadvid) Twitter Tweets • TwiCopy

Masoud

I’m quite used to the cruelty students can face when they apply for a US visa but this one broke me. We offered admission to a stellar, talented & hardworking student. After months of work and hundreds of dollars, an embassy officer saw him for 5 mins & said no. why? …

Sergey Levine

@svlevine

2 years ago

My recent talk at UCSD hosted by Jingbo Shang, covers an updated version of the RL with data material, including some new results on offline RL with LLMs for interactive dialogue agents (coming soon)! youtu.be/Iu_Uux0R0BI

Navid Madani

@namadvid

2 years ago

There is a lot to learn from this short writing by Sutton. Although, as a PhD student in NLP today, it feels hard to accept. Maybe because it's hard to compete with large corporations with lots of computational power. incompleteideas.net/IncIdeas/Bitte…

Sergey Levine

@svlevine

2 years ago

How do we get LLMs to interact with humans intelligently? Ask clarifying questions and reason about dialogue outcomes, vs. just single responses? Key idea: get LLMs to "simulate" human dialogues, then use offline RL on simulated data to learn optimal dialogue agents! A thread 👇

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

2 years ago

Language Model Inversion abs: arxiv.org/abs/2311.13647 Given the logits of the language model output (such as those returned by an API), a trained inversion model is able to reconstruct the original prompt. With Llama-2-7b-chat, their method is able to achieve a token-level F1

Carlos E. Perez

@intuitmachine

2 years ago

1/n Was December 8th, 2023, the day when we've come to realize that AGI technology has been democratized? That it cannot be confined to the few and the GPU-rich? Let me explain to you what happened yesterday.

elvis

@omarsar0

2 years ago

Really enjoyed NeurIPS! After attending great sessions around LLMs, I documented a huge list of interesting LLM papers that were either presented or mentioned. Here is a list of some of my favorite papers in no particular order. I have included papers that won awards and are

Zoubin Ghahramani

@zoubinghahrama1

2 years ago

A nice way to end the year with some data: 66 Good News Stories You Didn't Hear About in 2023 futurecrunch.com/goodnews2023/

Andrej Karpathy

@karpathy

2 years ago

The most unknown most common shortcut I use on my MacBook is: - Command+Option+Shift+4 to select a small part of the screen and copy it into clipboard as an image - Command+Shift+4 to do the same, but save it as a file on Desktop as png Life-changing.

Andrej Karpathy

@karpathy

a year ago

# on technical accessibility One interesting observation I think back to often: - when I first published the micrograd repo, it got some traction on GitHub but then somewhat stagnated and it didn't seem that people cared much. - then I made the video building it from scratch,

Sakana AI

@sakanaailabs

a year ago

Introducing Evolutionary Model Merge: A new approach bringing us closer to automating foundation model development. We use evolution to find great ways of combining open-source models, building new powerful foundation models with user-specified abilities! sakana.ai/evolutionary-m…

Cameron R. Wolfe, Ph.D.

@cwolferesearch

a year ago

LLM-as-a-Judge is one of the most widely-used techniques for evaluating LLM outputs, but how exactly should we implement LLM-as-a-Judge? To answer this question, let’s look at a few widely-cited papers / blogs / tutorials, study their exact implementation of LLM-as-a-Judge, and

Aran Komatsuzaki

@arankomatsuzaki

a year ago

Meta presents UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling - Scaling offers little benefit for reasoning or relations - Best VLMs struggle on simple digit recognition and counting tasks, e.g. MNIST repo: github.com/facebookresear… abs:

Andrej Karpathy

@karpathy

a year ago

I feel like a large amount of GDP is locked up because it is difficult for person A to very conveniently pay 5 cents to person B. Current high fixed costs per transaction force each of them to be of high enough amounts, which results in business models with purchase bundles,

Yann LeCun

@ylecun

a year ago

"Our country is being poisoned" by immigrants, like me, Elon Musk, Sundar Pichai, Satya Nadella, Vinod Khosla, Jensen Huang, and two of Trump's three wives.

elvis

@omarsar0

10 months ago

Open Source LLM Tools If you are looking for useful open-source LLM tools, this is a really useful resource. It includes different categories like tutorials, AI engineering, and applications, among others. You can also see the # of GitHub stars.

Noam Brown

@polynoamial

10 months ago

Today, I’m excited to share with you all the fruit of our effort at OpenAI to create AI models capable of truly general reasoning: OpenAI's new o1 model series! (aka 🍓) Let me explain 🧵 1/

Prakash (Ate-a-Pi)

@8teapi

10 months ago

o1 personal testing megathread 🧵 Bookmark if you need to, just keeping track of reactions since a lot of us have held out personal test sets

Kenny Joseph

@_kenny_joseph

10 months ago

Our* paper on embedding social media bios into dimensions of social meaning was recently accepted ICWSM. We think bios are a unique place of self-expression worth studying/using! * Navid Madani Rabiraj Banerjee Stefan McCabe Michael Miller Yoder Briony Swire-Thompson arxiv.org/abs/2305.09548

Raj Dabre

@prajdabre1

8 months ago

As a senior researcher who has both published in A* venues and has written toolkits himself and overseen the development of several others, it is humiliating to have to do leetcode grinding for a research scientist role in big tech. I have written more code than most people and I
