Dong Won (Don) Lee (@_dongwonlee) 's Twitter Profile
Dong Won (Don) Lee

@_dongwonlee

PhD at Personal Robots Group @MIT advised by @cynthiabreazeal, @lpmorency working on multimodal social AI agents. currently @MSFTResearch, prev. MS/BS @mldcmu

ID: 1465969247197319170

Link: http://dongwonl.com · Joined: 01-12-2021 09:01:30

117 Tweets

110 Followers

143 Following

Diyi Yang (@diyi_yang) 's Twitter Profile Photo

Learning social skills is out of reach for most people🙁 How can we make social skill training more accessible? We introduce 🌟APAM🌟 (AI Partner and AI Mentor) that leverages LLMs for social skill training via realistic practice and tailored feedback!

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments The first-of-its-kind scalable, real computer environment for multimodal agents, supporting task setup, execution-based evaluation, and interactive learning across various operating

Maria Khalusova (@mariakhalusova) 's Twitter Profile Photo

I put together a quick colab notebook using Llama-3-8B-Instruct for chatting with a PDF. Unstructured API for partitioning and chunking a large PDF file, FAISS for vector storage, LangChain for RAG, and quantized Llama-3-8B-Instruct (so that it fits on free Colab's GPU).
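The notebook above wires together Unstructured (chunking), FAISS (vector search), LangChain (RAG), and a quantized Llama-3-8B-Instruct. As a dependency-free illustration of the retrieval step only, here is a toy sketch that ranks pre-chunked passages by bag-of-words cosine similarity standing in for the dense FAISS index; the chunk texts and the generation step are placeholders, not the notebook's actual data or code:

```python
from collections import Counter
import math

def embed(text):
    # Toy bag-of-words "embedding"; the real notebook uses dense
    # sentence embeddings stored in a FAISS index.
    words = text.lower().translate(str.maketrans("", "", "?,.!")).split()
    return Counter(words)

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, chunks, k=2):
    # Rank document chunks by similarity to the query, as a vector
    # store would, and return the top-k as prompt context.
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]

chunks = [
    "Llama 3 was trained on 15 trillion tokens of data.",
    "FAISS is a library for efficient similarity search.",
    "The PDF is partitioned into chunks before indexing.",
]
top = retrieve("How many tokens was Llama 3 trained on?", chunks, k=1)
# In the full pipeline, `top` would be stuffed into the LLM prompt
# so the quantized Llama-3-8B-Instruct can answer from the PDF.
```

This only illustrates the retrieve-then-generate flow; the quantization (so the model fits on a free Colab GPU) and the LangChain chain are separate concerns handled by the notebook itself.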

Rafael Rafailov @ NeurIPS (@rm_rafailov) 's Twitter Profile Photo

We have a new preprint out - your language model is not a reward, it’s a Q function! 1. The likelihood of the preferred answer must go down - it’s a policy divergence 2. MCTS guided decoding on language is equivalent to likelihood search on DPO 3. DPO learns credit assignment

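For context, the preprint's framing builds on the standard DPO objective (Rafailov et al.); below is a sketch of that loss, where σ is the logistic function, β the KL penalty weight, π_θ the policy, π_ref the reference model, and (y_w, y_l) the preferred/dispreferred pair:

```latex
% Standard DPO objective, which the preprint reinterprets
% token-wise as an implicit Q-function:
\mathcal{L}_{\mathrm{DPO}}(\pi_\theta;\pi_{\mathrm{ref}})
  = -\,\mathbb{E}_{(x,\,y_w,\,y_l)\sim\mathcal{D}}
    \left[\log\sigma\!\left(
      \beta\log\frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
      - \beta\log\frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}
    \right)\right]
```

Only the margin between the two log-ratio terms is pushed up, which is consistent with point 1 in the thread: the absolute likelihood of the preferred answer can still fall while the objective improves.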
Xingyu Fu (@xingyufu2) 's Twitter Profile Photo

Can GPT-4V and Gemini-Pro perceive the world the way humans do? 🤔 Can they solve the vision tasks that humans can in the blink of an eye? 😉 tldr; NO, they are far worse than us 💁🏻‍♀️ Introducing BLINK👁 zeyofu.github.io/blink/, a novel benchmark that studies visual perception

Leena Mathur (@lmathur_) 's Twitter Profile Photo

Curious about socially-intelligent AI? Check out our paper on underlying technical challenges, open questions, and opportunities to advance social intelligence in AI agents: Work w/ LP Morency, Paul Liang 📰Paper: arxiv.org/abs/2404.11023 💻Repo: github.com/l-mathur/socia… 🧵1/9

andrew gao (@itsandrewgao) 's Twitter Profile Photo

Someone just dropped a dataset of 15 trillion tokens (as many as were used to train Llama 3)!!! Download this now before it gets taken down for “copyright reasons” Breakdown in thread 🧵 👇👇

Carlos E. Perez (@intuitmachine) 's Twitter Profile Photo

1/n An Ontology for Agentic AI. Single Agent Architectures: planning, self-correction, and suitability for straightforward tasks. Examples: * ReAct (Reason + Act): iterative process of thought, action, and observation. * RAISE: ReAct with memory mechanism (short-term and

Carlos E. Perez (@intuitmachine) 's Twitter Profile Photo

AlphaLLM: An LLM that Learns and Improves Itself Large Language Models (LLMs) have revolutionized the field of Natural Language Processing, demonstrating remarkable capabilities in various tasks. However, they still struggle with complex reasoning and planning, often requiring

Abhinav Rao (@aethersura) 's Twitter Profile Photo

New paper on LLMs+culture! 🎊🎉 Thrilled to share our work on NormAd, a dataset evaluating whether LLMs can adapt to the diversity of cultural norms worldwide! (Spoiler: they can't!) ArXiv: arxiv.org/abs/2404.12464 w/ Akhila Yerukola Vishwa Shah Katharina Reinecke Maarten Sap (he/him) [1/n]

Yubin Kim (@ybkim95_ai) 's Twitter Profile Photo

I'm excited to share my recent publication in CHIL 2024, "Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data". Our study reveals the potential of LLMs as personal health learners with wearable sensors. Arxiv: arxiv.org/pdf/2401.06866…

Yi Tay (@yitayml) 's Twitter Profile Photo

New paper from Reka 🔥 (yes an actual paper). This time we're releasing part of our internal evals which we call Vibe-Eval 😃 This comprises a hard set which imo is pretty challenging for frontier models today. The fun part here is that we constructed it by trying to

Chief AI Officer (@chiefaioffice) 's Twitter Profile Photo

AI phone agents are here. Here are 8 startups enabling this you should know about + funding: 1. Bland AI - undisclosed - available as an API - $0.09/minute

Phillip Isola (@phillip_isola) 's Twitter Profile Photo

New paper: The Platonic Representation Hypothesis In which we posit that _different_ foundation models are converging to the _same_ representation of reality. paper: arxiv.org/abs/2405.07987 website: phillipi.github.io/prh/ code: github.com/minyoungg/plat… 1/8

Dan Hendrycks (@danhendrycks) 's Twitter Profile Photo

As an alternative to RLHF and adversarial training, we released short-circuiting. It makes models ~100x more robust. It works for LLMs, multimodal models, and agents. Unlike before, I now think robustly stopping models from generating harmful outputs may be highly tractable and

Paul Liang (@pliang279) 's Twitter Profile Photo

heading to #emnlp2024! would love to chat with those interested in joining our Multisensory Intelligence research group at MIT MIT Media Lab MIT EECS media.mit.edu/groups/multise… Our group studies the foundations of multisensory AI to create human-AI symbiosis across scales and sensory

Sherjil Ozair (@sherjilozair) 's Twitter Profile Photo

Very happy to hear that GANs are getting the test of time award at NeurIPS 2024. The NeurIPS test of time awards are given to papers which have stood the test of time for a decade. I took some time to reminisce about how GANs came about and how AI has evolved in the last decade.

AI at Meta (@aiatmeta) 's Twitter Profile Photo

🚀New from Meta FAIR: today we’re introducing Seamless Interaction, a research project dedicated to modeling interpersonal dynamics. The project features a family of audiovisual behavioral models, developed in collaboration with Meta’s Codec Avatars lab + Core AI lab, that