Shaojie Bai (@shaojieb) Twitter Tweets • TwiCopy

Sepp Hochreiter

4 years ago

ArXiv arxiv.org/abs/2204.08442: Optical flow estimation by deep equilibrium models. 4 to 6× times less memory than recurrent networks since using cheap inexact gradient makes backward pass almost for free. Improves SOTA methods on Sintel and KITTI with less compute and memory.

thumb_up_off_alt72

chat_bubble_outline1

repeat16

shareShare

Dmytro Mishkin 🇺🇦

@ducha_aiki

4 years ago

Deep Equilibrium Optical Flow Estimation Shaojie Bai, Zhengyang Geng, Yash Savani, Zico Kolter tl;dr: DEQ ("infinite depth aka single layer", arxiv.org/abs/1909.01377) look like natural fit for optical flow estimation. arxiv.org/abs/2204.08442 github.com/locuslab/deq-f…

Deep Equilibrium Optical Flow Estimation

<a href="/shaojieb/">Shaojie Bai</a>, Zhengyang Geng, <a href="/yashsavani_/">Yash Savani</a>, <a href="/zicokolter/">Zico Kolter</a>

tl;dr: DEQ ("infinite depth aka single layer", arxiv.org/abs/1909.01377) look like natural fit for optical flow estimation.

arxiv.org/abs/2204.08442
github.com/locuslab/deq-f…

thumb_up_off_alt32

chat_bubble_outline2

repeat6

shareShare

Shaojie Bai

@shaojieb

4 years ago

Stop using RNNs for optical flow!🧐 Introducing DEQ-flow that directly models a **fixed point** flow estimate, is super memory efficient, leads to significant error reduction on KITTI-15, and is compatible with prior modeling efforts (e.g., RAFT). Paper: arxiv.org/abs/2204.08442

thumb_up_off_alt77

chat_bubble_outline1

repeat12

shareShare

Michael Chang

@mmmbchang

4 years ago

Learning to represent objects is a major research direction towards representing the causal structure of the world. In our oral at #iclr2022 workshop on Objects Structure & Causality, we present a new way to conceptualize objects: as stable points of a fixed-point procedure: 👇

thumb_up_off_alt317

chat_bubble_outline4

repeat48

shareShare

Zico Kolter

@zicokolter

3 years ago

I realize this is seemingly an unpopular opinion, but I can't get onboard with these Twitter criticisms of some of the recent #ICML2022 best paper awardees. I've been thinking about this all day. A thread... 🧵 1/N

thumb_up_off_alt872

chat_bubble_outline18

repeat83

shareShare

Zico Kolter

@zicokolter

3 years ago

I just posted our Deep Learning Systems Lecture 6 on Fully Connected Networks, Optimization, and Initialization: youtu.be/CukpVt-1PA4 However, the real topic of interest here is that I used OpenAI's whisper to caption it entirely. A thread 🧵on my experience. 1/N

thumb_up_off_alt150

chat_bubble_outline5

repeat22

shareShare

Cem Anil

@cem__anil

3 years ago

🆕📜When can **Equilibrium Models** learn from simple examples to handle complex ones? We identify a property — Path Independence — that enables this by letting EMs think for longer on hard examples. (NeurIPS) 📝: [arxiv.org/abs/2211.09961](arxiv.org/abs/2211.09961)

thumb_up_off_alt115

chat_bubble_outline3

repeat35

shareShare

Zhengyang Geng

@zhengyanggeng

3 years ago

NeurIPS!!! First in-person meeting after 3yrs starting my research. 🥳 Glad to have any chats, neural dynamics, deep equilibrium models (DEQ), symmetries, protein folding/AF2, etc. Will be working on the intersection of DEQ and AF2 and expect to see all the collaboration chances!

thumb_up_off_alt30

chat_bubble_outline0

repeat1

shareShare

Zico Kolter

@zicokolter

2 years ago

Cade Metz at the New York Times just published a piece on a new paper we are releasing today, on adversarial attacks against LLMs. You can read the piece here: nytimes.com/2023/07/27/tec… And find more info and the paper at: llm-attacks.org [1/n]

thumb_up_off_alt339

chat_bubble_outline10

repeat76

shareShare

Brandon Amos

@brandondamos

2 years ago

My core ML team (AI at Meta) is hiring research interns! Our projects span optimization, optimal transport, optimal control, generative modeling, complex systems, and geometry. Please apply here and reach out ([email protected]) if you're interested: metacareers.com/jobs/627997209…

thumb_up_off_alt290

chat_bubble_outline4

repeat36

shareShare

Shaojie Bai

@shaojieb

2 years ago

Exciting new work with Evonne, Alexander Richard et al! 🤯 We (generatively) animate photorealistic full-body avatars using conversational audio (of anyone!). Next step, seeing GPT4 + photorealistic avatar [argue with]/[point a finger at]/[mock] you in VR?😏

thumb_up_off_alt14

chat_bubble_outline1

repeat0

shareShare

Zico Kolter

@zicokolter

2 years ago

I feel like a lot of people leverage LLMs suboptimally, especially for long-form interactions that span a whole project. So I wrote a VSCode extension that supports what I think is a better use paradigm. 🧵 1/N Extension: marketplace.visualstudio.com/items?itemName… Code: github.com/locuslab/chatl…

thumb_up_off_alt326

chat_bubble_outline6

repeat46

shareShare

Zhengyang Geng

@zhengyanggeng

2 years ago

🚀Our latest blog post unveils the power of Consistency Models and introduces Easy Consistency Tuning (ECT), a new way to fine-tune pretrained diffusion models to consistency models. SoTA fast generative models using 1/32 training cost! 🔽 Get ready to speed up your generative

thumb_up_off_alt150

chat_bubble_outline7

repeat50

shareShare

OpenAI

@openai

2 years ago

Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: openai.com/index/hello-gp… Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks.

thumb_up_off_alt58,58K

chat_bubble_outline2,2K

repeat13,13K

shareShare

Zico Kolter

@zicokolter

a year ago

I'm thrilled to share that I will become the next Director of the Machine Learning Department at Carnegie Mellon. MLD is a true gem, a department dedicated entirely to ML. Faculty and past directors have been personal role models in my career. cs.cmu.edu/news/2024/kolt…

thumb_up_off_alt1,1K

chat_bubble_outline121

repeat79

shareShare

Russ Salakhutdinov

@rsalakhu

a year ago

I am very excited to start working with GenAI team at Meta, focusing on multimodal LLM agents, joining together with my amazing CMU colleagues Jing Yu Koh Jing Yu Koh and Daniel Fried Daniel Fried!

I am very excited to start working with GenAI team at <a href="/Meta/">Meta</a>, focusing on multimodal LLM agents, joining together with my amazing CMU colleagues Jing Yu Koh <a href="/kohjingyu/">Jing Yu Koh</a> and Daniel Fried <a href="/dan_fried/">Daniel Fried</a>!

thumb_up_off_alt582

chat_bubble_outline26

repeat25

shareShare

Zico Kolter

@zicokolter

a year ago

I'm excited to announce that I am joining the OpenAI Board of Directors. I'm looking forward to sharing my perspectives and expertise on AI safety and robustness to help guide the amazing work being done at OpenAI.

thumb_up_off_alt1,1K

chat_bubble_outline76

repeat75

shareShare

Jiao Sun

@sunjiao123sun_

a year ago

Mitigating racial bias from LLMs is a lot easier than removing it from humans! Can’t believe this happened at the best AI conference NeurIPS Conference We have ethical reviews for authors, but missed it for invited speakers? 😡

Mitigating racial bias from LLMs is a lot easier than removing it from humans!

Can’t believe this happened at the best AI conference <a href="/NeurIPSConf/">NeurIPS Conference</a>

We have ethical reviews for authors, but missed it for invited speakers? 😡

thumb_up_off_alt3,3K

chat_bubble_outline184

repeat837

shareShare

Ahmad Al-Dahle

@ahmad_al_dahle

7 months ago

Introducing our first set of Llama 4 models! We’ve been hard at work doing a complete re-design of the Llama series. I’m so excited to share it with the world today and mark another major milestone for the Llama herd as we release the *first* open source models in the Llama 4

thumb_up_off_alt5,5K

chat_bubble_outline323

repeat959

shareShare

Shaojie Bai

@shaojieb

7 months ago

We’ll be mixing ideas, cocktails and discussions for a night of *neural-networking* at Singapore! Come learn more about Thinking Machines at our happy hour at #iclr2025 😎

thumb_up_off_alt13

chat_bubble_outline0

repeat0

shareShare