Katerina Fragkiadaki (@katerinafragiad)'s Twitter Profile
Katerina Fragkiadaki

@katerinafragiad

Associate Professor @CMU working on #AI #ComputerVision #Robotics #LanguageGrounding

ID: 1681924306299650049

Link: https://www.cs.cmu.edu/~katef/ · Joined: 20-07-2023 07:10:19

15 Tweets

667 Followers

59 Following

Katerina Fragkiadaki (@katerinafragiad):

Come join us in our CoRL workshop on generalist agents! How can we build robot generalists using learning from video, scaling up simulators, real robot exploration, generative AI and more? #FirstTweet

Katerina Fragkiadaki (@katerinafragiad):

Gabe's work on memory-augmented prompting of LLMs supports open-ended semantic parsing and task planning without forgetting. The model can store, retrieve and adapt instructed plans on the fly at deployment time!
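
A rough illustration of the store-and-retrieve loop the tweet describes (a minimal sketch; `PlanMemory`, `embed` and the toy plans are hypothetical placeholders, not the paper's actual code):

```python
import hashlib
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    """Toy deterministic text embedding; a real system would use a learned encoder."""
    seed = int(hashlib.sha256(text.encode()).hexdigest(), 16) % (2**32)
    v = np.random.default_rng(seed).standard_normal(dim)
    return v / np.linalg.norm(v)

class PlanMemory:
    """Stores instructed plans and retrieves the most similar ones at deployment."""
    def __init__(self):
        self.keys, self.plans = [], []

    def store(self, instruction: str, plan: str) -> None:
        self.keys.append(embed(instruction))
        self.plans.append((instruction, plan))

    def retrieve(self, instruction: str, k: int = 2):
        q = embed(instruction)
        sims = np.array([q @ key for key in self.keys])
        return [self.plans[i] for i in np.argsort(sims)[::-1][:k]]

memory = PlanMemory()
memory.store("make coffee", "1. fill kettle 2. boil water 3. pour over grounds")
memory.store("set the table", "1. fetch plates 2. fetch cutlery 3. arrange them")

# Retrieved plans are prepended to the LLM prompt as in-context examples, so
# new instructions are handled at deployment time without retraining, and old
# plans are never overwritten (hence no forgetting).
examples = memory.retrieve("brew some tea")
prompt = "\n".join(f"Instruction: {i}\nPlan: {p}" for i, p in examples)
print(prompt + "\nInstruction: brew some tea\nPlan:")
```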

Pushkal Katara (@pushkalkatara):

🤖🌐 How to scale up data across diverse tasks and environments for robot skill learning? Harness the power of language and vision generative models in simulation! Excited to share Gen2Sim, a step towards autonomous robotic skill acquisition in simulation. gen2sim.github.io
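
One way to read that pipeline: ask a language model to propose a task and a reward expression for a simulated asset, then use that reward to supervise skill learning. A minimal sketch under that reading (`llm`, `propose_task` and the prompt format are illustrative placeholders, not the Gen2Sim release):

```python
def llm(prompt: str) -> str:
    # Stand-in for a real LLM call; returns one hand-written example response.
    return ("task: open the microwave door\n"
            "reward: -abs(joint_angle - target_angle)")

def propose_task(asset_name: str) -> dict:
    prompt = (f"Asset: {asset_name}. Propose a manipulation task and a dense "
              f"reward expression over the asset's joint state.")
    task_line, reward_line = llm(prompt).splitlines()
    return {"task": task_line.split(": ", 1)[1],
            "reward_expr": reward_line.split(": ", 1)[1]}

spec = propose_task("microwave")
# The generated reward can then score states during RL in simulation:
reward = eval(spec["reward_expr"], {"abs": abs},
              {"joint_angle": 0.3, "target_angle": 1.2})
print(spec["task"], reward)
```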

Katerina Fragkiadaki (@katerinafragiad):

Check out our latest work on 3D feature field transformers for learning manipulation policies from demonstrations, with SOTA performance on RLBench.

Katerina Fragkiadaki (@katerinafragiad):

With ODIN, 3D perception benefits the most from 2D feature pre-training and can be used in the real world, beyond dataset-provided 3D meshes. Congrats to Ayush and the ODIN team!

Katerina Fragkiadaki (@katerinafragiad):

Very excited to speak tomorrow on unifying 2D/3D models of images, language and actions at multimodalitiesfor3dscenes.github.io. On Tuesday, I will talk about generative video perception at generative-vision.github.io/workshop-CVPR-… and memory-prompted 3D parsing at 3dcompat-dataset.org/workshop/C3DV2…. See you at #CVPR2025!

Katerina Fragkiadaki (@katerinafragiad):

Come chat with us today about Diffusion-ES, which combines evolutionary search with diffusion models for efficient planning! The code is also now publicly available.
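
A toy sketch of the combination as stated: evolutionary search whose mutation step renoises elite samples and denoises them with a diffusion model, so the search stays on the model's data manifold. The quadratic reward and the shrinkage "denoiser" below are stand-ins, not the paper's models:

```python
import numpy as np

rng = np.random.default_rng(0)
DIM, POP, N_ELITE, ITERS, NOISE = 8, 64, 8, 20, 0.5

def reward(x):
    # Task objective to maximize; a toy quadratic in place of a driving score.
    return -np.sum((x - 1.0) ** 2, axis=-1)

def denoise(x, noise_level):
    # Stand-in denoiser: shrink toward the data mean (0 here), the way a
    # trained diffusion model pulls noised samples back toward its data.
    return x / (1.0 + noise_level)

pop = rng.standard_normal((POP, DIM))                  # initial samples ~ prior
for _ in range(ITERS):
    elite = pop[np.argsort(reward(pop))[-N_ELITE:]]    # select by reward
    parents = elite[rng.integers(0, N_ELITE, size=POP)]
    noised = parents + np.sqrt(NOISE) * rng.standard_normal(parents.shape)
    pop = denoise(noised, NOISE)                       # mutate: renoise + denoise

best = pop[np.argmax(reward(pop))]
print("best reward:", float(reward(best[None])[0]))
```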

Katerina Fragkiadaki (@katerinafragiad):

We show that an LLM/VLM agent can transform plans into plan abstractions by adding language comments on preconditions, state changes and subgoals, using the VLM's knowledge and human feedback. In-context planning with retrieved abstractions sets a new SOTA across domains.
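
The shape of such an abstraction, going by the tweet's description, might look like the following (the field names and the `to_prompt` serializer are hypothetical, for illustration only):

```python
# A raw action sequence annotated with preconditions, state changes and subgoals.
abstraction = [
    {"action": "open(fridge)",
     "precondition": "fridge is closed; gripper is free",
     "state_change": "fridge is open",
     "subgoal": "gain access to the milk"},
    {"action": "pick(milk)",
     "precondition": "fridge is open; milk is visible",
     "state_change": "milk is in gripper",
     "subgoal": "acquire the milk"},
]

def to_prompt(steps):
    # Serialize retrieved abstractions into the prompt so the LLM can compose
    # a commented plan for a new instruction in-context.
    return "\n".join(
        f"{s['action']}  # pre: {s['precondition']} | effect: {s['state_change']}"
        f" | subgoal: {s['subgoal']}" for s in steps)

print(to_prompt(abstraction))
```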

Katerina Fragkiadaki (@katerinafragiad):

The Genesis physics engine has just been released, an incredible effort from Xian and his team. It is among the most general, fast, and easy-to-use physics engines, and we hope it will accelerate robotics research and beyond.
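
For reference, the hello-world from the public Genesis quickstart looked roughly like this at release (API details may have changed since; check the repo for the current version):

```python
import genesis as gs  # pip install genesis-world

gs.init(backend=gs.cpu)  # or gs.gpu for accelerated simulation

scene = gs.Scene(show_viewer=False)
plane = scene.add_entity(gs.morphs.Plane())
franka = scene.add_entity(
    gs.morphs.MJCF(file="xml/franka_emika_panda/panda.xml"))

scene.build()
for _ in range(1000):
    scene.step()
```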

Katerina Fragkiadaki (@katerinafragiad):

A thorough apples-to-apples speed comparison between Isaac Gym, MuJoCo MJX and Genesis in static and dynamic scenes, with and without collisions. We hope to have a paper ready within the next few months.

Katerina Fragkiadaki (@katerinafragiad):

We train UniDisc, a diffusion model that generates images and text jointly! Using diffusion instead of autoregression permits high controllability, including infilling in multimodal space, something not currently possible with any other multimodal generative model. Congrats to Alex and Mihir!
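
A toy illustration of why this enables infilling: image and text tokens share one sequence, any subset can be masked, and denoising fills the masked slots in parallel while conditioning on context from both sides. The uniform sampler below stands in for the trained denoiser; this shows only the control flow, not UniDisc's code:

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB, MASK = 100, -1

image_tokens = rng.integers(0, VOCAB, size=12)
text_tokens = rng.integers(0, VOCAB, size=6)
seq = np.concatenate([image_tokens, text_tokens])

# Mask a region that spans both modalities; an autoregressive model cannot
# condition such a fill on tokens to both its left and its right.
noisy = seq.copy()
noisy[8:14] = MASK

for _ in range(4):                           # iterative unmasking schedule
    holes = np.flatnonzero(noisy == MASK)
    if holes.size == 0:
        break
    # A trained denoiser would predict distributions for every masked slot
    # given all visible tokens; we sample uniformly to show the loop shape.
    n_fill = max(1, holes.size // 2)
    chosen = rng.choice(holes, size=n_fill, replace=False)
    noisy[chosen] = rng.integers(0, VOCAB, size=n_fill)

print(noisy)
```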

Katerina Fragkiadaki (@katerinafragiad):

When data is limited and training spans multiple epochs, discrete diffusion beats autoregressive models for text generation. Check out Mihir and Mengning’s thorough analysis:
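
A sketch of the intuition behind the multi-epoch result: an autoregressive model sees the identical prefix-to-next-token targets every epoch, while masked discrete diffusion resamples the corruption each time, so repeated passes over the same data yield fresh training views (illustration only, not the paper's experimental code):

```python
import numpy as np

rng = np.random.default_rng(0)
sequence = np.arange(10)          # one "document" of ten tokens

def ar_targets(seq):
    # AR training pairs are fixed: (prefix -> next token), every epoch.
    return [(tuple(seq[:i]), int(seq[i])) for i in range(1, len(seq))]

def diffusion_mask(seq, rng):
    # Masked-diffusion training draws a fresh mask ratio and mask per epoch.
    ratio = rng.uniform(0.1, 0.9)
    return rng.random(len(seq)) < ratio

print("AR epoch 1 == AR epoch 2:", ar_targets(sequence) == ar_targets(sequence))
m1, m2 = diffusion_mask(sequence, rng), diffusion_mask(sequence, rng)
print("diffusion masks differ:", not np.array_equal(m1, m2))
```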