Vittorio Ferrari (@vittoferraricv)'s Twitter Profile
Vittorio Ferrari

@vittoferraricv

Director of Science at Synthesia.io

ID: 1275184138664976384

Link: https://sites.google.com/view/vittoferrari
Joined: 22-06-2020 21:50:10

60 Tweets

5.5K Followers

12 Following

Vittorio Ferrari (@vittoferraricv):

Four papers accepted to #ICCV2023!
1/4

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

arxiv.org/abs/2306.09224

Dataset release coming soon!

With @tejmensink, Jasper Uijlings, Lluís Castrejón, Arushi Goel, Cadar, Howard Zhou, Fei Sha, A. Araujo
Vittorio Ferrari (@vittoferraricv):

Four papers accepted to #ICCV2023!
3/4

Agile Modeling: From Concept to Classifier in Minutes

We empower any user to develop a classifier for a subjective visual concept in under 30 minutes.

arxiv.org/abs/2302.12948

With O. Stretcu, E. Vendrow, and many others
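
For readers curious what a "classifier in minutes" workflow looks like mechanically, the sketch below shows the generic pattern such systems tend to rely on: a frozen image embedding plus a lightweight head trained on a handful of user labels. This is only an illustration of that pattern, not the paper's actual pipeline, and the random vectors stand in for real image embeddings.

import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Placeholder "image embeddings": in a real system these would come from a
# frozen vision backbone, not random numbers.
positives = rng.normal(loc=+0.5, size=(20, 512))   # images the user accepted as the concept
negatives = rng.normal(loc=-0.5, size=(20, 512))   # images the user rejected

X = np.vstack([positives, negatives])
y = np.array([1] * len(positives) + [0] * len(negatives))

# A lightweight linear head on top of frozen features is what makes a
# "minutes, not days" turnaround plausible.
clf = LogisticRegression(max_iter=1000).fit(X, y)

# Score a new (placeholder) embedding for the subjective concept.
query = rng.normal(size=(1, 512))
print("P(concept) =", clf.predict_proba(query)[0, 1])
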
Vittorio Ferrari (@vittoferraricv):

Four papers accepted to #ICCV2023!
2/4

CAD-Estate: Large-scale CAD Model Annotation in RGB Videos

>100k 3D objects annotated on RGB videos of complex scenes

Dataset release coming soon!

arxiv.org/abs/2306.09011

With Stefan Popov, Kevis-Kokitsi Maninis, Matthias Niessner

Vittorio Ferrari (@vittoferraricv):

We released our new “Encyclopedic VQA” dataset, which contains visual questions about detailed properties of fine-grained categories (1M VQA triplets total!). These pose a hard challenge for large foundation models.

arxiv.org/abs/2306.09224

github.com/google-researc…
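
To make the notion of a "VQA triplet" concrete, the snippet below sketches one as an (image, question, answer) record. The field names and values here are hypothetical, chosen only for illustration; the actual schema is defined by the released dataset in the GitHub repository linked above.

import json

# Hypothetical example of a single triplet; real field names are set by the
# released dataset, not by this sketch.
triplet = {
    "image_id": "example_0001",
    "question": "In which year was this church built?",
    "answer": "1887",
    "entity": "fine-grained landmark category the question is about",
}

# A 1M-triplet dataset would typically be streamed record by record
# (e.g. JSONL) rather than loaded into memory at once.
line = json.dumps(triplet)
record = json.loads(line)
print(record["question"], "->", record["answer"])
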
Emanuele Bugliarello (@ebugliarello):

Wouldn’t it be cool if AI could help us generate movies?🎬
We built a new benchmark to measure progress in this direction🍿

“StoryBench: A Multifaceted Benchmark for Continuous Story Visualization”

📄 arxiv.org/abs/2308.11606
👩‍💻 github.com/google/storybe…
📈 paperswithcode.com/dataset/storyb…
Vittorio Ferrari (@vittoferraricv):

Check out CAD-Estate: a large dataset with 3D object and room layout annotations on RGB videos of complex multi-object scenes (101k objects in total!).

github.com/google-researc…
arxiv.org/abs/2306.09011
arxiv.org/abs/2306.09077

With Stefan Popov, Kevis-Kokitsi Maninis, Matthias Niessner
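
To picture what a CAD annotation on video amounts to: placing a CAD model in a scene typically means storing a per-object pose (rotation, translation, scale) that maps model coordinates into the scene's coordinate frame. The snippet below is a generic illustration of that transform with made-up values; it is not CAD-Estate's actual file format.

import numpy as np

# Hypothetical annotation for one object: rotation, translation and scale
# that place a CAD model into the scene.
R = np.eye(3)                      # 3x3 rotation
t = np.array([0.5, 0.0, 2.0])      # translation in metres
s = 1.2                            # isotropic scale

# A few vertices of the CAD model in its own (object-centric) coordinates.
model_points = np.array([[0.0, 0.0, 0.0],
                         [0.1, 0.0, 0.0],
                         [0.0, 0.2, 0.0]])

# Transform model points into the scene: x_scene = s * R @ x_model + t
scene_points = (s * (R @ model_points.T)).T + t
print(scene_points)
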

Vittorio Ferrari (@vittoferraricv):

I am happy to share that I have joined Synthesia as Director of Science. Excited to start this new adventure! x.com/synthesiaIO/st…

Vittorio Ferrari (@vittoferraricv):

Three papers accepted to #NeurIPS
1/3

StoryBench: a new benchmark for text-to-video generation of stories to guide progress in assistive technology for filmmaking 🧑‍🎨

arxiv.org/abs/2308.11606
github.com/google/storybe…
x.com/ebugliarello/s…

With Emanuele Bugliarello, Hernan Moraldo, many others
Vittorio Ferrari (@vittoferraricv):

Three papers accepted to #NeurIPS
2/3

"Estimating Generic 3D Room Structures from 2D Annotations"

3D room layout annotations for 2246 videos (part of the CAD-Estate dataset).

arxiv.org/abs/2306.09077
github.com/google-researc…

With Denys Rozumnyi, Stefan Popov, Kevis-Kokitsi Maninis, Matthias Niessner

Vittorio Ferrari (@vittoferraricv):

Three papers accepted to #NeurIPS
3/3

NAVI: a dataset of image collections of objects, along with high-quality 3D object scans, near-perfect 2D-3D alignments, and accurate camera parameters.

arxiv.org/abs/2306.09109
navidataset.github.io

With Varun Jampani, Kevis-Kokitsi Maninis, others
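
The "near-perfect 2D-3D alignments" mentioned above come down to knowing, per image, camera parameters that project the 3D scan onto the pixels. The snippet below shows a standard pinhole projection as a generic illustration; the intrinsics and pose values are made up, and the dataset's own tooling should be used for real work.

import numpy as np

# Made-up pinhole intrinsics (focal lengths and principal point, in pixels).
K = np.array([[1000.0,    0.0, 320.0],
              [   0.0, 1000.0, 240.0],
              [   0.0,    0.0,   1.0]])

# Made-up world-to-camera pose: rotation R and translation t.
R = np.eye(3)
t = np.array([0.0, 0.0, 2.5])

# A few 3D points from an object scan, in world coordinates.
points_world = np.array([[ 0.0, 0.0,  0.0],
                         [ 0.1, 0.1,  0.0],
                         [-0.1, 0.05, 0.1]])

# Project: x_cam = R @ X + t, then divide by depth and apply intrinsics.
points_cam = (R @ points_world.T).T + t
uv = (K @ points_cam.T).T
uv = uv[:, :2] / uv[:, 2:3]        # pixel coordinates (u, v)
print(uv)
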
Vittorio Ferrari (@vittoferraricv):

We are running the Vision and Sports Summer School again this year! Prague, July 22-27. We offer a broad range of lectures on state-of-the-art computer vision techniques, as well as exciting sports activities such as volleyball, frisbee, and table tennis. cmp.felk.cvut.cz/summerschool20…

Vittorio Ferrari (@vittoferraricv):

Paper accepted to #CVPR2024!

Grounding Everything: Emerging Localization Properties in Vision-Language Transformers

Paper: arxiv.org/abs/2312.00878
Demo: huggingface.co/spaces/WalidBo…
Code: github.com/WalBouss/GEM

With Walid Bousselham, Felix Petersen, Hilde Kuehne
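
The phenomenon studied in this paper can be pictured as follows: if a vision-language model provides per-patch image embeddings and a text embedding in the same space, cosine similarity between them already yields a coarse localization heatmap. The sketch below illustrates only that basic idea, with random tensors as placeholder embeddings; the actual GEM method (see the code link above) goes well beyond this.

import torch
import torch.nn.functional as F

# Placeholder embeddings: in practice these come from a vision-language
# transformer (per-patch image tokens and an encoded text prompt).
num_patches, dim = 14 * 14, 512
patch_emb = torch.randn(num_patches, dim)
text_emb = torch.randn(dim)

# Cosine similarity between every patch and the text query.
sim = F.cosine_similarity(patch_emb, text_emb.unsqueeze(0), dim=-1)

# Reshape to the patch grid to obtain a coarse heatmap over the image.
heatmap = sim.reshape(14, 14)
print(heatmap.argmax())   # index of the patch that best matches the text
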
Vittorio Ferrari (@vittoferraricv):

Introducing HAMMR: hierarchical multimodal agents that handle a broad range of VQA tasks within a single system (counting, spatial reasoning, OCR, visual pointing, external knowledge, and more).

arxiv.org/abs/2404.05465

With Lluís Castrejón, @tejmensink, Howard Zhou, André Araujo, Jasper Uijlings
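
As a rough mental model of "hierarchical multimodal agents": a top-level agent inspects the question and delegates to a specialized tool (counting, OCR, knowledge lookup, and so on). The toy router below only illustrates that control flow with stub tools; it is not the HAMMR implementation.

# Stub tools standing in for specialized agents.
def count_objects(image, question):
    return "3"              # a real tool would run a detector and count boxes

def read_text(image, question):
    return "STOP"           # a real tool would run OCR on the image

def lookup_knowledge(image, question):
    return "built in 1887"  # a real tool would query an external knowledge base

TOOLS = {
    "count": count_objects,
    "ocr": read_text,
    "knowledge": lookup_knowledge,
}

def route(image, question):
    """Toy top-level agent: pick a tool from keywords in the question."""
    q = question.lower()
    if "how many" in q:
        return TOOLS["count"](image, question)
    if "say" in q or "written" in q:
        return TOOLS["ocr"](image, question)
    return TOOLS["knowledge"](image, question)

print(route(image=None, question="How many dogs are in the picture?"))
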
Synthesia 🎥 (@synthesiaio):

AI Avatars have learned to interpret text now. 😬 Our soon-to-be-public EXPRESS-1 AI model enables Synthesia avatars to understand and adjust to the script automatically. 🤯 Join the pre-launch tech chat with: Victor Riparbelli, Matthias Niessner & Jon Starck 👀 x.com/i/spaces/1YpJk…

Vittorio Ferrari (@vittoferraricv):

Our EXPRESS-1 AI model enables @Synthesiaio avatars to understand and adjust to the script automatically 💥

This is a big milestone, so tune in tomorrow for a pre-launch chat with Matthias Niessner, Jon Starck, Victor Riparbelli and @AlexVoica

X Spaces event link: x.com/i/spaces/1YpJk…
Vittorio Ferrari (@vittoferraricv):

Come to poster 354 at #CVPR2024 to see our work! 10:30am today, Arch 4A-E

"Grounding Everything: Emerging Localization Properties in Vision-Language Transformers"

Paper: arxiv.org/abs/2312.00878
Demo: huggingface.co/spaces/WalidBo…
Code: github.com/WalBouss/GEM
Vittorio Ferrari (@vittoferraricv):

Paper accepted to the “Multimodal Algorithmic Reasoning” NeurIPS workshop!

HAMMR: Hierarchical multimodal agents for handling many diverse VQA tasks in a single system

arxiv.org/abs/2404.05465

<a href="/LluisCastrejon/">Lluís Castrejón</a> @tejmensink <a href="/howardzzh/">Howard Zhou</a> <a href="/andrefaraujo/">André Araujo</a> <a href="/JRRU/">Jasper Uijlings</a>
Vittorio Ferrari (@vittoferraricv):

I am happy to announce that I have joined Meta Reality Labs as a Principal Research Scientist, working on Spatial AI to power AR/MR experiences on Meta's wearable devices. It's the start of another adventure, and I thank all my new colleagues for making me feel welcome!