Bill Psomas (@bill_psomas) 's Twitter Profile
Bill Psomas

@bill_psomas

Postdoctoral researcher @ VisualRecognitionGroup, @CVUTPraha. PhD @ntua. Former IARAI, @Inria, @athenaRICinfo intern. Photographer. Crossfit freak.

ID: 1394460390319341569

Website: http://users.ntua.gr/psomasbill/ · Joined: 18-05-2021 01:10:46

260 Tweets

342 Followers

795 Following

Christian Wolf (🦋🦋🦋) (@chriswolfvision) 's Twitter Profile Photo

At NAVER LABS Europe in Grenoble, France, we are searching for talented PhD interns for work on Spatial AI, geometric and robotic foundation models for navigation and manipulation. If you have experience in Embodied AI and are interested, DM me.

Bill Psomas (@bill_psomas) 's Twitter Profile Photo

🚀New paper alert: FREEDOM is here! Check out “Composed Image Retrieval for Training-FREE DOMain Conversion,” our training-free method for domain conversion with VLMs.🎯 📜WACV 2025 💡Retrieve images using image+text queries! 📖arxiv.org/abs/2412.03297 🔗github.com/NikosEfth/free…
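For context, composed image retrieval ranks a gallery against a query built from an image plus text. Below is a minimal sketch of that general setup, with a simple convex-combination fusion and random stand-in embeddings as illustrative assumptions; FREEDOM's training-free domain conversion itself works differently, so see the paper for the actual method.

```python
import numpy as np

def composed_query(image_emb: np.ndarray, text_emb: np.ndarray, alpha: float = 0.5) -> np.ndarray:
    """Fuse an image embedding with a target-domain text embedding into one query.
    A plain convex combination is used purely for illustration; it is not
    FREEDOM's conversion procedure."""
    q = alpha * image_emb + (1.0 - alpha) * text_emb
    return q / np.linalg.norm(q)

def retrieve(query: np.ndarray, gallery: np.ndarray, k: int = 10) -> np.ndarray:
    """Rank gallery embeddings by cosine similarity to the composed query."""
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    return np.argsort(-(g @ query))[:k]

# Toy usage with random stand-ins for CLIP-style embeddings (dimension 512).
rng = np.random.default_rng(0)
img_q, txt_q = rng.normal(size=512), rng.normal(size=512)
gallery = rng.normal(size=(1000, 512))
print(retrieve(composed_query(img_q, txt_q), gallery, k=5))
```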

Marcin Przewięźlikowski (@pszwnzl) 's Twitter Profile Photo

Self-supervised Learning with Masked Autoencoders (MAE) is known to produce worse image representations than Joint-Embedding approaches (e.g. DINO). In our new paper, we identify new reasons for why that is and point towards solutions: arxiv.org/abs/2412.03215 🧵

Efstathios Karypidis (@k_sta8is) 's Twitter Profile Photo

1/n 🚀 Excited to share our latest work: DINO-Foresight, a new framework for predicting the future states of scenes using Vision Foundation Model features! Links to the arXiv and Github 👇

Bill Psomas (@bill_psomas) 's Twitter Profile Photo

🚀Exciting news🚀 I’ve been awarded the Marie Skłodowska-Curie Postdoctoral Fellowship (#MSCA-PF) 2024 with 98/100!🎉 🥟My project, RAVIOLI, hosted at ČVUT v Praze, integrates retrieval-augmented predictions into vision-language models for open-vocabulary segmentation.

Shashank (@shawshank_v) 's Twitter Profile Photo

Excited to share that the recordings and slides of our SSLBIG tutorial are now online! If you notice any missing reference or have feedback, feel free to reach out. European Conference on Computer Vision #ECCV2026 Stay tuned for future editions! webpage: shashankvkt.github.io/eccv2024-SSLBI… Youtube: youtube.com/@SSLBiG_tutori…

Thodoris Kouzelis (@thkouz) 's Twitter Profile Photo

1/n🚀If you’re working on generative image modeling, check out our latest work! We introduce EQ-VAE, a simple yet powerful regularization approach that makes latent representations equivariant to spatial transformations, leading to smoother latents and better generative models.👇
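A rough sketch of what an equivariance regularizer on autoencoder latents can look like, assuming a generic encoder that maps images to spatial latent maps. The transform set (a horizontal flip here) and the way the penalty is applied are illustrative assumptions, not the EQ-VAE recipe.

```python
import torch
import torch.nn.functional as F

def equivariance_loss(encoder, x: torch.Tensor) -> torch.Tensor:
    """Penalize disagreement between 'transform then encode' and
    'encode then transform' for a spatial transform.

    encoder: any module mapping images (B, C, H, W) to latents (B, c, h, w).
    The transform here is a horizontal flip; EQ-VAE uses richer transforms
    (e.g. scaling) and applies them differently.
    """
    t = lambda z: torch.flip(z, dims=[-1])   # simple spatial transform
    z_of_tx = encoder(t(x))                  # encode the transformed image
    t_of_zx = t(encoder(x))                  # transform the latent map
    return F.mse_loss(z_of_tx, t_of_zx)

# This term would be added to the usual reconstruction (+ KL) objective, e.g.:
# loss = recon_loss + beta * kl + lambda_eq * equivariance_loss(encoder, x)
```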

Efstathios Karypidis (@k_sta8is) 's Twitter Profile Photo

🧵 Excited to share our latest work: FUTURIST - A unified transformer architecture for multimodal semantic future prediction, is accepted to #CVPR2025 ! Here's how it works (1/n) 👇 Links to the arxiv and github below

Giorgos Kordopatis-Zilos (@g_kordo) 's Twitter Profile Photo

ILIAS is a large-scale dataset for evaluation on Instance-Level Image retrieval At Scale. It is designed to support research in image-to-image and text-to-image retrieval for particular objects and serves as a benchmark for evaluating foundation models and retrieval techniques
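Evaluation on such a benchmark boils down to embedding-based nearest-neighbour search over a large gallery. A toy sketch of that loop follows; the metric (top-k recall) and the single-relevant-set setup are simplifications chosen here, not the benchmark's official protocol.

```python
import numpy as np

def recall_at_k(query_embs, gallery_embs, positives, k=10):
    """Fraction of queries whose relevant gallery items appear in the top-k.

    query_embs:   (Q, d) query embeddings (image or text encoder output)
    gallery_embs: (G, d) gallery image embeddings
    positives:    list of sets of relevant gallery indices, one set per query
    """
    q = query_embs / np.linalg.norm(query_embs, axis=1, keepdims=True)
    g = gallery_embs / np.linalg.norm(gallery_embs, axis=1, keepdims=True)
    hits = 0
    for qi, rel in enumerate(positives):
        topk = np.argsort(-(g @ q[qi]))[:k]
        hits += bool(rel.intersection(topk))
    return hits / len(positives)
```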

valeo.ai (@valeoai) 's Twitter Profile Photo

👏 Huge congrats to our research scientist Elias Ramzi for winning the AFRIF 2024 PhD award for his thesis "Robust image retrieval with deep learning", conducted at CNAM. Well deserved recognition for amazing work! 🏆 🔗 afrif.irisa.fr/?page_id=54

Bill Psomas (@bill_psomas) 's Twitter Profile Photo

🚀 Greeks in AI is booming! 200+ sign-ups, 30+ OpenReview submissions, and 🔥 sponsors joining daily. 📍Limited seats at Serafeio — register now: 👉 greeksin.ai Stay tuned for speakers, program, and abstract previews! #GreeksInAI #AI #ML #Research #Greece

Giorgos Kordopatis-Zilos (@g_kordo) 's Twitter Profile Photo

🚨 Call for Papers! 7th Instance-Level Recognition and Generation Workshop (ILR+G) at #ICCV2025 📍 Honolulu, Hawaii 🌺 📅 October 19–20, 2025 🌐 ilr-workshop.github.io/ICCVW2025/ in-proceedings deadline: June 7 out-of-proceedings deadline: June 30 #ICCV2025

Andrea Tagliasacchi 🇨🇦 (@taiyasaki) 's Twitter Profile Photo

Thank god that nobody submits papers to both #ICCV2025 and #NeurIPS2025. Writing rebuttals for one while working on the deadline for the other would be a total nightmare.

Yunzhi Zhang (@zhang_yunzhi) 's Twitter Profile Photo

(1/n) Time to unify your favorite visual generative models, VLMs, and simulators for controllable visual generation—Introducing a Product of Experts (PoE) framework for inference-time knowledge composition from heterogeneous models.
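The product-of-experts rule itself is compact: combine experts by multiplying their (unnormalized) densities, i.e. summing log-scores. A toy sketch over a discrete candidate set follows; the weighting scheme and the fixed candidate list are illustrative assumptions, since the paper's framework composes generative models at inference time rather than re-ranking a list.

```python
import numpy as np

def product_of_experts(log_scores: np.ndarray, weights=None) -> np.ndarray:
    """Combine expert opinions over the same discrete candidate set.

    log_scores: (num_experts, num_candidates) log p_i(x) up to a constant.
    Returns a distribution proportional to prod_i p_i(x)^{w_i}.
    """
    weights = np.ones(log_scores.shape[0]) if weights is None else np.asarray(weights, dtype=float)
    combined = (weights[:, None] * log_scores).sum(axis=0)
    combined -= combined.max()          # numerical stability before exponentiating
    p = np.exp(combined)
    return p / p.sum()

# e.g. one expert scores text-image consistency (a VLM), another scores physical
# plausibility (a simulator); candidates that satisfy both experts dominate.
```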

Maria Brbic (@mariabrbic) 's Twitter Profile Photo

Can we build multimodal models by simply aligning pretrained unimodal models with limited paired data? We introduce STRUCTURE 🏗️: a lightweight, plug-and-play regularizer that preserves latent geometry to align frozen unimodal models using <1% of paired data typically used in
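One way to read "preserves latent geometry": while training lightweight adapters on top of frozen encoders with few paired samples, add a term that keeps each batch's pairwise-similarity structure close to what the frozen model produced. The sketch below follows that reading; the variable names and the exact regularizer are assumptions for illustration, not the paper's definition of STRUCTURE.

```python
import torch
import torch.nn.functional as F

def geometry_preserving_loss(z_frozen: torch.Tensor, z_adapted: torch.Tensor) -> torch.Tensor:
    """Keep the batch's pairwise cosine-similarity structure after adaptation.

    z_frozen:  (B, d)  embeddings from the frozen unimodal encoder
    z_adapted: (B, d') embeddings after the lightweight adapter
    """
    s_frozen = F.normalize(z_frozen, dim=-1) @ F.normalize(z_frozen, dim=-1).T
    s_adapted = F.normalize(z_adapted, dim=-1) @ F.normalize(z_adapted, dim=-1).T
    return F.mse_loss(s_adapted, s_frozen)

# Sketch of a total objective: a contrastive alignment loss on the few paired
# samples plus this regularizer applied per modality:
# loss = clip_style_loss(img_adapted, txt_adapted) \
#        + lam * (geometry_preserving_loss(img_frozen, img_adapted)
#                 + geometry_preserving_loss(txt_frozen, txt_adapted))
```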

Shashank (@shawshank_v) 's Twitter Profile Photo

New paper out - accepted at #ICCV2025 We introduce MoSiC, a self-supervised learning framework that learns temporally consistent representations from video using motion cues. Key idea: leverage long-range point tracks to enforce dense feature coherence across time.🧵
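The key idea is concrete enough to sketch: sample point tracks across frames, pull the dense feature at each tracked location, and penalize features along the same track for drifting apart over time. Tensor layouts, the bilinear sampling, and the cosine penalty below are illustrative assumptions, not MoSiC's actual objective.

```python
import torch
import torch.nn.functional as F

def track_consistency_loss(feats: torch.Tensor, tracks: torch.Tensor) -> torch.Tensor:
    """Encourage features sampled along each point track to stay consistent over time.

    feats:  (T, C, H, W) dense feature maps for T frames of one video
    tracks: (T, N, 2)    track coordinates in [-1, 1] (grid_sample convention)
    """
    # Sample one feature vector per track per frame: (T, C, 1, N) -> (T, N, C)
    sampled = F.grid_sample(
        feats, tracks.unsqueeze(1), mode="bilinear", align_corners=False
    ).squeeze(2).permute(0, 2, 1)
    sampled = F.normalize(sampled, dim=-1)
    anchor = sampled[0]                                   # features in the first frame
    # Cosine distance between each later frame's track features and the anchor.
    return (1.0 - (sampled[1:] * anchor).sum(dim=-1)).mean()
```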

Marcin Przewięźlikowski (@pszwnzl) 's Twitter Profile Photo

Our paper "Beyond [cls]: Exploring the True Potential of Masked Image Modeling Representations" has been accepted to #ICCV2025! 🧵 TL;DR: Masked image models (like MAE) underperform not just because of weak features, but because they aggregate them poorly. [1/7]

Our paper "Beyond [cls]: Exploring the True Potential of Masked Image Modeling Representations" has been accepted to <a href="/ICCVConference/">#ICCV2025</a>!

🧵 TL;DR: Masked image models (like MAE) underperform not just because of weak features, but because they aggregate them poorly.

[1/7]