Thodoris Kouzelis (@thkouz) Twitter Tweets • TwiCopy

Thodoris Kouzelis

@thkouz

1st year PhD Candidate Archimedes, Athena RC & NTUA

ID: 710153440144592896

Joined: 16-03-2016 17:19:08

13 Tweets

73 Followers

164 Following

Efstathios Karypidis

@k_sta8is

6 months ago

🧵 Excited to share our latest work: FUTURIST - A unified transformer architecture for multimodal semantic future prediction, is accepted to #CVPR2025 ! Here's how it works (1/n) 👇 Links to the arxiv and github below

102 likes · 4 replies · 26 retweets

Sander Dieleman

@sedielem

5 months ago

New blog post: let's talk about latents! sander.ai/2025/04/15/lat…

946 likes · 24 replies · 188 retweets

Thodoris Kouzelis

@thkouz

4 months ago

EQ-VAE is accepted at #ICML2025 😁. Grateful to my co-authors for their guidance and collaboration! Ioannis Kakogeorgiou, Spyros Gidaris, Nikos Komodakis.

25 likes · 0 replies · 4 retweets

Anastasios Gerontopoulos

@nasosger

4 months ago

1/n Multi-token prediction boosts LLMs (DeepSeek-V3), tackling key limitations of the next-token setup:
• Short-term focus
• Struggles with long-range decisions
• Weaker supervision
Prior methods add complexity (extra layers) 🔑 Our fix? Register tokens: elegant and powerful

134 likes · 3 replies · 17 retweets

Sander Dieleman

@sedielem

4 months ago

As I was saying: it's happening

716 likes · 8 replies · 46 retweets

Spyros Gidaris

@spyrosgidaris

3 months ago

I am at #CVPR2025 this week in Nashville! Presenting "Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers" on multi-modal semantic future prediction. Come discuss! Fri 13 Jun 10:30-12:30, poster #345 x.com/K_Sta8is/statu…

11 likes · 0 replies · 4 retweets

Shashank

@shawshank_v

2 months ago

Can open-data models beat DINOv2? Today we release Franca, a fully open-sourced vision foundation model. Franca with ViT-G backbone matches (and often beats) proprietary models like SigLIPv2, CLIP, DINOv2 on various benchmarks setting a new standard for open-source research🧵

256 likes · 11 replies · 52 retweets