Thodoris Kouzelis (@thkouz)'s Twitter Profile
Thodoris Kouzelis

@thkouz

1st-year PhD candidate at Archimedes, Athena RC & NTUA

ID: 710153440144592896

16-03-2016 17:19:08

13 Tweets

73 Followers

164 Following

Efstathios Karypidis (@k_sta8is)

🧵 Excited to share our latest work: FUTURIST, a unified transformer architecture for multimodal semantic future prediction, has been accepted to #CVPR2025! Here's how it works (1/n) 👇 Links to the arXiv and GitHub below

Anastasios Gerontopoulos (@nasosger)

1/n Multi-token prediction boosts LLMs (DeepSeek-V3), tackling key limitations of the next-token setup:
• Short-term focus
• Struggles with long-range decisions
• Weaker supervision

Prior methods add complexity (extra layers)
🔑 Our fix? Register tokens: elegant and powerful

Spyros Gidaris (@spyrosgidaris)

I am at #CVPR2025 this week in Nashville! Presenting "Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers" on multi-modal semantic future prediction. Come discuss! Fri 13 Jun 10:30-12:30, poster #345 x.com/K_Sta8is/statu…

Shashank (@shawshank_v)

Can open-data models beat DINOv2? Today we release Franca, a fully open-sourced vision foundation model. Franca with ViT-G backbone matches (and often beats) proprietary models like SigLIPv2, CLIP, DINOv2 on various benchmarks setting a new standard for open-source research🧵