Roy Ganz (@roy_ganz) 's Twitter Profile
Roy Ganz

@roy_ganz

ID: 1285533655092416512

linkhttps://royg27.github.io/ calendar_today21-07-2020 11:14:50

123 Tweet

136 Followers

452 Following

Marco Franzon (@mfranz_on) 's Twitter Profile Photo

FuseCap is one the the best Image Captioning models I have tried so far🀯 The main idea is: Visual experts extract meaningful information from images. This information is then fused with the original captions by an LLM Fuser, resulting in rich captions. This ends up in a very

FuseCap is one the the best Image Captioning models I have tried so far🀯

The main idea is:
Visual experts extract meaningful information from images. This  information is then fused with the original captions by an LLM Fuser,  resulting in rich captions.

This ends up in a very
Roy Ganz (@roy_ganz) 's Twitter Profile Photo

I am thrilled to announce that our work was accepted as SPOTLIGHT to #CVPR2025! The official code is available at github.com/amazon-science… (currently, code and checkpoints for inference. Training will be available soon). Amazon Science

AK (@_akhaliq) 's Twitter Profile Photo

Paint by Inpaint Learning to Add Image Objects by Removing Them First Image editing has advanced significantly with the introduction of text-conditioned diffusion models. Despite this progress, seamlessly adding objects to images based on textual instructions without

Paint by Inpaint

Learning to Add Image Objects by Removing Them First

Image editing has advanced significantly with the introduction of text-conditioned diffusion models. Despite this progress, seamlessly adding objects to images based on textual instructions without
Gradio (@gradio) 's Twitter Profile Photo

πŸ’‘New research: 𝐏𝐚𝐒𝐧𝐭 𝐛𝐲 𝐈𝐧𝐩𝐚𝐒𝐧𝐭 - Train model to add objects into images by first learning how to remove them. 🎨 Simplifies complex image editing. 🀩 The results are really impressive! Details & linksπŸ‘‡

πŸ’‘New research: 𝐏𝐚𝐒𝐧𝐭 𝐛𝐲 𝐈𝐧𝐩𝐚𝐒𝐧𝐭 - Train model to add objects into images by first learning how to remove them. 🎨 Simplifies complex image editing.

🀩 The results are really impressive! Details & linksπŸ‘‡
Amit Bracha (@amit_bracha) 's Twitter Profile Photo

πŸŽ‰ Exciting news! Our paper has been accepted to #ECCV2024 European Conference on Computer Vision #ECCV2026! Check out our project page for more details, including our results on the DTU dataset! Special thanks to MrNeRF and Zhenjun Zhao for sharing our work! πŸ™

Roy Ganz (@roy_ganz) 's Twitter Profile Photo

πŸŽ‰ Excited to share that Class-Conditioned Transformation for Enhanced Robust Image Classification, led by Tsachi Blau, has been accepted to #WACV25! πŸš€ CODIP boosts the robustness of trained models by leveraging the perceptually aligned gradients property, without training!

Aviad Aberdam (@aberdam_aviad) 's Twitter Profile Photo

Excited to share that our paper, "DocVLM: Make Your VLM an Efficient Reader," got accepted to CVPR! πŸŽ‰ Unlike general vision tasks, document understanding with Vision-Language Models demands high-resolution images, leading to a significant computational burden. #CVPR2025 #AI #LLM

Excited to share that our paper, "DocVLM: Make Your VLM an Efficient Reader," got accepted to CVPR! πŸŽ‰ Unlike general vision tasks, document understanding with Vision-Language Models demands high-resolution images, leading to a significant computational burden.
<a href="/CVPR/">#CVPR2025</a> #AI #LLM