Fernando Pérez-García (@fepegar_) 's Twitter Profile
Fernando Pérez-García

@fepegar_

Senior Research Machine Learning Engineer @MSFTResearch. Dev @TorchIOLib. He/him. bsky.app/profile/fepega…

ID: 2910678111

linkhttp://www.fepegar.com calendar_today08-12-2014 10:04:58

994 Tweet

1,1K Followers

3,3K Following

Saksham Suri (@_sakshams_) 's Twitter Profile Photo

We introduce LiFT, an easy to train, lightweight, and efficient feature upsampler to get dense ViT features without the need to retrain the ViT. Visit our poster European Conference on Computer Vision #ECCV2026 #eccv2024 in Milan on Oct 1st (Tuesday), 16:30 (local), Poster: 79. Project Page: cs.umd.edu/~sakshams/LiFT

We introduce LiFT, an easy to train, lightweight, and efficient feature upsampler to get dense ViT features without the need to retrain the ViT.

Visit our poster <a href="/eccvconf/">European Conference on Computer Vision #ECCV2026</a> #eccv2024 in Milan on Oct 1st (Tuesday), 16:30 (local), Poster: 79. Project Page: cs.umd.edu/~sakshams/LiFT
Fernando Pérez-García (@fepegar_) 's Twitter Profile Photo

I heard at #ECCV2024 multiple people complaining that European Conference on Computer Vision #ECCV2026 is mostly about vision–language machine learning, and that it’s too large and should be broken up into smaller conferences. I see analogous issues with e.g. NeurIPS Conference. Why does this happen? What is the solution?

Curt Langlotz (@curtlanglotz) 's Twitter Profile Photo

Congratulations Geoffrey Hinton on the Nobel Prize for “discoveries and inventions that enable machine learning with artificial neural networks”! No doubt your #MRI (read by a human?) can be rescheduled. Would love to hear your latest thoughts on how AI will affect radiologists.

Simon Willison (@simonw) 's Twitter Profile Photo

Confession: despite all of the debates about whether or not an LLM can "reason", I still don't really understand exactly what the term "reasoning" means So just like with "agents" and "AI" itself, I'm not sure the people engaged in those debates are talking about the same thing

Yann LeCun (@ylecun) 's Twitter Profile Photo

When training a visual encoder with self-supervised learning, we know for a fact that using a decoder with a reconstruction loss doesn't work nearly as well as using a joint embedding architecture with feature prediction loss and a collapse prevention mechanism. This paper from

François Chollet (@fchollet) 's Twitter Profile Photo

I remember doing a little speech to a number of TF team members at my desk back in 2019 -- about why they should think long-term, about how I wanted my future children (not even in the plans at the time) to eventually learn deep learning with Keras, 20 or so years in the future.

Massimo (@rainmaker1973) 's Twitter Profile Photo

Today is public holiday in Spain and literally thousands of people in Valencia came out to help the victims of the October 29 disaster armed with shovels and brooms. A flood of people against the flood of water.

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Mark Saroufim Probably not what you want to hear but docs 😅. Actual real life examples. Better and more comprehensive kwarg docs. More helpful links to actual code not just wrapper of wrapper of wrapper code. Example code of larger apps showing best practices (style of torch titan, nanoGPT or

Jaime Gómez-Obregón (@jaimeobregon) 's Twitter Profile Photo

En mi empresa obligué a todos a usar git. Fue en 2014. Al principio esto molestó a los compañeros de administración, marketing, comunicación, gestión interna… «¿Por qué tengo que aprender git, si yo no soy un técnico?» git extendió a toda la organización las mismas ventajas

FFmpeg (@ffmpeg) 's Twitter Profile Photo

A major blocker to free and open source multimedia is ISO. ISO standards are behind an expensive paywall, making them impossible to access for volunteer developers in FFmpeg, or anyone wanting to learn about multimedia technology

Open Life Science AI (@openlifesciai) 's Twitter Profile Photo

🚨 Medical AI Research Alert! 🚨 How can segmentation improve radiology report generation? The team at Microsoft Research presents MAIRA-Seg: a Segmentation-Aware Multimodal LLM designed to elevate chest X-ray report generation. 🚀 📌 By Harshita Sharma, Valentina Salvatelli,

Gabriel Peyré (@gabrielpeyre) 's Twitter Profile Photo

Optimal transport computes an interpolation between two distributions using an optimal coupling. Flow matching, on the other hand, uses a simpler “independent” coupling, which is the product of the marginals.

Peter Lee (@peteratmsr) 's Twitter Profile Photo

Important advance in AI medical imaging by pretraining solely on unimodal biomedical imaging data. This achieves similar or greater performance than state-of-the-art biomedical-language-supervised models on a diverse range of benchmarks.

Peter Lee (@peteratmsr) 's Twitter Profile Photo

We're proud Microsoft Research to be collaborating with Mayo Clinic to advance the state-of-the-art in multimodal AI radiology by bringing AI innovation to real-world clinical use in a way that scales.