Hamed Shirzad (@hamedshirzad13) 's Twitter Profile
Hamed Shirzad

@hamedshirzad13

PhD student @UBC_CS

Interested in Machine Learning on Graphs

ID: 1550221567174488064

Website: https://www.hamedshirzad.com/ · Joined: 21-07-2022 20:50:08

105 Tweets

182 Followers

247 Following

Yi (Joshua) Ren (looking for postdoc @ICLR) (@joshuarenyi) 's Twitter Profile Photo

Poster at Compositional Learning Workshop @ NeurIPS2024
Exploring why simplicity bias in deep learning often aligns with compositionality. A fresh perspective through the lens of learning dynamics. (1/5)

🗓️Sunday, Dec 15
⏰8:30 - 17:00
📍West Meeting Room 118-120

Mark Schmidt (@markschmidtubc) 's Twitter Profile Photo

For those still at NeurIPS, check out my student's posters at the OPT workshop at 3pm:
- Line search with constant sharpness.
- Proof line search can beat acceleration.
- Allowing negative step sizes?
- BCD for deep nets?
- Normalization in graph neural net codes makes no sense.

Peymàn M. Kiasari (@pkiasari) 's Twitter Profile Photo

If you are interested in generalization in deep learning, you can find our poster in the poster session right now at #AAAI2025 : )

Hamed Shirzad (@hamedshirzad13) 's Twitter Profile Photo

Happy to see sparse attention methods taking off! We've also worked on sparse attention for learning on graphs (Exphormer, Spexphormer papers) — convincing reviewers about sparse attention (and not just flash attention) wasn’t easy.

Kumo (@kumo_ai_team) 's Twitter Profile Photo

What makes Graph Transformers a perfect fit for relational data? In our latest blog, Federico López breaks down how Kumo integrates Graph Transformers to elevate predictive modeling—eliminating the need for complex pipelines and manual feature engineering. kumo.ai/research/graph…

Yi (Joshua) Ren (looking for postdoc @ICLR) (@joshuarenyi) 's Twitter Profile Photo

📢Curious why your LLM behaves strangely after long SFT or DPO? We offer a fresh perspective—consider doing a "force analysis" on your model’s behavior. Check out our #ICLR2025 Oral paper: Learning Dynamics of LLM Finetuning! (0/12)

Kazem Meidani (@kazemmeidani) 's Twitter Profile Photo

Can’t attend ICLR 🇸🇬 due to visa issues, but Chandan Reddy will give the oral presentation of *LLM-SR* on Friday 👇🏻 + see our new preprint on benchmarking the capabilities of LLMs for scientific equation discovery, *LLM-SRBench*: arxiv.org/abs/2504.10415

Jure Leskovec (@jure) 's Twitter Profile Photo

🚀 New tutorial on Graph Transformers! We introduce a powerful architecture combining the strengths of GNNs and Transformers for structured data. Learn how GTs open new possibilities for graph learning. kumo.ai/research/intro… #GraphTransformers #MachineLearning #AI #GNN

Hamed Shirzad (@hamedshirzad13) 's Twitter Profile Photo

Huge congrats to my labmate Yi (Joshua) Ren and my supervisor Danica Sutherland for receiving an Outstanding Paper Award at ICLR 2025 for their work on Learning Dynamics of LLM Finetuning! So proud to see their amazing research recognized 👏🔥

Hamed Shirzad (@hamedshirzad13) 's Twitter Profile Photo

Thank you Graph Signal Processing Workshop 2025 for the opportunity to give the morning keynote! It was a pleasure to present my work on Exphormer and Spexphormer models to such an insightful audience.

Valence Labs (@valence_ai) 's Twitter Profile Photo

1/ At Valence Labs, Recursion's AI research engine, we’re focused on advancing drug discovery outcomes through cutting-edge computational methods.

Today, we're excited to share our vision for building virtual cells, guided by the predict-explain-discover framework 🧵

Hamed Shirzad (@hamedshirzad13) 's Twitter Profile Photo

Loved working on the TxPert project! It's also exciting to see my PhD work on Graph Transformers (Exphormer model) finding such meaningful application in a critical real-world task.

Valence Labs (@valence_ai) 's Twitter Profile Photo

1/ Introducing TxPert: a new model that predicts transcriptional responses across diverse biological contexts It’s designed to generalize across unseen single-gene perturbations, novel combinations of gene perturbations, and even new cell types 🧵

Ali Behrouz (@behrouz_ali) 's Twitter Profile Photo

What makes attention the critical component for most advances in LLMs, and what holds back long-term memory modules (RNNs)? Can we strictly generalize Transformers?

Presenting Atlas (A powerful Titan): a new architecture with long-term in-context memory that learns how to

Mo Lotfollahi (@mo_lotfollahi) 's Twitter Profile Photo

I hope your families are safe. Our eyes are filled with tears for Iran, for the children and innocent people who have been killed. To all Iranian students in AI, bioinformatics, and computational biology whose studies or research have been disrupted: email me for

Erfan Loghmani (@loghmanierfan) 's Twitter Profile Photo

Can we align language models using observational data instead of costly experiments like A/B tests? In my latest research, we find that historical observational data *does* carry useful signals, but without causal care, models can learn the wrong things. Thread 🧵

Peymàn M. Kiasari (@pkiasari) 's Twitter Profile Photo

We’re presenting our work today at #ICML2025, Poster Session 4 West, W-214! If you are interested in computer vision reasoning and multimodal LLMs come visit us!

Hamed Shirzad (@hamedshirzad13) 's Twitter Profile Photo

Great attending ICML this year, really enjoyed connecting with the graph learning community! Thanks a lot to Christopher Morris for organizing the lovely dinner :)

Hamed Shirzad (@hamedshirzad13) 's Twitter Profile Photo

Enjoyed giving our tutorial on Geometric & Topological Deep Learning at IEEE MLSP 2025 alongside Semih Cantürk. Loving the Istanbul vibes and the amazing food here! ✅
