Tanveer Hannan (@hannan_tanveer)'s Twitter Profile
Tanveer Hannan

@hannan_tanveer

PhD student at LMU Munich, working on multimodal deep learning for computer vision and NLP

ID: 2241926006

Link: https://tanveer81.github.io/ · Joined: 12-12-2013 06:43:51

122 Tweets

37 Followers

223 Following

lmarena.ai (formerly lmsys.org) (@lmarena_ai)'s Twitter Profile Photo

News: Qwen Qwen-Max jumps to #7, surpassing DeepSeek-v3! 🔥

Highlights:
- Matches top proprietary models (GPT-4o/Sonnet 3.5)
- +30 pts vs DeepSeek-v3 in coding, math, and hard prompts

ChatGLM GLM-4-Plus also breaks into the top 10; Chinese AI companies are closing the gap
Tanveer Hannan (@hannan_tanveer)'s Twitter Profile Photo

I recently reviewed Transformer², a novel approach to self-adaptive AI. The model dynamically adjusts its weight matrices for task-specific optimization using reinforcement learning, marking a significant advancement in adaptive LLMs. sakana.ai/transformer-sq… #AI #MachineLearning
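The tweet doesn't spell out the mechanism, so for context: the Transformer² paper describes "Singular Value Fine-tuning", where a frozen weight matrix is adapted by rescaling its singular values with a small task-specific vector trained via RL. A minimal PyTorch sketch of that idea; the names and shapes are illustrative, not from the paper's code:

```python
import torch

def svf_adapt(W: torch.Tensor, z: torch.Tensor) -> torch.Tensor:
    """Rescale the singular values of a frozen weight matrix W by a
    task-specific vector z (one entry per singular value)."""
    U, S, Vh = torch.linalg.svd(W, full_matrices=False)
    return U @ torch.diag(S * z) @ Vh

# Toy usage: the base weight stays frozen; only z is trained
# (with reinforcement learning, per the paper's description).
W = torch.randn(64, 64)
z = torch.ones(64, requires_grad=True)
W_task = svf_adapt(W, z)
```

Because z has only as many entries as singular values, the per-task parameter count stays tiny compared to full fine-tuning, which is what makes the dynamic, task-specific switching practical.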

Brett Adcock (@adcock_brett)'s Twitter Profile Photo

Google added infinite memory to Gemini, allowing it to remember and refer to past interactions while answering. It's available in Gemini Advanced and can be tweaked by editing/deleting chats. OpenAI is also working on a similar feature, but there is no release yet.

Andrej Karpathy (@karpathy)'s Twitter Profile Photo

Okay so I didn't super expect the results of the GPT4 vs. GPT4.5 poll from earlier today 😅, of this thread: x.com/karpathy/statu…

✅ Question 1: GPT4.5 is A; 56% of people prefer it.
❌ Question 2: GPT4.5 is B; 43% of people prefer it.
❌ Question 3: GPT4.5 is A; 35% of people

AK (@_akhaliq)'s Twitter Profile Photo

VisualThinker-R1-Zero

R1-Zero's Aha Moment on just a 2B non-SFT Model

VisualThinker-R1-Zero is a replication of DeepSeek-R1-Zero in visual reasoning. It successfully observes the emergent “aha moment” and increased response length in visual reasoning on just a 2B non-SFT model.
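For readers unfamiliar with the R1-Zero recipe being replicated: training starts from a base (non-SFT) model and relies on simple rule-based rewards rather than a learned reward model. A hedged sketch of what such rewards often look like; the exact tags, weighting, and matching logic here are assumptions, not taken from the VisualThinker-R1-Zero release:

```python
import re

def format_reward(response: str) -> float:
    """Reward responses that follow a <think>...</think><answer>...</answer>
    template (illustrative pattern, not the project's exact one)."""
    pattern = r"^<think>.*?</think>\s*<answer>.*?</answer>$"
    return 1.0 if re.match(pattern, response.strip(), re.DOTALL) else 0.0

def accuracy_reward(response: str, gold: str) -> float:
    """Exact-match check on the extracted answer (a simplification;
    real setups often use task-specific verifiers)."""
    m = re.search(r"<answer>(.*?)</answer>", response, re.DOTALL)
    return 1.0 if m and m.group(1).strip() == gold.strip() else 0.0

def total_reward(response: str, gold: str) -> float:
    # Equal weighting is an assumption for this sketch.
    return accuracy_reward(response, gold) + format_reward(response)
```

The "aha moment" refers to the model spontaneously lengthening its reasoning traces and revisiting its own steps under this kind of sparse, rule-based reward, without any supervised fine-tuning beforehand.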
Tanveer Hannan (@hannan_tanveer)'s Twitter Profile Photo

Check out the #CVPR2025 paper on long video understanding. It achieves SOTA with a much simpler and more efficient end-to-end approach.

Tanveer Hannan (@hannan_tanveer)'s Twitter Profile Photo

Effective long-context comprehension remains a significant hurdle for LLMs. Meta's forthcoming Llama 4 aims to address this with its iRoPE architecture. I am looking forward to testing it in more real-life settings such as streaming video.
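Hedged background, since Llama 4 was unreleased at the time of this tweet: public descriptions of iRoPE involve interleaving standard RoPE attention layers with layers that use no positional encoding at all, which is meant to help length generalization. A toy PyTorch sketch of that interleaving pattern; the every-fourth-layer schedule and helper names are assumptions:

```python
import torch

def rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Standard rotary position embedding on x of shape (seq_len, dim);
    dim must be even."""
    seq_len, dim = x.shape
    pos = torch.arange(seq_len, dtype=torch.float32)[:, None]
    freqs = base ** (-torch.arange(0, dim, 2, dtype=torch.float32) / dim)
    angles = pos * freqs                      # (seq_len, dim/2)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[:, 0::2], x[:, 1::2]
    out = torch.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin        # rotate each 2-D pair
    out[:, 1::2] = x1 * sin + x2 * cos
    return out

def maybe_rope(x: torch.Tensor, layer_idx: int, every: int = 4) -> torch.Tensor:
    """Interleaving sketch: skip positional encoding on every `every`-th
    layer (the 'NoPE' layers), apply RoPE elsewhere."""
    return x if layer_idx % every == every - 1 else rope(x)
```

The intuition is that the position-free layers carry content-based, length-agnostic attention, while the RoPE layers keep local ordering sharp.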

Laura Leal-Taixe (@lealtaixe)'s Twitter Profile Photo

The time for new architectures is over? Not quite! SeNaTra, a native segmentation backbone, is waiting; let's see how it works 🧵 arxiv.org/abs/2505.16993

Mohaiminul (Emon) Islam (@mmiemon)'s Twitter Profile Photo

Had a great time presenting at the GenAI session @CiscoMeraki, thanks Nahid Alam @ CVPR 2025 for the invite 🙏

Catch us at #CVPR2025:
📌 BIMBA: arxiv.org/abs/2503.09590 (June 15, 4–6PM, Poster #282)
📌 ReVisionLLM: arxiv.org/abs/2411.14901 (June 14, 5–7PM, Poster #307)

Gedas Bertasius Tanveer Hannan

Mohaiminul (Emon) Islam (@mmiemon)'s Twitter Profile Photo

Great to see a lot of interest among the video understanding community about ReVisionLLM! If you missed it, check out arxiv.org/abs/2411.14901

Tanveer Hannan
Tanveer Hannan (@hannan_tanveer)'s Twitter Profile Photo

🚀 Check out our latest work, ReVisionLLM, now featured on the MCML blog! 🔍 A Vision-Language Model for accurate temporal grounding in hour-long videos. 👉 mcml.ai/news/2025-06-2… #VisionLanguage #MultimodalAI #MCML #CVPR2025
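To make the recursive idea concrete, here is a schematic reading of the coarse-to-fine design, not the paper's actual code: the model scans the hour-long video at low temporal resolution, then repeatedly zooms into promising segments until they are short enough to ground precisely. `score_fn` is a hypothetical stand-in for the VLM call:

```python
from typing import Callable, List, Tuple

Segment = Tuple[float, float]  # (start_sec, end_sec)

def recursive_ground(query: str,
                     segment: Segment,
                     score_fn: Callable[[str, Segment], List[Segment]],
                     min_len: float = 10.0) -> List[Segment]:
    """Coarse-to-fine temporal grounding sketch: ask the model (score_fn)
    for promising sub-segments, then recurse into each until segments
    are short enough to return directly. score_fn must return strictly
    smaller sub-segments for the recursion to terminate."""
    start, end = segment
    if end - start <= min_len:
        return [segment]
    hits = score_fn(query, segment)  # candidate sub-segments from the VLM
    results: List[Segment] = []
    for sub in hits:
        results.extend(recursive_ground(query, sub, score_fn, min_len))
    return results
```

The appeal of this recursive structure for hour-long videos is that the model never has to attend over the full frame sequence at fine resolution; each call only sees one manageable window.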

Mohaiminul (Emon) Islam (@mmiemon)'s Twitter Profile Photo

🚀 On the job market! Final-year PhD @ UNC Chapel Hill working on computer vision, video understanding, multimodal LLMs & AI agents. 2x Research Scientist Intern @ Meta.

🔍 Seeking Research Scientist/Engineer roles!
🔗 md-mohaiminul.github.io
📧 mmiemon [at] cs [dot] unc [dot] edu