Spandana Gella (@gspandana)'s Twitter Profile
Spandana Gella

@gspandana

Research Scientist & Sr. Manager @ServiceNowRSRCH, Montreal. Previously @AmazonScience, PhD:@EdinburghNLP, Intern:@MetaAI, @MSFTResearch

ID: 15960453

Joined: 23-08-2008 20:05:59

711 Tweets

808 Followers

476 Following

Xiangru (Edward) Jian @ ICLR 2025 (@edwardjian2)

🚀 Our team at ServiceNow Research will present our paper at #ICLR2025: BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks 📍 Thursday, April 24 | 10 a.m.–12:30 p.m. 📍 Hall 3 + Hall 2B, Poster #280 🔗 bigdocs.github.io

🇺🇦 Dzmitry Bahdanau (@dbahdanau)

I am excited to open-source PipelineRL - a scalable async RL implementation with in-flight weight updates. Why wait until your bored GPUs finish all sequences? Just update the weights and continue inference! Code: github.com/ServiceNow/Pip… Blog: huggingface.co/blog/ServiceNo…

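The in-flight update idea can be illustrated with a toy sketch (the names below are illustrative, not from the PipelineRL codebase): the inference loop re-checks a shared, versioned weight store between tokens, so a newly published update takes effect mid-sequence instead of only after every sequence finishes.

```python
class WeightStore:
    """Shared, versioned snapshot that the trainer publishes to."""
    def __init__(self, weights):
        self.weights = weights
        self.version = 0

    def publish(self, weights):
        self.weights = weights
        self.version += 1


def generate(store, max_tokens, on_step=None):
    """Toy inference loop: re-reads the weight store between tokens,
    so a published update applies mid-sequence (no restart needed)."""
    out = []
    for step in range(max_tokens):
        out.append(store.version)  # each "token" reflects current weights
        if on_step:
            on_step(step)          # point where a trainer update may land
    return out


store = WeightStore({"bias": 0})
# Simulate the trainer publishing new weights after token 2, mid-generation.
tokens = generate(store, 5,
                  on_step=lambda s: store.publish({"bias": 1}) if s == 2 else None)
# tokens == [0, 0, 0, 1, 1]: the last two tokens already use the new weights.
```

In a real async RL setup the publish would come from a separate training process and the "tokens" from actual model forward passes; the sketch only shows the scheduling idea of swapping weights between decoding steps.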
Xing Han Lu (@xhluca)

⚠️We’re looking for emergency reviewers for the REALM workshop @ ACL 2025 (realm-workshop.github.io) If you work on LLM agents & can review 1–3 papers, we’d love your help! Sign up here: forms.gle/Mah45swKGNrUY5…

Siva Reddy (@sivareddyg)

Incredibly proud of my students Ada Tur and Gaurav Kamath for winning a SAC award at #NAACL2025 for their work on assessing how LLMs model constituent shifts. Humans have a tendency to move heavier constituents towards the end of the sentence. While LLMs unsurprisingly show

Mila - Institut québécois d'IA (@mila_quebec)

Congratulations to Mila members Ada Tur, Gaurav Kamath and Siva Reddy for their SAC award at #NAACL2025! Check out Ada's talk in Session I: Oral/Poster 6. Paper: arxiv.org/abs/2502.05670

Torsten Scholak (@tscholak)

🚨🤯 Today Jensen Huang announced SLAM Lab's newest model on the ServiceNow Events stage: Apriel‑Nemotron‑15B‑Thinker 🚨 A lean, mean reasoning machine punching way above its weight class 👊 Built by SLAM × NVIDIA. Smaller models, bigger impact. 🧵👇

VLMs4All - CVPR 2025 Workshop (@vlms4all)

🚀 Important Update! We're reaching out to collect email IDs of the CulturalVQA and GlobalRG challenge participants for time-sensitive communications, including informing the winning teams. ALL participating teams please fill out the forms below ASAP (ideally within 24 hours). 👇

P Shravan Nayak (@pshravannayak)

🚀 Excited to share that UI-Vision has been accepted at ICML 2025! 🎉 We have also released the UI-Vision grounding datasets. Test your agents on it now! 🚀 🤗 Dataset: huggingface.co/datasets/Servi… #ICML2025 #AI #DatasetRelease #Agents

Perouz Taslakian (@perouzt)

Our team has released the UI-Vision benchmark (accepted at #ICML2025) for testing GUI agent visual grounding and action prediction! 🚀🚀🚀 🤗 Dataset: huggingface.co/datasets/Servi… Special thanks to the students who led this effort, P Shravan Nayak and Xiangru (Edward) Jian, ServiceNow Research

Sai Rajeswar (@rajeswarsai)

Congrats Tianbao Xie and team on this exciting work and release! 🎉 We’re happy to share that Jedi-7B performs on par with UI-Tars-72B agent on our challenging UI-Vision benchmark, with 10x fewer parameters! 👏 Incredible 🤗Dataset: huggingface.co/datasets/Servi… 🌐uivision.github.io

Spandana Gella (@gspandana)

Very excited to release StarFlow: a large, diverse workflow dataset and open models that can transform sketches into executable workflows using VLMs. Stay tuned for more updates in this space!

AK (@_akhaliq)

Rendering-Aware Reinforcement Learning for Vector Graphics Generation RLRF significantly outperforms supervised fine-tuning, addressing common failure modes and enabling precise, high-quality SVG generation with strong structural understanding and generalization

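The announcement names the technique but not its mechanics. As a loose illustration of reward-from-rendering-feedback (the helper names and the toy rasterizer below are assumptions, not the paper's implementation): rasterize the candidate SVG, compare it pixel-wise to the target image, and use the overlap as the RL reward, so the model is scored on what the SVG *renders to* rather than on its markup text.

```python
def iou_reward(pred_pixels, target_pixels):
    """Rendering-feedback reward: intersection-over-union between the
    rasterized candidate and the target image (pixels as coordinate sets)."""
    inter = len(pred_pixels & target_pixels)
    union = len(pred_pixels | target_pixels)
    return inter / union if union else 1.0


def render_rect(x, y, w, h):
    """Stand-in rasterizer for a single <rect> element: returns the set of
    filled pixel coordinates. A real pipeline would rasterize full SVG
    markup with a renderer such as cairosvg."""
    return {(i, j) for i in range(x, x + w) for j in range(y, y + h)}


target = render_rect(0, 0, 4, 4)
good = iou_reward(render_rect(0, 0, 4, 4), target)  # exact match -> 1.0
off = iou_reward(render_rect(2, 0, 4, 4), target)   # shifted rect -> 1/3
```

A production reward would likely use a perceptual or feature-space metric rather than raw IoU, but the design point is the same: the gradient signal flows from the rendered output, which is what lets RL fix failure modes that token-level supervised fine-tuning cannot see.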
Juan A. Rodríguez 💫 (@joanrod_ai)

Thanks AK for sharing our work! Excited to present our next generation of SVG models, now using Reinforcement Learning from Rendering Feedback (RLRF). 🧠 We think we cracked SVG generalization with this one. Go read the paper! arxiv.org/abs/2505.20793 More details on

Spandana Gella (@gspandana)

Are you new to ML, or a first-time author attending ICML? 📢 Submit to New in ML @ ICML 2025! ✅ Feedback from top researchers ✅ Oral presentations + awards ✅ Limited ICML 2025 tickets! newinml.github.io

Spandana Gella (@gspandana)

If you are at #CVPR25, come join us at our workshop in room 104E for an exciting lineup of talks, posters, and a panel! And some cool merch!!

P Shravan Nayak (@pshravannayak)

A Hindu wedding without a sacred fire? A Chinese banquet with forks? Do text-to-image models meet cultural expectations, both explicitly stated and implicitly assumed? Excited to share our latest paper on evaluating cultural alignment in T2I models 🌐 culturalframes.github.io

IVADO (@ivado_qc)

The IVADO #Bootcamp marked the launch of the Thematic Semester on Autonomous #LLM Agents last week at Université de Montréal. Over 4 days, researchers, experts, and #AI enthusiasts gathered for conferences, tutorials, and rich discussions, laying the groundwork for our next two workshops.

Massimo Caccia (@masscaccia)

🔥 We stress-tested today’s best AI code generators in *dependency hell*. Introducing GitChameleon 2.0: 328 challenges for version-controlled code generation. The verdict? Even top models only hit ~50% success.

Alexandre Lacoste (@alex_lacoste_)

🚨 Is #WorkArena on the verge of being solved? Or did GPT-5 just get trained on it? 🔥While some benchmarks show modest gains, GPT-5 is crushing WorkArena L2🔥 ➡️ 69.4% avg success vs. ~40% for next best🤯 ➡️ Complex tasks, up to 100 steps, 5–20 min for humans

Ahmed Masry (@ahmed_masry97)

UI-Vision vs GPT-5: Still holding the crown 👑 and far from saturation. GPT-5 has strengths in coding and reasoning, but when it comes to computer-use tasks, it’s still awkward to rely on it alone. And our team's UI-Vision (ICML 2025) remains a key and still unbeaten multimodal
