Ehsan Kamalloo
@ehsk0
Research Scientist @ServiceNowRSRCH
ID: 1663072824
http://ehsk.github.io 11-08-2013 17:51:55
177 Tweets
326 Followers
592 Following
Internship at ServiceNow Research to build the next generation of computer-use agents that are safe and secure from malicious attacks. Focus on intervention strategies and defenses that make agents robust against unsafe behavior. Apply here: bit.ly/3V3mmTg
Glad to see OpenAI prioritizing abstention responses in their paper! That's a great intro to our TMLR paper, in which we developed an iterative self-reflection method for LLMs to know when to abstain, without ground truth and at no additional cost at test time. openreview.net/pdf?id=SvKPfch…
💡So far, I have been sharing our multimodal AI research at ServiceNow focused on reasoning over pixels. Today, we share a new chapter with an open-source release of our big initiative in the voice and speech domain.🚀 🎧 AU-Harness: Holistic Evaluation of Audio LLM Responses
Excited to speak at the AAAI-26 Workshop on Agentic AI Benchmarks & Enterprise Tasks (Jan 26, Singapore) 🇸🇬 As agents are rapidly productized, realistic enterprise benchmarks for capabilities and reliability are essential! Submit: openreview.net/group?id=AAAI.… 🗓️ Oct 29 cc Graham Neubig
In-flight weight updates have gone from a “weird trick” to a must for training LLMs with RL in the last few weeks. If you want to understand the on-policy and throughput benefits, here’s the CoLM talk 🇺🇦 Dzmitry Bahdanau and I gave: youtu.be/Z1uEuRKACRs