Anuj Gupta (@mettalrose)'s Twitter Profile
Anuj Gupta

@mettalrose

PhD researcher @uarizona. GenAI in education. AR/VR. Algorithms. Computational Research. Digital Writing Technologies. UI/UX. Rhetoric, writing studies, TPC.

ID: 126321418

Link: https://www.linkedin.com/in/anuj-gupta-3533541a1/
Joined: 25-03-2010 13:45:33

2.2K Tweets

986 Followers

2.2K Following

Laura Weidinger (@weidingerlaura)

Very proud of our 📣 new paper on Measuring Anthropomorphism in LLMs! This new multi-turn evaluation & large-scale human study led by Lujain Ibrahim لجين إبراهيم is a key step in better understanding what factors make people perceive LLMs as more "human-like". 🧵

Sarvam AI (@sarvamai)

Today we introduce Sarvam-M, a 24B open-weights hybrid model built on top of Mistral Small. Sarvam-M achieves a new benchmark across a range of Indian languages, math, and programming tasks, for a model of its size. Here is a detailed technical blog on how we customize

Anuj Gupta (@mettalrose)

The UAE is planning to become the first country in the world to offer free access to ChatGPT Plus to all its citizens and residents! OpenAI has likely made a deal with an entire country as its client. thearabianstories.com/2025/05/25/fre…

Kiran Garimella (@gvrkiran)

this paper quantitatively shows what someone told me: "everyone loves interdisciplinary but no one will give u a job if you are interdisciplinary" Hiring at top universities rewards disciplinary loyalty over interdisciplinary breadth. Things are changing. arxiv.org/abs/2503.21912

Simon Willison (@simonw)

It's interesting how the major LLM API vendors are converging on the following features:
- Code execution: Python in a sandbox
- Web search (like Anthropic, Mistral seem to use Brave)
- Document library, aka hosted RAG
- Image generation (FLUX for Mistral)
- Model Context Protocol

Ravid Shwartz Ziv (@ziv_ravid)

You know all those arguments that LLMs think like humans? Turns out it's not true. 🧠 In our paper "From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning" we test it by checking if LLMs form concepts the same way humans do. Yann LeCun, Chen Shani, Dan Jurafsky

Simon Willison (@simonw)

I had an hour-long conversation with Natasha Zouves from News Nation about the extremely complex topic of the impact of AI on jobs and careers. The video is on YouTube - here's my Claude Opus 4 assisted index of the transcript, with links to key sections simonwillison.net/2025/May/30/ai…

Anuj Gupta (@mettalrose)

I wonder why "age of AI" has become such a common phrase in recent research articles about AI. I don't remember "age of Covid" being a common academic phrase. How did "age of AI" come to be then? Was it injected by LLMs themselves into academic prose? 🧐

Anuj Gupta (@mettalrose)

People with ALS often use eye-gaze typing, reaching 8–10 wpm (compared to 150–200 wpm for neurotypicals). A new AI tool, SpeakFaster, lets users select just the first letter and predicts the rest. Early results: 29–60% faster communication. nature.com/articles/s4146…

Andrej Karpathy (@karpathy)

Good post from Balaji on the "verification gap". You could see it as there being two modes in creation. Borrowing GAN terminology: 1) generation and 2) discrimination. e.g. painting - you make a brush stroke (1) and then you look for a while to see if you improved the

Anuj Gupta (@mettalrose)

🏆 Honored to receive the 2025 ATTW Graduate Research Award for my work on generative AI prompts in tech & professional communication. Grateful to the committee and mentors for their support! #ATTW2025 #AI #TPC #HCI #WritingStudies

John Gallagher (@meresophistry)

I interviewed 108 machine learning researchers about their experiences with writing! From the abstract: Given the hype around artificial intelligence (AI), it is imperative to investigate how researchers of AI negotiate this hype as well as wrestle with it in their research.

Simon Willison (@simonw)

I've published video, slides and a detailed annotated transcript from my talk at this week's AI Engineer World's Fair conference (AI Engineer) in San Francisco - "The last six months in LLMs, illustrated by pelicans on bicycles"

Dheemanth Reddy (@dheemanthreddy_)

We're launching Veena TTS 🪕 on June 20. Our flagship text-to-speech model for Indian languages 🇮🇳 Natural, expressive, and actually sounds like us. We're launching two models:
Veena Lite
> Open-source and lightweight
> 4 unique, natural-sounding voices
> The first open-source

Anuj Gupta (@mettalrose)

A key critique from Bender et al. (2021) was that LLMs lack linguistic diversity. That’s thankfully changing—Sarvam AI just released open-source models trained on 10 Indian languages including English. Check them out: sarvam.ai 🇮🇳📚