Thomas Rory Stone, Ph.D. 🇬🇧 🇪🇺 🇺🇦 (@thomasrorystone) Twitter Tweets • TwiCopy

Thomas Rory Stone, Ph.D. 🇬🇧 🇪🇺 🇺🇦

@thomasrorystone

+ Follow

Investing in advanced technologies for people and planet @kintsugiad 🏞️ Previously Co-founder of PredictionIO 🐸 (Acquired by Salesforce) and Ph.D. @UCLCS 🖥

ID: 62800938

linkhttp://kintsugi.vc calendar_today04-08-2009 12:13:54

9,9K Tweet

1,1K Followers

2,2K Following

Tim Rocktäschel

@_rockt

5 months ago

Happy "The NetHack Learning Environment is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).

Happy "<a href="/NetHack_LE/">The NetHack Learning Environment</a> is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).

thumb_up_off_alt133

chat_bubble_outline3

repeat28

shareShare

Fouad Al Noor

@fouadalnoor1

5 months ago

I am hugely excited to announce that ThinkSono has raised $6Mn in our oversubscribed seed two funding round ($13Mn raised to date)! This was led by Id4 ventures with participation from Cur8 Capital, Brandenburg Kapital, clinical leaders and the founders!

thumb_up_off_alt10

chat_bubble_outline1

repeat2

shareShare

Joseph Gordon-Levitt

@hitrecordjoe

5 months ago

A little shot of hope and optimism today from the United Nations. I got to speak about AI at their annual Internet Governance Forum (my first time at a UN thing 🤩) and it really does feel good to see so many people from so many places around the world convening in good faith to

thumb_up_off_alt775

chat_bubble_outline56

repeat67

shareShare

The Economist

@theeconomist

5 months ago

In Silicon Valley, old-school spreadsheet measures are out; vibe valuing is in. We explain how AI startups are raising billions before even making a dollar econ.st/3Ghqco1

thumb_up_off_alt23

chat_bubble_outline3

repeat6

shareShare

Ben Casnocha

@bencasnocha

5 months ago

The LLM bias to positivity is interesting.

thumb_up_off_alt0

chat_bubble_outline1

repeat1

shareShare

Dr. Dominic Ng

@drdominicng

5 months ago

Microsoft claims their new AI framework diagnoses 4x better than doctors. I'm a medical doctor and I actually read the paper. Here's my perspective on why this is both impressive AND misleading ... 🧵

thumb_up_off_alt8,8K

chat_bubble_outline273

repeat1,1K

shareShare

Tom Westgarth

@tom_westgarth15

5 months ago

Many top academic AI researchers are GPU poor. The Sovereign AI Unit wants to change this story. We are seeking ambitious AI research proposals for UK academics aiming to take their research to the next level. Read below to find out more and apply 🧵👇

thumb_up_off_alt60

chat_bubble_outline3

repeat16

shareShare

MIT CSAIL

@mit_csail

5 months ago

77 years ago this week Claude Shannon ushered in the field of information theory with his paper "A Mathematical Theory of Communication," which has been cited over 100,000 times: bit.ly/44rfByX

thumb_up_off_alt1,1K

chat_bubble_outline17

repeat312

shareShare

François Chollet

@fchollet

5 months ago

We are now closer to the year 2100 than to 1950. Also closer to 2050 than to 2000. Time to start acting like it.

thumb_up_off_alt1,1K

chat_bubble_outline51

repeat230

shareShare

François-Xavier Briol

@fx_briol

5 months ago

Today is the day of the pre-ICML event at UCL! Come check out the exciting work from academics, industry researchers, postdocs and PhD students from around London: sites.google.com/view/pre-icml-… UCL Statistical Science UCL CSML

thumb_up_off_alt32

chat_bubble_outline0

repeat6

shareShare

Edward Hughes

@edwardfhughes

5 months ago

The automation of innovation is within reach! Delighted that my RAAIS talk is now available for anyone to watch, alongside an excellent blogpost summary by the inimitable Nathan Benaich.

thumb_up_off_alt41

chat_bubble_outline0

repeat16

shareShare

ARC Prize

@arcprize

5 months ago

Grok 4 (Thinking) achieves new SOTA on ARC-AGI-2 with 15.9% This nearly doubles the previous commercial SOTA and tops the current Kaggle competition SOTA

thumb_up_off_alt5,5K

chat_bubble_outline236

repeat751

shareShare

Matt Clifford

@matthewclifford

4 months ago

I’m excited about this: 12 month fellowship for AI engineers to work on some of the biggest challenges in government👇

thumb_up_off_alt67

chat_bubble_outline4

repeat18

shareShare

Ethan Mollick

@emollick

4 months ago

I am starting to think sycophancy is going to be a bigger problem than pure hallucination as LLMs improve. Models that won’t tell you directly when you are wrong (and justify your correctness) are ultimately more dangerous to decision-making than models that are sometimes wrong.

thumb_up_off_alt3,3K

chat_bubble_outline217

repeat433

shareShare

Thomas Rory Stone, Ph.D. 🇬🇧 🇪🇺 🇺🇦

@thomasrorystone

4 months ago

If you are building advanced technology in Europe check out Andreas Klinger 🦾’s PROTOTYPE prototypecap.com (now on Fund III) 🇪🇺🦾

If you are building advanced technology in Europe check out <a href="/andreasklinger/">Andreas Klinger 🦾</a>’s PROTOTYPE prototypecap.com (now on Fund III) 🇪🇺🦾

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Psyho

@fakepsyho

4 months ago

Humanity has prevailed (for now!) I'm completely exhausted. I figured, I had 10h of sleep in the last 3 days and I'm barely alive. I'll post more about the contest when I get some rest. (To be clear, those are provisional results, but my lead should be big enough)

thumb_up_off_alt13,13K

chat_bubble_outline549

repeat1,1K

shareShare

François Chollet

@fchollet

4 months ago

Today we're releasing a developer preview of our next-gen benchmark, ARC-AGI-3. The goal of this preview, leading up to the full version launch in early 2026, is to collaborate with the community. We invite you to provide feedback to help us build the most robust and effective

thumb_up_off_alt2,2K

chat_bubble_outline209

repeat1,1K

shareShare

François Chollet

@fchollet

4 months ago

What's included in the developer preview: • Three test games: Go play them yourself – they're fun! • An agent API: Start building and testing your agents against the games. • Sprint competition: Submit the best-performing agent in the next 4 weeks and win a $10,000 prize.

thumb_up_off_alt122

chat_bubble_outline9

repeat9

shareShare

Edward Grefenstette

@egrefen

4 months ago

Do you have a PhD (or equivalent) or will have one in the coming months (i.e. 2-3 months away from graduating)? Do you want to help build open-ended agents that help humans do humans things better, rather than replace them? We're hiring 1-2 Research Scientists! Check the 🧵👇

thumb_up_off_alt339

chat_bubble_outline9

repeat38

shareShare

Thomas Wolf

@thom_wolf

4 months ago

if you’re interested in pushing new records on fusion with AI, we’ve just launched a kaggle-like competition with Proxima Fusion (the startup that spun out of Wendelstein 7-X discussed in this HN news) All the details at huggingface.co/blog/cgeorgiaw…

thumb_up_off_alt29

chat_bubble_outline1

repeat7

shareShare