Jannik Kossen (@janundnik) 's Twitter Profile
Jannik Kossen

@janundnik

AI Research Scientist at FAIR (@meta) working on LLMs for CodeGen and Reasoning. PhD Student @OATML_Oxford and @oxcsml. Interned @DeepMind and @GoogleAI.

ID: 284635049

linkhttp://jlko.eu calendar_today19-04-2011 17:18:30

307 Tweet

1,1K Followers

695 Following

Basil Mustafa (@_basilm) 's Twitter Profile Photo

awesome work from the brilliant, tenacious & impeccably organized Jannik Kossen ! works great for contrastive learning, but generally this is a performant + flexible way to distill from teachers when you don't want the student to be too forced to exactly match. go chat to Jannik!

Yarin (@yaringal) 's Twitter Profile Photo

I'm hiring! I'm building 4 research groups under me at AISI (formerly the UK's Taskforce on Frontier AI) to work on foundational AI safety research. [1/5] gov.uk/government/pub…

Pascal Notin (@notinpascal) 's Twitter Profile Photo

Want to catch up with the rapid progress in ML for functional protein design? Not sure where to start? Check out our review in Nature Biotech! #ProteinDesign #NatureBiotechnology #Cover

Freddie Bickford Smith (@fbickfordsmith) 's Twitter Profile Photo

The current default recipe for Bayesian active learning doesn’t really work beyond MNIST scale. We suggest why that is and identify a simple fix. arxiv.org/abs/2404.17249 AISTATS Conference with @adamefoster Tom Rainforth 1/5

Tom Rainforth (@tom_rainforth) 's Twitter Profile Photo

I have an opening for a 2.5-year postdoc position in the RainML lab as part of my ERC grant on probabilistic machine learning and intelligent data acquisition. Application deadline 10th July 2024. See here for details and to apply: tinyurl.com/rainmlpostdoc

Sebastian Farquhar (@seb_far) 's Twitter Profile Photo

Great piece in TIME on hallucinations and semantic entropy from Billy Perrigo Precise but accessible. Was a pleasure to speak with him. So many insightful questions.

Sebastian Farquhar (@seb_far) 's Twitter Profile Photo

Excellent piece by karin verspoor (professor) in @nature News and Views discussing our recent paper on detecting hallucinations with semantic entropy nature.com/articles/d4158…

karin verspoor (professor) (@karinv) 's Twitter Profile Photo

I was invited by Nature News & Views to comment on a paper recently published in nature by Sebastian Farquhar Jannik Kossen Lorenz Kuhn Yarin on "Detecting hallucinations in large language models using semantic entropy". My piece on LLMs "Fighting fire with fire" is here: rdcu.be/dLkVv

Oxford Comp Sci (@compscioxford) 's Twitter Profile Photo

Major study out now in Nature by Prof Yarin Gal Yarin, Dr Sebastian Farquhar Sebastian Farquhar, Jannik Kossen Jannik Kossen & Lorenz Kuhn Lorenz Kuhn advances the reliability of AI with novel approach to improving the detection of 'hallucinating’ LLMs. Read more: cs.ox.ac.uk/news/2345-full…

Major study out now in Nature by Prof Yarin Gal <a href="/yaringal/">Yarin</a>, Dr Sebastian Farquhar <a href="/seb_far/">Sebastian Farquhar</a>, Jannik Kossen <a href="/janundnik/">Jannik Kossen</a> &amp; Lorenz Kuhn <a href="/_lorenzkuhn/">Lorenz Kuhn</a> advances the reliability of AI with novel approach to improving the detection of 'hallucinating’ LLMs. Read more: cs.ox.ac.uk/news/2345-full…
Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs abs: arxiv.org/abs/2406.15927 Semantic entropy: repeated LLM generation at non-zero temperature is usually consistent for accurate generation, inconsistent for inaccurate generation. This consistency can

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs

abs: arxiv.org/abs/2406.15927

Semantic entropy: repeated LLM generation at non-zero temperature is usually consistent for accurate generation, inconsistent for inaccurate generation. This consistency can
Rishabh Agarwal (@agarwl_) 's Twitter Profile Photo

Yoav Artzi Sewon Min I think this paper by Jannik Kossen might be what you are looking for: openreview.net/forum?id=YPIA7…. We also looked at this in the many-shot setting on a sentiment analysis task (that models do look at labels indeed and can even override pre-training biases): arxiv.org/abs/2404.11018

<a href="/yoavartzi/">Yoav Artzi</a> <a href="/sewon__min/">Sewon Min</a> I think this paper by <a href="/janundnik/">Jannik Kossen</a> might be what you are looking for: openreview.net/forum?id=YPIA7….

We also looked at this in the many-shot setting on a sentiment analysis task (that models do look at labels indeed and can even override pre-training biases): arxiv.org/abs/2404.11018
Jannik Kossen (@janundnik) 's Twitter Profile Photo

Life update! I've joined FAIR @meta as an AI Research Scientist to work on code generation with LLMs in Gabriel Synnaeve's team ✨ Thanks to everyone who supported me along the way 🙏 I'm super excited for what's to come!

Life update!

I've joined FAIR @meta as an AI Research Scientist to work on code generation with LLMs in <a href="/syhw/">Gabriel Synnaeve</a>'s team ✨

Thanks to everyone who supported me along the way 🙏

I'm super excited for what's to come!
aj (@anndvision) 's Twitter Profile Photo

hello ! we will be presenting Estimating the Hallucination Rate of Generative AI at NeurIPS this friday come if you'd like to chat about predicting and understanding in-context hallucinations and epistemic uncertainty poster 2703 - east - 4:30pm arxiv.org/abs/2406.07457

hello ! 

we will be presenting Estimating the Hallucination Rate of Generative AI at NeurIPS this friday

come if you'd like to chat about predicting and understanding in-context hallucinations and epistemic uncertainty

poster 2703 - east - 4:30pm
arxiv.org/abs/2406.07457
Freddie Bickford Smith (@fbickfordsmith) 's Twitter Profile Photo

The aleatoric-epistemic view on uncertainty doesn't serve ML researchers' needs and should be replaced. Come to the talk and poster tomorrow (Sat 14 Dec) at the #NeurIPS2024 workshop on Bayesian decisions (gp-seminar-series.github.io/neurips-2024). openreview.net/forum?id=WIjgb…

The aleatoric-epistemic view on uncertainty doesn't serve ML researchers' needs and should be replaced.

Come to the talk and poster tomorrow (Sat 14 Dec) at the #NeurIPS2024 workshop on Bayesian decisions (gp-seminar-series.github.io/neurips-2024).

openreview.net/forum?id=WIjgb…