Robin Tibor Schirrmeister (@robintibor) 's Twitter Profile
Robin Tibor Schirrmeister

@robintibor

ID: 35309065

Joined: 25-04-2009 20:59:39

439 Tweets

158 Followers

168 Following

Frank Hutter (@frankrhutter) 's Twitter Profile Photo

I like this new paper on DL vs trees at the #NeurIPS 2023 datasets and benchmark track: arxiv.org/abs/2305.02997. One of their findings: #TabPFN performs well across 98 datasets: it ties for 1st place with CatBoost in predictive performance but is 200x faster to train. This is
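For readers who want to try the comparison themselves, here is a minimal sketch using the sklearn-style interfaces of the tabpfn and catboost packages (dataset and settings are arbitrary, chosen only for illustration):

```python
# Minimal sketch: TabPFN vs CatBoost on a small tabular task.
# Assumes the sklearn-style interfaces of the `tabpfn` and `catboost` packages.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from tabpfn import TabPFNClassifier       # pip install tabpfn
from catboost import CatBoostClassifier   # pip install catboost

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# TabPFN: a prior-fitted transformer, no per-dataset gradient training.
tabpfn = TabPFNClassifier()
tabpfn.fit(X_tr, y_tr)                    # "fit" mostly stores the data as in-context examples
print("TabPFN   acc:", accuracy_score(y_te, tabpfn.predict(X_te)))

# CatBoost: gradient-boosted trees trained on the same split.
cat = CatBoostClassifier(verbose=0)
cat.fit(X_tr, y_tr)
print("CatBoost acc:", accuracy_score(y_te, cat.predict(X_te)))
```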

Polina Kirichenko (@polkirichenko) 's Twitter Profile Photo

Excited to share our #NeurIPS paper analyzing the good, the bad and the ugly sides of data augmentation (DA)! DA is crucial for computer vision but can introduce class-level performance disparities. We explain and address these negative effects in: openreview.net/pdf?id=yageaKl… 1/9
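A small, illustrative way to surface the class-level disparities the tweet refers to (not the paper's code): compare per-class accuracy of a model trained with the augmentation policy against one trained without it.

```python
# Illustrative sketch: measure per-class accuracy deltas between a model
# trained with data augmentation (DA) and one trained without it.
import numpy as np

def per_class_accuracy(y_true, y_pred, num_classes):
    return np.array([
        (y_pred[y_true == c] == c).mean() if (y_true == c).any() else np.nan
        for c in range(num_classes)
    ])

def da_class_disparity(y_true, pred_no_da, pred_da, num_classes):
    acc_no_da = per_class_accuracy(y_true, pred_no_da, num_classes)
    acc_da = per_class_accuracy(y_true, pred_da, num_classes)
    delta = acc_da - acc_no_da              # negative entries: classes hurt by DA
    return sorted(enumerate(delta), key=lambda kv: kv[1])

# Toy example with 3 classes:
y = np.array([0, 0, 1, 1, 2, 2])
no_da = np.array([0, 0, 1, 1, 2, 2])        # perfect without DA
da = np.array([0, 0, 1, 1, 1, 2])           # class 2 degrades under DA
print(da_class_disparity(y, no_da, da, num_classes=3))
```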

Steven Adriaensen (@stadriaensen) 's Twitter Profile Photo

Heading off to #NewOrleans to present our #NeurIPS2023 paper. w/ Herilalaina Rakotoarison, Samuel Müller, and Frank Hutter, we propose LC-PFN: a transformer that does Bayesian learning curve extrapolation in a single forward pass, 10,000x faster than prior art based on MCMC.
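A hypothetical interface sketch of that single-forward-pass idea (the function and model signature below are illustrative, not the released lcpfn API): a PFN pre-trained offline on synthetic learning curves amortizes the Bayesian extrapolation, so inference reduces to one transformer call instead of an MCMC run.

```python
# Hypothetical interface sketch (names and signatures are illustrative,
# not the real lcpfn package): the PFN was pre-trained on synthetic curves,
# so extrapolation is amortized into a single forward pass.
import torch

@torch.no_grad()
def extrapolate(model, observed_curve, query_epochs):
    """observed_curve: 1D tensor of validation scores at epochs 1..t.
    Returns a discretized predictive distribution for each query epoch."""
    ctx = observed_curve.unsqueeze(0)                     # (1, t) observed prefix as context
    queries = torch.as_tensor(query_epochs).unsqueeze(0)  # (1, k) future epochs to predict
    logits = model(ctx, queries)                          # one forward pass -> (1, k, n_bins)
    return logits.softmax(-1)                             # bin probabilities per query epoch
```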

Noah Hollmann (@noahholl) 's Twitter Profile Photo

Excited to present "LLMs for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" at #NeurIPS2023! We had a simple idea: Can we use the domain knowledge of LLMs to automate the feature engineering process? with Samuel Müller Frank Hutter

Rhea Sukthanker (@rheasukthanker) 's Twitter Profile Photo

I am at #NeurIPS2023 from 10-16 Dec. My research focuses on efficient and multi-objective neural architecture search, e.g., for fairness and hardware efficiency. Please reach out if you'd like to chat about using #AutoML and #NAS to make deep learning more aligned with human objectives.

Frank Hutter (@frankrhutter) 's Twitter Profile Photo

I'm at #NeurIPS2023 with my amazing team, excited to be presenting 6 papers at the main track and 2 at workshops, as well as a keynote in the table representation learning workshop. Here's all the info in one tweet, ordered by day. Please come by and chat with us 🙂 Tuesday

Dan Zhang (@isdanzhang) 's Twitter Profile Photo

👇Check out our work presented at NeurIPS now! Early-exit networks aim to adjust the computation effort to the actual complexity of classifying each sample. We further encourage them to become gradually confident: more processing only for better prediction quality!
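A generic early-exit sketch (illustrative, not the paper's model): intermediate classifier heads let easy samples leave the network as soon as their softmax confidence clears a threshold, so later blocks run only when the extra compute can improve the prediction.

```python
# Generic early-exit inference sketch: stop at the first head whose softmax
# confidence exceeds a threshold; harder samples use more blocks.
import torch
import torch.nn as nn

class EarlyExitNet(nn.Module):
    def __init__(self, dim=64, num_classes=10, num_blocks=3):
        super().__init__()
        self.blocks = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, dim), nn.ReLU()) for _ in range(num_blocks)]
        )
        self.heads = nn.ModuleList(
            [nn.Linear(dim, num_classes) for _ in range(num_blocks)]
        )

    @torch.no_grad()
    def forward(self, x, threshold=0.9):
        # Assumes a single sample (batch size 1) for the confidence check.
        for i, (block, head) in enumerate(zip(self.blocks, self.heads)):
            x = block(x)
            probs = head(x).softmax(-1)
            conf, pred = probs.max(-1)
            if conf.item() >= threshold or i == len(self.blocks) - 1:
                return pred, i          # exit index shows how much compute was used

model = EarlyExitNet()
pred, exit_idx = model(torch.randn(1, 64))
print(f"predicted class {pred.item()} at exit {exit_idx}")
```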

Bernardino Romera-Paredes (@ber24) 's Twitter Profile Photo

Today in nature we present FunSearch: an LLM-powered system that has found new discoveries in an established mathematical problem. FunSearch is versatile too: it can be applied to impactful practical problems. Blogpost: deepmind.google/discover/blog/… Paper: storage.googleapis.com/deepmind-media… 1/8

Matej Balog (@matejbalog) 's Twitter Profile Photo

Can LLMs be used to discover something new? Yes! Happy to share our new paper in nature on #FunSearch, a system that uses LLMs for making new discoveries in mathematical sciences. Blog: deepmind.google/discover/blog/… Paper: storage.googleapis.com/deepmind-media… 1/n

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

Introducing FunSearch in nature: a method using large language models to search for new solutions in mathematics & computer science. 🔍 It pairs the creativity of an LLM with an automated evaluator to guard against hallucinations and incorrect ideas. 🧵 dpmd.ai/x-funsearch
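A minimal sketch of that LLM-plus-evaluator pairing (illustrative, not DeepMind's implementation): candidate programs proposed by the LLM are scored by a deterministic evaluator, so hallucinated or broken code is simply discarded. `llm_generate` and the toy evaluator are hypothetical stand-ins.

```python
# Illustrative FunSearch-style loop: the LLM proposes programs, an automated
# evaluator scores them, and only valid, scored candidates enter the pool.
def funsearch_style_loop(llm_generate, evaluate, seed_program, n_iters=100):
    pool = [(evaluate(seed_program), seed_program)]     # scored program database
    for _ in range(n_iters):
        _, best = max(pool)                             # condition on a strong parent
        candidate = llm_generate(prompt=best)           # LLM mutates/extends the code
        try:
            score = evaluate(candidate)                 # automated check, no trust in the LLM
        except Exception:
            continue                                    # evaluator rejects invalid programs
        pool.append((score, candidate))
    return max(pool)

# Hypothetical toy evaluator: the program must define solve(), and we score
# the size of the solution it returns.
def toy_evaluate(program_src):
    scope = {}
    exec(program_src, scope)
    return len(scope["solve"]())
```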

Eric Topol (@erictopol) 's Twitter Profile Photo

Big #AI discovery: a new structural class of antibiotics (the last one took 38 years) with multiple compounds effective vs methicillin-resistant Staph aureus, without toxicity. Nature: nature.com/articles/s4158…

Anna Khoreva (@anna_khoreva) 's Twitter Profile Photo

Excited to share that our paper on integrating adversarial supervision into diffusion model training for image synthesis has been accepted to #ICLR2024. Bosch Center for Artificial Intelligence. Paper: arxiv.org/abs/2401.08815 Code & models: github.com/boschresearch/…
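A generic illustration of adding an adversarial term to a diffusion training step (not the paper's exact formulation): alongside the usual denoising loss, a discriminator scores the model's one-step estimate of the clean image. The `unet` and `disc` call signatures below are assumptions for the sketch.

```python
# Generic sketch: standard DDPM denoising loss plus an adversarial term on the
# predicted clean image x0. Not the paper's objective; signatures are assumed.
import torch
import torch.nn.functional as F

def diffusion_step_with_adv(unet, disc, x0, alpha_bar_t, t, adv_weight=0.1):
    noise = torch.randn_like(x0)
    a = alpha_bar_t.view(-1, 1, 1, 1)
    x_t = a.sqrt() * x0 + (1 - a).sqrt() * noise             # forward noising
    eps_pred = unet(x_t, t)
    denoise_loss = F.mse_loss(eps_pred, noise)               # standard DDPM loss
    x0_pred = (x_t - (1 - a).sqrt() * eps_pred) / a.sqrt()   # one-step clean-image estimate
    adv_loss = -torch.log(torch.sigmoid(disc(x0_pred)) + 1e-8).mean()  # fool the discriminator
    return denoise_loss + adv_weight * adv_loss
```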

Polina Kirichenko (@polkirichenko) 's Twitter Profile Photo

An image is worth more than one caption! In our #ICML2024 paper “Modeling Caption Diversity in Vision-Language Pretraining” we explicitly bake in that observation in our VLM called Llip and condition the visual representations on the latent context. arxiv.org/abs/2405.00740 🧵1/6
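A rough sketch of the stated idea only, conditioning the pooled visual representation on a caption-derived context vector via attention pooling; this is not the actual Llip architecture, and the module below is illustrative.

```python
# Illustrative sketch: pool patch features with a caption-conditioned query,
# so different captions yield different embeddings of the same image.
import torch
import torch.nn as nn

class ContextConditionedPooling(nn.Module):
    def __init__(self, dim=512, n_heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)

    def forward(self, visual_tokens, context):
        # visual_tokens: (B, N, D) patch features; context: (B, D) from the caption.
        q = context.unsqueeze(1)                      # (B, 1, D) caption-derived query
        pooled, _ = self.attn(q, visual_tokens, visual_tokens)
        return pooled.squeeze(1)                      # (B, D) caption-conditioned embedding
```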

Lily Zhang (@lilyhzhang) 's Twitter Profile Photo

Excited to present Targeted Negative Training, a finetuning method for updating a language model to avoid unwanted outputs while minimally changing model behavior otherwise: arxiv.org/abs/2406.13660 (Work done during internship at Google, with Rajesh Ranganath and Arya Tafvizi)
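One generic way to realize the stated goal, sketched below under the assumption of Hugging Face-style causal LMs exposing `.logits` (this is not necessarily the paper's objective): an unlikelihood term pushes probability mass away from unwanted continuations, while a KL term anchors the model to a frozen reference on ordinary data.

```python
# Generic sketch, not the paper's method: penalize unwanted outputs and stay
# close to the original model elsewhere. Assumes HF-style models and that
# `labels` contain valid token ids (no -100 padding).
import torch
import torch.nn.functional as F

def negative_plus_anchor_loss(model, ref_model, bad_batch, anchor_batch, beta=1.0):
    # Unlikelihood term: push probability mass away from unwanted continuations.
    bad_logp = F.log_softmax(model(**bad_batch).logits, -1)
    tgt = bad_batch["labels"].unsqueeze(-1)
    p_bad = bad_logp.gather(-1, tgt).exp().clamp(max=1 - 1e-6)
    neg = -torch.log1p(-p_bad).mean()                 # -log(1 - p(unwanted token))

    # KL anchor: keep behavior on ordinary data close to the frozen reference.
    with torch.no_grad():
        ref_logp = F.log_softmax(ref_model(**anchor_batch).logits, -1)
    cur_logp = F.log_softmax(model(**anchor_batch).logits, -1)
    anchor = F.kl_div(cur_logp, ref_logp, log_target=True, reduction="batchmean")
    return neg + beta * anchor
```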

Aaron Klein (@kleiaaro) 's Twitter Profile Photo

Happy to share that I am returning to academia! I am starting a new research group on AutoML at ScaDS.AI. The team's focus will be on methods to reduce the training and inference time of LLMs. I am currently looking for a PhD student to work on NAS and HPO for LLMs.

Robert Lange (@roberttlange) 's Twitter Profile Photo

🎉 Stoked to share The AI-Scientist 🧑‍🔬 - our end-to-end approach for conducting research with LLMs including ideation, coding, experiment execution, paper write-up & reviewing. Blog 📰: sakana.ai/ai-scientist/ Paper 📜: arxiv.org/abs/2408.06292 Code 💻: github.com/SakanaAI/AI-Sc…

Ke Li 🍁 (@kl_div) 's Twitter Profile Photo

Diffusion models turn the data into a mixture of isotropic Gaussians, and so struggle to capture the underlying structure when trained on small datasets. In our new #ECCV2024 paper, we introduce RS-IMLE, a generative model that gets around this issue. Website:
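A small sketch of the claim in the first sentence: the diffusion forward process maps each training point toward an isotropic Gaussian, so with only a handful of points the noised training distribution is essentially a mixture of Gaussians centered on them.

```python
# Sketch of the forward (noising) process: each data point x0 is pushed toward
# an isotropic Gaussian as t grows.
import torch

def forward_diffuse(x0, t, betas):
    alpha_bar = torch.cumprod(1.0 - betas, dim=0)[t]          # \bar{alpha}_t
    noise = torch.randn_like(x0)
    return alpha_bar.sqrt() * x0 + (1 - alpha_bar).sqrt() * noise

betas = torch.linspace(1e-4, 0.02, 1000)
x0 = torch.tensor([3.0, -2.0])                                # one "training point"
print(forward_diffuse(x0, t=999, betas=betas))                # nearly pure N(0, I) at the last step
```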

Polina Kirichenko (@polkirichenko) 's Twitter Profile Photo

We are hiring a PhD research intern at FAIR w/ Mark Ibrahim and Kamalika Chaudhuri to start this summer or fall! Potential topics: trustworthy and reliable LLMs, multi-modal LLMs and agents, post-training, and reasoning, with a focus on open science and sharing our findings in a paper at the end.

Polina Kirichenko (@polkirichenko) 's Twitter Profile Photo

Excited to release AbstentionBench -- our paper and benchmark on evaluating LLMs’ *abstention*: the skill of knowing when NOT to answer! Key finding: reasoning LLMs struggle with unanswerable questions and hallucinate! Details and links to paper & open source code below! 🧵1/9
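An illustrative sketch of what such an abstention evaluation measures (not the actual AbstentionBench harness): on questions flagged as unanswerable, the model should refuse rather than guess. `ask_model` and the abstention markers below are hypothetical.

```python
# Illustrative abstention check: fraction of unanswerable questions on which
# the model correctly refuses to answer.
ABSTAIN_MARKERS = ("i don't know", "cannot be determined", "not enough information")

def is_abstention(answer: str) -> bool:
    return any(m in answer.lower() for m in ABSTAIN_MARKERS)

def abstention_recall(ask_model, dataset):
    """dataset: iterable of dicts like {"question": str, "answerable": bool}."""
    unanswerable = [ex for ex in dataset if not ex["answerable"]]
    hits = sum(is_abstention(ask_model(ex["question"])) for ex in unanswerable)
    return hits / max(len(unanswerable), 1)     # 1.0 = always abstains when it should
```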
