Nic Fishman (@njwfish) 's Twitter Profile
Nic Fishman

@njwfish

(s)gd is all you need. prev: cs/soc @stanford, ml @oxfordstats. now: stats @harvard. always @zahra_thab. he/they.

ID: 2359436953

linkhttp://njw.fish calendar_today24-02-2014 12:30:20

4,4K Tweet

844 Followers

755 Following

Nic Fishman (@njwfish) 's Twitter Profile Photo

🚨 New preprint 🚨 We introduce Generative Distribution Embeddings (GDEs) β€” a framework for learning representations of distributions, not just datapoints. GDEs enable multiscale modeling and come with elegant statistical theory and some miraculous geometric results! 🧡

🚨 New preprint 🚨

We introduce Generative Distribution Embeddings (GDEs) β€” a framework for learning representations of distributions, not just datapoints.

GDEs enable multiscale modeling and come with elegant statistical theory and some miraculous geometric results!

🧡
Nic Fishman (@njwfish) 's Twitter Profile Photo

1/ Real-world problems are hierarchical: we observe sets of datapoints from a distribution. - Cells grouped by clone or patient - Sequences by tissue, time, or location And we want to model the clone, the patient, or the tissue, not just the individual points.

Nic Fishman (@njwfish) 's Twitter Profile Photo

2/ GDEs show that all you need is: 1. an encoder that maps samples to a latent representation and 2. any conditional generative model that samples from the distribution conditional on the latent to lift autoencoders from points to distributions!

Nic Fishman (@njwfish) 's Twitter Profile Photo

3/ Why does this matter? βœ… Abstracts away sampling noise βœ… Enables reasoning at the population level βœ… Supports prediction, comparison, and generation of distributions βœ… Lets us leverages powerful domain-specific generative models to learn distributional structure

Nic Fishman (@njwfish) 's Twitter Profile Photo

4/ With this simple setup, we find surprisingly elegant geometry: GDE latent distances track Wasserstein-2 (Wβ‚‚) distances across modalities (shown for multinomial distributions) Latent interpolations recover optimal transport paths (shown for Gaussians and Gaussian mixtures)

4/ With this simple setup, we find surprisingly elegant geometry:

GDE latent distances track Wasserstein-2 (Wβ‚‚) distances across modalities (shown for multinomial distributions)

Latent interpolations recover optimal transport paths (shown for Gaussians and Gaussian mixtures)