Ilia Kulikov
@uralik1
ramming llms @MetaAI
ID: 977013150871638016
https://iliakulikov.ru 23-03-2018 02:44:22
118 Tweet
511 Followers
270 Following
"Mode recovery in neural autoregressive sequence modeling" We study mismatches between the most probable sequences of each stage of the "learning chain": ground-truth, data-collection, learning, decoding Led by Ilia Kulikov w/ Kyunghyun Cho, SPNLP talk on Friday arxiv.org/abs/2106.05459
Apparently ancestral sampling yields high quality translations if we sample enough number of times, but how to choose one of them in the end? Bryan Eikema shows how to scale utility computations over large hypotheses spaces efficiently! very cool
We are using fairseq2 for llm post-training research in our team. This release comes with a decent documentation (facebookresearch.github.io/fairseq2/stabl…) 😅 My favorite feature of the lib is the runtime extension support: one can develop research code without forking out the entire lib repo!