Jaewoong Cho (@jaewoong_cho) 's Twitter Profile
Jaewoong Cho

@jaewoong_cho

ID: 1706846368256380928

calendar_today27-09-2023 01:41:10

4 Tweet

20 Followers

18 Following

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding abs: arxiv.org/abs/2401.07851 repo: github.com/hemingkx/Specu…

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

abs: arxiv.org/abs/2401.07851
repo: github.com/hemingkx/Specu…
AK (@_akhaliq) 's Twitter Profile Photo

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks paper page: huggingface.co/papers/2402.04… State-space models (SSMs), such as Mamba Gu & Dao (2034), have been proposed as alternatives to Transformer networks in language modeling, by incorporating

Can Mamba Learn How to Learn? 

A Comparative Study on In-Context Learning Tasks

paper page: huggingface.co/papers/2402.04…

State-space models (SSMs), such as Mamba Gu & Dao (2034), have been proposed as alternatives to Transformer networks in language modeling, by incorporating
Dongmin Park @ iclr25 (@dongmin_park11) 's Twitter Profile Photo

🚨New Paper Alert As a game company, KRAFTON AI is actively exploring how to apply LLM agents to video games. We present Orak—a foundational video gaming benchmark for LLM agents! Includes Pokémon, StarCraft II, Slay the Spire, Darkest Dungeon, Ace Attorney, and more in🧵

🚨New Paper Alert

As a game company, <a href="/Krafton_AI/">KRAFTON AI</a> is actively exploring how to apply LLM agents to video games.

We present Orak—a foundational video gaming benchmark for LLM agents!

Includes Pokémon, StarCraft II, Slay the Spire, Darkest Dungeon, Ace Attorney, and more in🧵