Juno KIM (@junokim_ai)'s Twitter Profile
Juno KIM

@junokim_ai

Incoming EECS PhD student @UCBerkeley. Research: deep learning theory.

ID: 1751494597870858240

Website: https://junokim1.github.io/ · Joined: 28-01-2024 06:37:24

42 Tweets

218 Followers

102 Following

Juno KIM (@junokim_ai)'s Twitter Profile Photo

Also giving a contributed talk on the learning-theoretic complexity and optimality of ICL at the Theoretical Foundations of Foundation Models workshop. Happy to share our results with the ML theory and LLM community!
Zeyuan Allen-Zhu, Sc.D. (@zeyuanallenzhu)'s Twitter Profile Photo

Incredibly honored and humbled by the overwhelming response to my tutorial, and thank you to everyone who attended in person. Truly heartwarming to hear how much you enjoyed it. Many have been asking for a recording, and I prepared one with my own subtitles: youtu.be/yBL7J0kgldU

Alex Bilzerian (@alexbilz)'s Twitter Profile Photo

'High-Dimensional Probability' - Vershynin (2024, PDF): math.uci.edu/~rvershyn/pape… Full course on high-dimensional probability with 41 video lectures & 13 problem sets provided: math.uci.edu/~rvershyn/teac…

Juno KIM (@junokim_ai)'s Twitter Profile Photo

Our paper on statistical complexity and optimality of in-context learning of deep transformers has been accepted to NeurIPS! arxiv.org/abs/2408.12186

Taiji Suzuki (@btreetaiji)'s Twitter Profile Photo

Here is a great YouTube video by Charles Riou that clearly explains our recent paper with Juno KIM on chain-of-thought, accepted at ICLR 2025. Check it out! Kim & Suzuki: Transformers Provably Solve Parity Efficiently with Chain of Thought. ICLR 2025. youtu.be/pj-GEBU2iVs?si…
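The parity result lends itself to a tiny illustration: computing the parity of n bits in a single shot is hard, but it decomposes into n trivial XOR steps once the intermediate results are written out as a chain of thought. A minimal Python sketch of that decomposition (a toy illustration only, not the paper's transformer construction; `parity_with_cot` is a made-up name):

```python
def parity_with_cot(bits):
    """Return (final_parity, chain), where chain lists each partial parity.

    Each entry of `chain` is one "thought": the parity of the prefix seen
    so far, produced by a single easy XOR step.
    """
    chain = []
    acc = 0
    for b in bits:
        acc ^= b           # one easy step: XOR running parity with the next bit
        chain.append(acc)  # emit the intermediate result as a "thought"
    return acc, chain

final, chain = parity_with_cot([1, 0, 1, 1])
print(final)  # 1
print(chain)  # [1, 1, 0, 1]
```

The point of the chain is that each step depends only on the previous thought and one new bit, so the hard global function becomes a sequence of locally easy ones.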

Sasha Rush (@srush_nlp)'s Twitter Profile Photo

Simons Institute Workshop: "Future of LLMs and Transformers": 21 talks Monday - Friday next week. simons.berkeley.edu/workshops/futu…

Juno KIM (@junokim_ai)'s Twitter Profile Photo

Graduated with my master's from the University of Tokyo!🇯🇵 I was honored to receive the graduate school dean's award for outstanding research and to deliver a speech at the ceremony! Deeply grateful to my supervisor Taiji Suzuki🙏 i.u-tokyo.ac.jp/news/topics/20…

Juno KIM (@junokim_ai)'s Twitter Profile Photo

Our paper studying the dynamics of a simple Markov model for CoT reasoning has been accepted to #ICML2025! (reposting a nice summary ↓)

Zixuan Wang (@zzzixuanwang)'s Twitter Profile Photo

LLMs can solve complex tasks that require combining multiple reasoning steps. But when are such capabilities learnable via gradient-based training? In our new COLT 2025 paper, we show that easy-to-hard data is necessary and sufficient! arxiv.org/abs/2505.23683 🧵 below (1/10)

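The easy-to-hard protocol can be sketched as a staged training loop: present tasks requiring k reasoning steps only after the (k−1)-step stage has been trained on. A hedged sketch under assumed names (`sampler` and `train_step` are hypothetical placeholders for a real data pipeline and optimizer update; this is not the paper's implementation):

```python
def curriculum_train(train_step, sampler, max_depth, rounds_per_stage=2):
    """Run gradient updates stage by stage, from 1-step to max_depth-step tasks."""
    for depth in range(1, max_depth + 1):      # easy (depth 1) -> hard (max_depth)
        for _ in range(rounds_per_stage):
            batch = sampler(depth)             # tasks requiring exactly `depth` steps
            train_step(batch)

# Usage: log which difficulty each update saw, confirming the easy-first order.
seen = []
curriculum_train(train_step=seen.append,
                 sampler=lambda d: f"{d}-step batch",
                 max_depth=3)
print(seen)
# ['1-step batch', '1-step batch', '2-step batch',
#  '2-step batch', '3-step batch', '3-step batch']
```

The paper's claim is about this ordering of the data, not any particular stage length, so `rounds_per_stage` here is purely illustrative.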
Jyo Pari (@jyo_pari)'s Twitter Profile Photo

What if an LLM could update its own weights? Meet SEAL🦭: a framework where LLMs generate their own training data (self-edits) to update their weights in response to new inputs. Self-editing is learned via RL, using the updated model’s downstream performance as reward.

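The loop described in the tweet can be sketched schematically: the model proposes a self-edit (its own training data), a copy is fine-tuned on that edit, and the downstream performance of the updated copy becomes the RL reward for the edit-generating policy. Every function below is a hypothetical stub standing in for that role, not the SEAL authors' implementation:

```python
def seal_round(model, generate_self_edit, finetune, evaluate, reinforce):
    """One round of the self-edit loop described above."""
    edit = generate_self_edit(model)   # model writes its own training data
    updated = finetune(model, edit)    # inner-loop weight update on that data
    reward = evaluate(updated)         # downstream performance of the updated model
    reinforce(model, edit, reward)     # RL step on the edit-generating policy
    return updated, reward

# Usage with toy stubs, just to trace the data flow of one round.
log = []
updated, reward = seal_round(
    model="m0",
    generate_self_edit=lambda m: f"edit({m})",
    finetune=lambda m, e: f"{m}+{e}",
    evaluate=lambda m: len(m),
    reinforce=lambda m, e, r: log.append((e, r)),
)
```

The key design point the tweet highlights is that the reward signal flows from the *updated* model back to the policy that generated the edit.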