James Thorne (@j6mes)'s Twitter Profile
James Thorne

@j6mes

CTO at Theia Insights. Assistant Prof at KAIST AI. PhD from @cambridge_cl. Co-organiser of the FEVER workshop (fever.ai)

ID: 1950252162

Website: https://jamesthorne.com · Joined: 09-10-2013 23:19:59

763 Tweets

1.1K Followers

592 Following

James Thorne (@j6mes)'s Twitter Profile Photo

Pleasure to host Ohad Rubin at KAIST AI to hear about his ongoing work on retrieval-augmented language models, long-range modeling, and RPT arxiv.org/abs/2306.13421

KAIST AI (@kaist_ai)'s Twitter Profile Photo

⭐️Congratulations to everyone at KAIST AI who has a total of 20 papers accepted to #NeurIPS2023! Check them here 🧵 docs.google.com/document/d/1t5…

KAIST AI (@kaist_ai)'s Twitter Profile Photo

⭐️Congratulations to everyone at KAIST AI who has a total of 11 papers accepted to #EMNLP2023! Check them here 📷🧵 docs.google.com/document/d/1YB…

UKP Lab (@ukplab)'s Twitter Profile Photo

A group photo from the poster presentation of »AmbiFC: Fact-Checking Ambiguous Claims with Evidence«, co-authored by @Max_Glockner (UKP Lab), Ieva Staliūnaitė (Cambridge Computer Science), James Thorne (KAIST AI), Gisela Vallejo (@unimelb), Andreas Vlachos (Cambridge Computer Science) and Iryna Gurevych. #EMNLP2023

James Thorne (@j6mes)'s Twitter Profile Photo

Pleased to announce that Theia Insights has raised $6.5M to develop foundational AI technologies for the investment industry. Well done team!

Alice Oh (@aliceoh)'s Twitter Profile Photo

I can’t keep up with all these new papers and datasets from my students 😊 James Thorne and I wanted to give #kaist undergrad students research experience with LLMs in their own languages, and this is one of the outcomes of that project!

Daniel Vila Suero (@dvilasuero)'s Twitter Profile Photo


This is actually huge:

- No SFT stage (e.g., Zephyr used 200k examples)
- Preference tuning with 7K examples only (other models trained with at least 60k samples)

I've put a lot of care & love building the DPO version of the amazing Capybara dataset from LDJ so I'm
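For context, the DPO objective the tweet refers to can be sketched in a few lines. This is a minimal, illustrative version for a single preference pair (the function name and scalar inputs are my own simplification, not any particular library's API):

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.
    Inputs are summed log-probabilities of the chosen/rejected responses
    under the policy being tuned and a frozen reference model."""
    # Implicit reward margin: how much more the policy prefers the chosen
    # response over the rejected one, relative to the reference model.
    logits = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    # Negative log-sigmoid of the margin: small when the policy already
    # clearly prefers the chosen response.
    return -math.log(1.0 / (1.0 + math.exp(-logits)))
```

When the policy and reference agree exactly, the margin is zero and the loss sits at log 2; training pushes the margin positive. Note that DPO, unlike ORPO below in the timeline, still needs a frozen reference model.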
Sayak Paul (@risingsayak)'s Twitter Profile Photo


Aligning a diffusion model on preference data WITHOUT a reference model could be nice no?

So, Kashif Rasul and I are ideating the use of ORPO to align SDXL 1.0 on PickAPic.

Diffusion ORPO with LoRA 💫

Code and model ⬇️ huggingface.co/sayakpaul/diff…
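The reason ORPO needs no reference model is that its preference term compares the policy's own odds of the chosen vs. rejected response. A minimal sketch of that odds-ratio penalty for one pair (function name and the scalar interface are my own simplification):

```python
import math

def orpo_penalty(chosen_logp_avg, rejected_logp_avg):
    """Odds-ratio penalty in the spirit of ORPO, for one preference pair.
    Inputs are length-normalized log-probabilities (log p per token) of
    the chosen and rejected responses under the SAME policy being tuned;
    no frozen reference model is involved."""
    def log_odds(logp):
        p = math.exp(logp)
        return math.log(p / (1.0 - p))  # log odds of the sequence
    ratio = log_odds(chosen_logp_avg) - log_odds(rejected_logp_avg)
    # Negative log-sigmoid: shrinks as the chosen response becomes
    # relatively more likely than the rejected one.
    return -math.log(1.0 / (1.0 + math.exp(-ratio)))

# Full ORPO objective (sketch): standard NLL on the chosen response
# plus a weighted orpo_penalty, so alignment happens during SFT itself.
```

This is why the timeline's later Mixtral-ORPO result (7k pairs, no separate SFT stage) is feasible: the preference signal is folded into ordinary supervised training.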
Alice Oh (@aliceoh)'s Twitter Profile Photo

Great work! I'm glad our benchmarks KoBBQ, KOLD, and CLIcK are used to evaluate HyperCLOVA_X. Making progress in Korean LLMs! 🇰🇷🇰🇷🇰🇷 arxiv.org/abs/2307.16778 arxiv.org/abs/2205.11315 arxiv.org/abs/2403.06412

Jiwoo Hong @ NAACL 2025 (@jiwoohong98)'s Twitter Profile Photo

🔥Mixtral-8x22B-base + ORPO🔥 7k data & 1.3 hours to build a strong human-aligned 140B chat model🦾 👉IFEval: 65% 👉BBH: 59% 👉MT-Bench: 8.17 More models will be added to the Zephyr-ORPO collection with Argilla and Hugging Face, stay tuned😃

Alice Oh (@aliceoh)'s Twitter Profile Photo

We are hosting wonderful NLP colleagues at KAIST on their way to ACL Bangkok! 🤩 On-site registration is closed, but the talks will be broadcast on Zoom. Please join us! Date/Time: Aug 10, 2024, 10:05-12:30 KST (UTC+9) Parallel Session 1: Advanced Language Models and AI

Valeria Ruscio (@rusciovaleria)'s Twitter Profile Photo

📢 Excited to share our new paper with Fabrizio Silvestri: "Beyond Position: How Rotary Embeddings Shape Representations and Memory in Autoregressive Transformers"! arxiv.org/abs/2410.18067 Keep reading to find out how RoPE affects Transformer models beyond just positional encoding 🧵
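For readers unfamiliar with the mechanism the paper studies: RoPE rotates pairs of query/key channels by a position-dependent angle, so relative offsets survive in the attention dot product. A minimal NumPy sketch, assuming the split-half pairing convention (one of several in use; the function name is illustrative):

```python
import numpy as np

def apply_rope(x, positions, base=10000.0):
    """Rotary position embedding on a (seq, dim) array of query/key
    vectors; dim must be even. Channel i is paired with channel
    i + dim//2, and each pair is rotated by an angle that grows with
    position, so the dot product between a rotated query and key
    depends only on their relative offset."""
    seq, dim = x.shape
    half = dim // 2
    # Per-pair rotation frequencies, geometrically spaced as in RoPE.
    freqs = base ** (-np.arange(half) / half)
    angles = np.outer(positions, freqs)          # (seq, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    # 2-D rotation applied to each (x1_i, x2_i) channel pair.
    return np.concatenate([x1 * cos - x2 * sin,
                           x1 * sin + x2 * cos], axis=-1)
```

Because each pair is only rotated, vector norms are preserved, and shifting both positions by the same amount leaves query-key dot products unchanged; the paper's question is what RoPE does to representations beyond this relative-position property.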

James Thorne (@j6mes)'s Twitter Profile Photo

NVIDIA's earnings call wasn’t about semiconductors — it was a vision for AI factories, 24/7 inference, and industry transformation. Using TIIC, we reveal $NVDA's diversification shows the future is broader than chips. What's your take? #AI #NVIDIA #LLMs linkedin.com/pulse/nvidias-…

James Thorne (@j6mes)'s Twitter Profile Photo

Looking forward to attending #NeurIPS2024 this week. Come meet Namgyu Ho, who will present Block Transformer, which speeds up decoding by up to 20x.

Andreas Vlachos (@vlachos_nlp)'s Twitter Profile Photo

Pleased to announce the next FEVER workshop at ACL 2025! Regular workshop papers (ARR and direct submissions) due 15th of April! And a new shared task focusing on reproducible and efficient verification of real-world claims! Check fever.ai and get keen!

Jiwoo Hong @ NAACL 2025 (@jiwoohong98)'s Twitter Profile Photo


⁉️Why do reward models suffer from over-optimization in RLHF?

We revisit how representations are learned during reward modeling, revealing “hidden state dispersion” as the key, with a simple fix!
🧵

Meet us at ICML Conference!
📅July 16th (Wed) 11AM–1:30PM
📍East Hall A-B E-2608