Arcee.ai (@arcee_ai)'s Twitter Profile
Arcee.ai

@arcee_ai

Optimize cost & performance with AI platforms powered by our industry-leading SLMs: Arcee Conductor for model routing, & Arcee Orchestra for agentic workflows.

ID: 1699072621344923648

Website: https://arcee.ai · Joined: 05-09-2023 14:51:10

558 Tweets

3.3K Followers

407 Following

Prince Canuma (@prince_canuma)

Congratulations to the amazing team at Arcee.ai on their first LLM built from scratch! 🔥 I got to see the beginning, and I’m happy with the final results. Arcee Foundation Model (AFM) is an SLM that punches above its weight class 🚀 Wonderful work! Lucas Atkins Fernando Fernandes Neto

Arcee.ai (@arcee_ai)

Our first foundation model, AFM-4.5B, is not even 24 hours old, and our users are already going wild. "Don't sleep on Arcee" seems to be the motto. We love that, because we haven't slept much lately 😃

You can try the model in our playground (afm.arcee.ai/#Chat-UI) and on
Siddharth Joshi (@sjoshi804)

Congratulations to the DatologyAI team on powering the data for AFM-4.5B by Arcee.ai - competitive with Qwen3 while using way, way less data! This is exactly why I'm so excited to be joining DatologyAI this summer to push the frontier of data curation 🚀

Arcee.ai (@arcee_ai)

Last week, we launched AFM-4.5B, our first foundation model.

In this post by Charles Goddard, you will learn how we extended the context length of AFM-4.5B from 4k to 64k through aggressive experimentation, model merging, distillation, and a concerning amount of soup.

Bon appétit!
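The "soup" refers to model souping: uniformly averaging the weights of several fine-tuned checkpoints that share an architecture. A minimal sketch of the idea in Python (checkpoint paths and the output name are illustrative placeholders, not Arcee's actual recipe):

```python
# Model-soup sketch: uniform weight averaging across checkpoints that
# share one architecture. All paths here are illustrative placeholders.
import torch
from transformers import AutoModelForCausalLM

checkpoint_paths = ["ckpt-longctx-a", "ckpt-longctx-b", "ckpt-longctx-c"]

# Load the first checkpoint and use its state dict as the accumulator.
soup = AutoModelForCausalLM.from_pretrained(checkpoint_paths[0])
avg_state = {k: v.clone().float() for k, v in soup.state_dict().items()}

# Add in the weights of the remaining checkpoints.
for path in checkpoint_paths[1:]:
    ckpt = AutoModelForCausalLM.from_pretrained(path)
    for k, v in ckpt.state_dict().items():
        avg_state[k] += v.float()

# Average, cast back to the original dtypes, and save the soup.
orig_dtypes = {k: v.dtype for k, v in soup.state_dict().items()}
avg_state = {k: (v / len(checkpoint_paths)).to(orig_dtypes[k])
             for k, v in avg_state.items()}
soup.load_state_dict(avg_state)
soup.save_pretrained("afm-4.5b-soup")  # hypothetical output path
```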
kalomaze (@kalomaze)

this release is pure class. arcee used their data to do some short-term continued pretraining on GLM 32b.
long context support has gone from effectively 8k -> 32k, and all base model evaluations (including short context ones) have improved
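"Continued pretraining" here means resuming plain next-token training on new data, as opposed to instruction tuning. A rough sketch with Hugging Face Transformers, assuming a long-document corpus; the model name follows the thread, but the data file and hyperparameters are placeholders:

```python
# Hedged sketch of short continued pretraining on long sequences.
# Dataset file and hyperparameters are placeholders, not Arcee's setup.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "THUDM/GLM-4-32B-0414"  # the base model discussed in the thread
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Tokenize a long-document corpus, truncating to 32k-token sequences.
raw = load_dataset("text", data_files={"train": "long_docs.txt"})["train"]
train = raw.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=32768),
    batched=True, remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="glm4-cpt", max_steps=500, bf16=True,
                           per_device_train_batch_size=1,
                           gradient_accumulation_steps=16),
    train_dataset=train,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```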
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxestex)

You can't overstate how impressive this is. Arcee took one of the strongest base models, GLM-4, a product of many years of Tsinghua R&D (THUDM/THUKEG/Z.ai, GLM-130B was maybe *the first* real open weights attack on OpenAI, Oct 2022)… and made it plain better.
And told us how.
𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8)

“First of many blogs” from Arcee.
AFM-4.5B scaled from 4K → 64K context.

⮕ arcee.ai/blog/extending…

𝑱𝑼𝑺𝑻 𝑴𝑬𝑹𝑮𝑬, 𝑫𝑰𝑺𝑻𝑰𝑳𝑳, 𝑹𝑬𝑷𝑬𝑨𝑻.

Proof it scales:
Same merge–distill cycle applied to GLM-4-32B.
Fixes 8K degradation in the 0414 release. +5% overall, strong
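For the "distill" half of that cycle, the standard recipe is logit distillation: a KL-divergence loss that pulls the student's token distribution toward the teacher's. A generic sketch (temperature and tensor shapes are illustrative, not necessarily Arcee's exact loss):

```python
# Generic logit-distillation loss: KL divergence between softened
# teacher and student distributions. Temperature value is illustrative.
import torch
import torch.nn.functional as F

def distill_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) over the vocabulary, averaged per token."""
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # The T^2 factor keeps gradient magnitudes comparable across temperatures.
    return F.kl_div(log_p_student, p_teacher,
                    reduction="batchmean") * temperature ** 2

# Toy usage: logits shaped (tokens, vocab_size).
student = torch.randn(8, 32000, requires_grad=True)
teacher = torch.randn(8, 32000)
distill_loss(student, teacher).backward()
```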
Arcee.ai (@arcee_ai)

In this post, Mariam Jabara, one of our Field Engineers, walks you through three real-life use cases for model merging, recently published in research papers:

➡️ Model Merging in Pre-training of Large Language Models
➡️ PatientDx: Merging Large Language Models for Protecting
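A common thread in merging work like this is task arithmetic: treat each fine-tune as a delta (a "task vector") from a shared base model and combine the deltas. A minimal sketch, with all model paths and scaling factors invented for illustration:

```python
# Task-arithmetic merge sketch: combine the weight deltas of two
# fine-tunes relative to a shared base. Paths and weights are placeholders.
import torch
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("shared-base-model")
state_base = base.state_dict()
state_a = AutoModelForCausalLM.from_pretrained("finetune-a").state_dict()
state_b = AutoModelForCausalLM.from_pretrained("finetune-b").state_dict()

merged = {}
for k, w in state_base.items():
    delta_a = state_a[k].float() - w.float()  # task vector of fine-tune A
    delta_b = state_b[k].float() - w.float()  # task vector of fine-tune B
    # Per-model scaling factors are a tunable knob; 0.6/0.4 is arbitrary here.
    merged[k] = (w.float() + 0.6 * delta_a + 0.4 * delta_b).to(w.dtype)

base.load_state_dict(merged)
base.save_pretrained("merged-model")  # hypothetical output path
```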
Arcee.ai (@arcee_ai)

We’re beyond thrilled to share that Arcee AI Conductor has been named “LLM Application of the Year” at the 2025 AI Breakthrough Awards.

This recognition isn’t just a shiny badge—it’s a celebration of a vision we’ve been chasing for years: making AI smarter, more accessible, and
Arcee.ai (@arcee_ai)

Today, we're excited to announce the integration of Arcee.ai Conductor, our SLM/LLM model routing solution, into the Zerve AI platform, an agent-driven operating system for Data & AI teams 😃

This collaboration enables data scientists, engineers, and AI developers to build,
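The announcement doesn't detail Conductor's internals, but the core idea of SLM/LLM routing is easy to illustrate: inspect each prompt and dispatch it to a small or large model accordingly. A toy sketch whose heuristic and endpoint names are entirely invented:

```python
# Toy SLM/LLM router, for illustration only; this is not Conductor's
# actual logic, and both endpoint names are placeholders.
from dataclasses import dataclass

@dataclass
class Route:
    model: str
    reason: str

SLM = "small-model-endpoint"  # cheap, fast small language model
LLM = "large-model-endpoint"  # expensive, more capable large model

def route(prompt: str) -> Route:
    reasoning_markers = ("prove", "derive", "step by step", "debug")
    needs_reasoning = any(m in prompt.lower() for m in reasoning_markers)
    if len(prompt.split()) < 50 and not needs_reasoning:
        return Route(SLM, "short prompt with no explicit reasoning request")
    return Route(LLM, "long or reasoning-heavy prompt")

print(route("Summarize this paragraph in one sentence."))
print(route("Derive the gradient of softmax cross-entropy step by step."))
```

A production router would learn this decision from data (for example, a classifier trained on routing outcomes) rather than relying on keyword heuristics.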
Arcee.ai (@arcee_ai)

Today, we're happy to announce the open-weights release of five language models, including three enterprise-grade production models that have been powering customer workloads through our SaaS platform (SuperNova, Virtuoso-Large, Caller), and two cutting-edge research models
Arcee.ai (@arcee_ai)

Save the date! A week from now, please join us live to discover how Zerve AI is leveraging our model routing solution, Arcee Conductor, to improve its agentic platform for data science workflows. This should be a super interesting discussion, and of course, we'll do demos!
Julien Simon (@julsimon)

In this new video, I introduce two new research-oriented models that Arcee.ai recently released on Hugging Face.

Homunculus is a 12-billion-parameter instruction model distilled from Qwen3-235B onto the Mistral AI Nemo backbone. It was purpose-built to preserve Qwen’s
Julien Simon (@julsimon)

In this video, I introduce and demonstrate three production-grade models that Arcee.ai recently released as open weights on Hugging Face.

Arcee-SuperNova-v1 (70B) is a merged model built from multiple advanced training approaches. At its core is a distilled version of
Julien Simon (@julsimon)

In this fun demonstration, you can witness the impressive capabilities of Arcee.ai AFM-4.5B-Preview, Arcee's first foundation model, across diverse domains. The demo showcases the model tackling complex knowledge questions, creating sophisticated creative writing, and
Arcee.ai (@arcee_ai)

As generative AI becomes increasingly central to business applications, the cost, complexity, and privacy concerns associated with language models are becoming significant.

At Arcee.ai, we’ve been asking a critical question: Can CPUs actually handle the demands of language
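One common way to explore that question in practice is to run a quantized model on CPU with llama.cpp. A minimal sketch using the llama-cpp-python bindings; the GGUF file name and thread count are placeholders, not a description of Arcee's benchmark setup:

```python
# CPU inference sketch with llama-cpp-python and a quantized GGUF model.
# The model file, context size, and thread count are placeholders.
from llama_cpp import Llama

llm = Llama(
    model_path="afm-4.5b-q4_k_m.gguf",  # hypothetical quantized checkpoint
    n_ctx=4096,    # context window
    n_threads=8,   # CPU threads; tune to the machine's core count
)

out = llm(
    "In two sentences, explain why small language models can run well on CPUs.",
    max_tokens=128,
    temperature=0.7,
)
print(out["choices"][0]["text"])
```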