TNG Technology Consulting GmbH (@tngtech)'s Twitter Profile
TNG Technology Consulting GmbH

@tngtech

TNG, aka "The Nerd Group", is a consulting partnership focused on high-end information technology, particularly AI. 916 employees, 99.9% academics, ~55% PhDs.

ID: 224374031

Link: http://www.tngtech.com/en · Joined: 08-12-2010 20:58:13

1.1K Tweets

816 Followers

80 Following

meng shao (@shao__meng)'s Twitter Profile Photo

DeepSeek-R1T-Chimera combines the intelligence of DeepSeek R1 with the token efficiency of V3, developed by the TNG Technology Consulting GmbH team.

Key features
- Scale: 685B parameters, an ultra-large-scale model
- Type: Text Generation Transformers
- Architecture: based on the DeepSeek-MoE Transformer architecture

Technical highlights
- A model-merging project combining DeepSeek-R1 and DeepSeek-V3 (0324)
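To make the model-merging idea concrete, here is a deliberately naive Python sketch that linearly interpolates two checkpoints sharing the same architecture. This is not TNG's actual Chimera construction method, only an illustration of same-architecture weight merging; the file names and the 50/50 blend factor are assumptions.

# Hypothetical sketch of naive weight interpolation between two checkpoints that
# share one architecture. NOT the Chimera recipe; paths and alpha are placeholders.
import torch
from safetensors.torch import load_file, save_file

def merge_checkpoints(path_a: str, path_b: str, out_path: str, alpha: float = 0.5) -> None:
    """Blend two state dicts with identical tensor names: out = alpha*A + (1-alpha)*B."""
    state_a = load_file(path_a)
    state_b = load_file(path_b)
    assert state_a.keys() == state_b.keys(), "checkpoints must share tensor names"
    merged = {
        name: alpha * state_a[name].float() + (1.0 - alpha) * state_b[name].float()
        for name in state_a
    }
    save_file(merged, out_path)

# Example call (file names are placeholders):
# merge_checkpoints("r1_shard.safetensors", "v3_shard.safetensors", "merged_shard.safetensors")
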
TNG Technology Consulting GmbH (@tngtech)'s Twitter Profile Photo

Oh man, lucky day 😅

R1T-Chimera is ranked the #2 trending model on OpenRouter

Sure, "trending" is a temporary attention metric, not reflecting total usage.

And the world spins fast: everybody talks about Qwen3 now. Still a nice screenshot with Google AI, Microsoft,
TNG Technology Consulting GmbH (@tngtech)'s Twitter Profile Photo

DeepSeek uploaded a new model on huggingface: DeepSeek-Prover-V2

It seems the architecture is identical to the V3 and R1 models, because:

model_config.py shows no difference, and the safetensors index files are the same.

One minor diff is a new experimental feature in
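A comparison like the one described in the tweet could be scripted roughly as follows. This is a hypothetical sketch, not the check TNG actually ran; the repo IDs and file names are assumptions about how such repos are typically laid out on Hugging Face.

# Hypothetical sketch: download the config and safetensors index of two repos and
# compare them. Repo IDs and file names are assumptions, not taken from the tweet.
import json
from huggingface_hub import hf_hub_download

def load_repo_json(repo_id: str, filename: str) -> dict:
    path = hf_hub_download(repo_id=repo_id, filename=filename)
    with open(path) as f:
        return json.load(f)

def compare_repos(repo_a: str, repo_b: str) -> None:
    for filename in ("config.json", "model.safetensors.index.json"):
        a = load_repo_json(repo_a, filename)
        b = load_repo_json(repo_b, filename)
        print(f"{filename}: {'identical' if a == b else 'differs'}")

# compare_repos("deepseek-ai/DeepSeek-V3", "deepseek-ai/DeepSeek-Prover-V2-671B")
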
Andrej Karpathy (@karpathy)'s Twitter Profile Photo

There's a new paper circulating looking in detail at the LMArena leaderboard: "The Leaderboard Illusion" arxiv.org/abs/2504.20879

I first became a bit suspicious when, at one point a while back, a Gemini model scored #1 way above the second best, but when I tried to switch for a few

TNG Technology Consulting GmbH (@tngtech)'s Twitter Profile Photo

Eight new AMD MI325X GPUs joined our compute cluster of NVIDIA H100s.

The new Supermicro server is an AI machine with a spectacular 2 terabytes of total GPU memory in one ~10 kW node.

ROCm worked right away with full VRAM and GPU utilization, allowing new types of
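A quick way to confirm that a ROCm build of PyTorch sees the new cards and their full VRAM might look like the sketch below. It relies on the fact that PyTorch's ROCm builds expose AMD GPUs through the torch.cuda API (backed by HIP); the exact check TNG performed is not stated in the tweet.

# Hypothetical sketch: sanity-check that a ROCm build of PyTorch sees all GPUs and
# their full memory. Works unchanged on NVIDIA nodes, since the API is shared.
import torch

def report_gpus() -> None:
    if not torch.cuda.is_available():
        print("no GPUs visible")
        return
    print("HIP version:", torch.version.hip)  # set on ROCm builds, None on CUDA builds
    for i in range(torch.cuda.device_count()):
        props = torch.cuda.get_device_properties(i)
        print(f"GPU {i}: {props.name}, {props.total_memory / 2**30:.0f} GiB")

report_gpus()
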
TNG Technology Consulting GmbH (@tngtech)'s Twitter Profile Photo

Hello #USA 🇺🇸 
TNG Technology Consulting USA Inc. is now incorporated in #Austin, #Texas. Thanks to our existing clients in #SiliconValley and #NewYork. We look forward to meeting more interesting people, fast companies and hard #IT problems to solve.
TNG Technology Consulting GmbH (@tngtech)'s Twitter Profile Photo

More evidence for the effectiveness of the Chimera construction method:

Taking DeepSeek's R1-0528 release, we started benchmarking new Chimera variants on AIME-24 and SimpleQA.

R1-0528 significantly improves AIME performance from 79.8 to 91.4 while doubling the amount of output