
DeepSeek
@deepseek_ai
Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
ID: 1714580962569588736
https://www.deepseek.com/ 18-10-2023 09:55:45
139 Tweets
975,975K Followers
0 Following





Day 3 of #OpenSourceWeek: DeepGEMM
Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMMs, powering V3/R1 training and inference.
- Up to 1350+ FP8 TFLOPS on Hopper GPUs
- No heavy dependency, as clean as a tutorial
- Fully Just-In-Time compiled




Day 6 of #OpenSourceWeek: One More Thing - DeepSeek-V3/R1 Inference System Overview
Optimized throughput and latency via:
- Cross-node EP-powered batch scaling
- Computation-communication overlap
- Load balancing
Statistics of DeepSeek's Online Service:
- 73.7k/14.8k…

DeepSeek-V3-0324 is out now!
- Major boost in reasoning performance
- Stronger front-end development skills
- Smarter tool-use capabilities
- For non-complex reasoning tasks, we recommend using V3 - just turn off "DeepThink"
- API usage remains unchanged
- Models are…

DeepSeek-R1-0528 is here!
- Improved benchmark performance
- Enhanced front-end capabilities
- Reduced hallucinations
- Supports JSON output & function calling
- Try it now: chat.deepseek.com
- No change to API usage - docs here: api-docs.deepseek.com/guides/reasoni…