Huajian Xin (@huajian_xin) Twitter Tweets • TwiCopy

Huajian Xin

@huajian_xin

Ph.D. Student @InfAtEd on LLMs for theorem proving
| Student Researcher @BytedanceTalk Doubao/Seed
| Ex. @deepseek_ai
| Recent: DeepSeek-Prover, LEGO-Prover

ID: 1565340438009036802

Website: https://xinhuajian.wordpress.com/ | Joined: 01-09-2022 14:07:11

47 Tweets

1.1K Followers

317 Following

Huajian Xin

@huajian_xin

7 months ago

Hope so :)

Likes: 81 | Replies: 0 | Retweets: 4

From my personal observations, this is the most accurate description I've seen of DeepSeek's research and engineering ethos (with English translation provided by DeepSeek R1): (Source: mp.weixin.qq.com/s/WFJxnTF9fGII…)

Likes: 66 | Replies: 1 | Retweets: 9

Jia Li

@jiali52524397

7 months ago

🚀 NuminaMath 1.5 is here! 🚀 900k+ high-quality competition math problems with CoT solutions, new problem metadata, manually verified Olympiad problems, and more! 📚🏅 Check it out: 🔗 huggingface.co/datasets/AI-MO… Thanks to Zhenzhe Ying Léo Dreyfus-Schmidt

Likes: 393 | Replies: 5 | Retweets: 67

Kefan Dong

@kefandong

7 months ago

Update: check out github.com/kfdong/STP for our code, data, and model!

Likes: 41 | Replies: 1 | Retweets: 7

Huajian Xin

@huajian_xin

7 months ago

Glad to see DeepSeek-Prover V1.5 continuing to be used as the foundation model and for cold-start data synthesis. Hope you all enjoy it! 🚀

Likes: 73 | Replies: 1 | Retweets: 4

Huajian Xin

@huajian_xin

7 months ago

Had a great time giving a talk at UCL today on LLMs and formal mathematics! Excited to share my slides here: xinhuajian.wordpress.com/wp-content/upl…

Likes: 10 | Replies: 0 | Retweets: 3

Huajian Xin

@huajian_xin

6 months ago

No exaggeration and no bias, this is truly divine.

Likes: 140 | Replies: 0 | Retweets: 10

Jia Li

@jiali52524397

5 months ago

We believe formal math is the future. 🔥Introducing Kimina-Prover Preview, a Numina & Kimi.ai collaboration, the first large formal reasoning model for Lean 4, achieving 80.78% miniF2F. github.com/MoonshotAI/Kim…

Likes: 759 | Replies: 29 | Retweets: 134

Kimi.ai

@kimi_moonshot

5 months ago

🔬 Sharing an early look at Kimina-Prover, our new Lean theorem proving model from our collaboration with Numina! Jia Li 🏆 Using an RL pipeline for proof exploration, Kimina-Prover Preview achieved 80.7% on the miniF2F — currently SOTA on this benchmark. We see promise

Likes: 340 | Replies: 10 | Retweets: 64

Haiming Wang

@haimingw97

5 months ago

🚀 Thrilled to be a core contributor to Kimina-Prover. Huge thanks to the Numina & Moonshot teams! Kimi.ai This has been the most exciting project I've worked on since entering the field. A minimal, flexible, and powerful approach that unifies ATP ideas like no other.

Likes: 16 | Replies: 0 | Retweets: 2

Huajian Xin

@huajian_xin

4 months ago

Excited to present our work DeepSeek-Prover at #ICLR2025! Grateful for all the engaging discussions and feedback.

Likes: 252 | Replies: 3 | Retweets: 10

Jia Li

@jiali52524397

4 months ago

The last two problems left unsolved by AlphaProof at last year's IMO were both combinatorics. Introducing CombiBench Kimi.ai, a benchmark focusing on combinatorics problems! 🔥 🏆 moonshotai.github.io/CombiBench/ 📘 Dataset -> huggingface.co/datasets/AI-MO…

Likes: 136 | Replies: 2 | Retweets: 34

Huajian Xin

@huajian_xin

4 months ago

Although I left DeepSeek quite a while ago, being able to scale up to 671B truly feels like a dream come true for me. I'm deeply grateful to ZZ, Zhihong and other colleagues at DeepSeek for their support, to Liang for the opportunity, and to everyone in the field who has

Likes: 579 | Replies: 6 | Retweets: 44

Zhouliang Yu

@zhouliangy

4 months ago

🚀 Excited to introduce FormalMATH: a large-scale formal math benchmark with 5,560 formally verified Lean 4 statements from Olympiad and UG-level problems. 📉 Best model performance: just 16.46% — plenty of room for progress! 🔗 Explore the project: spherelab.ai/FormalMATH/

Likes: 23 | Replies: 1 | Retweets: 6

AI for Math Workshop @ ICML 2025

@ai4mathworkshop

4 months ago

Glad to announce the ICML 2025 Challenge on Automated Math Reasoning and Extensions! 🌟🧮⚛️ Track 1: File-level Automated Proof Engineering (APE) of Formal Math Libraries (APE-Bench I). Participation: codabench.org/competitions/8… Track 2: Physics Reasoning with Diagrams and

Likes: 1 | Replies: 0 | Retweets: 1

Huajian Xin

@huajian_xin

3 months ago

🚀 Thrilled to share that APE-Bench I has just been selected as Track 1 of the ICML 2025 AI4Math Workshop Challenge—the first competition devoted to automated proof engineering (APE) at scale. Formal mathematics is racing past the limits of manual refactoring, and APE-Bench I

Likes: 19 | Replies: 8 | Retweets: 2