Excited to share our latest work BC-Design - a new framework for highly accurate inverse protein folding that achieves unprecedented 88.37% sequence recovery rate (previous SOTA achieved 67%)! 🧬
To put this in perspective, the field's progress on CATH 4.2 benchmark:
Reasoning models lack atomic thought ⚛️
Unlike humans using independent units, they store full histories🤔
Introducing Atom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1 !
The best part? It's plugs in for ANY framework 🔌
1/5
20 Months: 0 → 7 papers (2 ICLR orals) & 40+ institution collabs.
With a clear vision, we're building the open-source foundation for tomorrow's agents.
We also release MGX (mgx.dev) and commit to open-source its core soon.
Check threads for what we've built!
1/8
No fortress, purely open ground. Manus đź‘‹.
We open-sourced its core feature in 2 hours after dinner.
Check it out 👇:
github.com/mannaandpoem/O…
1/4
đź§µ1/
✨New preprint ✨
LLMs are getting better at answering medical questions.
However, they still struggle to spot and fix errors in their own reasoning.
That’s a big problem in medicine, where stakes are high and mistakes at any step could be critical.
To address this issue,
AbRank: A Benchmark Dataset and Metric-Learning Framework for Antibody–Antigen Affinity Ranking
1.AbRank introduces a large-scale benchmark for antibody–antigen (Ab–Ag) affinity prediction, reframing the task as pairwise ranking rather than regression. This design improves