Kevin Wang (@kevinwang_111)'s Twitter Profile
Kevin Wang

@kevinwang_111

PhD student @UTAustin | 3D Foundation model, VLM, LLM Planning

ID: 1730054209867796480

https://www.kevin-ai.com/ | 30-11-2023 02:41:02

33 Tweets

155 Followers

84 Following

MrNeRF (@janusch_patas) 's Twitter Profile Photo

Learn more about the state-of-the-art in 3D Gaussian Splatting (3DGS) compression and how compression differs from compaction.

Kevin Wang (@kevinwang_111) 's Twitter Profile Photo

Exciting Opportunity! Wuyang Chen, Assistant Professor at SFU, is recruiting PhD students for Fall 2025 in AI/ML! 🚀 🌐 Explore their research: Delta Lab (delta-lab-ai.github.io/index.html) 📬 Interested? Reach out to Wuyang & apply! #AI #MachineLearning #PhDOpportunity #SFU

Jason Wei (@_jasonwei) 's Twitter Profile Photo

An underrated but occasionally make-or-break skill in AI research (that didn’t really exist ten years ago) is the ability to find a dataset that actually exercises a new method you are working on. Back in the day when the bottleneck in AI was learning, many methods were

Wenyan Cong (@congwenyan0320) 's Twitter Profile Photo

🚀 Our latest work, VideoLifter, is here! VideoLifter converts long video sequences into 3D representations with unprecedented speed, achieving 82% faster training while maintaining state-of-the-art accuracy. 🔗 Explore the details and visuals on our project page:

Kevin Wang (@kevinwang_111) 's Twitter Profile Photo

Picture a ball flying toward you: you don’t replay every frame to know it’ll hit. Today’s world models predict visually, understand via captions, and logic through text. The next leap? World Models that fuse neural states with symbolic logic for human-like reasoning.

Kevin Wang (@kevinwang_111) 's Twitter Profile Photo

🎶 Excited to introduce SPIN-Bench! 🎉 TL;DR: Benchmarking LLM capabilities across various strategic planning game environments. 🚀 🌐 Project Page: spinbench.github.io 📄 arXiv: arxiv.org/abs/2503.12349 🎮 Interact with PDDL domains: spinbench.github.io/tools/pddl/tra… 📉 LLM

Kevin Wang (@kevinwang_111) 's Twitter Profile Photo

Claude-sonnet (White) vs. o1-mini (Black): Given an absolutely winning poisoned position, o1-mini boldly plays... e5h5! 🤔 Small legal action space, but vast stage action space, places heavy cognitive burden on LLMs. Explore more chess examples & insights in SPIN-Bench here:

Philipp Schmid (@_philschmid) 's Twitter Profile Photo

How well Do LLMs Plan Strategically? Can they beat Humans in board games? SPIN or Strategic Planning, Interaction, and Negotiation is a new multi-domain evaluation showing LLMs can do basic planning, but fail in complex strategic and social reasoning tasks compared to humans.

Kevin Wang (@kevinwang_111) 's Twitter Profile Photo

🚀 Proud to share our new arXiv preprint: VLM-3R learns 3D spatial-temporal reasoning straight from monocular video, with no depth sensors or prebuilt maps! Trained on 200K+ 3D QA instructs 👉 vlm-3r.github.io