Aditya Rajagopal (@adityaraja0) 's Twitter Profile
Aditya Rajagopal

@adityaraja0

Founder of nCompass Tech. Building optimized AI inference engines to enable cost effective AI inference at scale.

ID: 1748106018071146496

calendar_today18-01-2024 22:13:39

25 Tweet

60 Followers

250 Following

Aditya Rajagopal (@adityaraja0) 's Twitter Profile Photo

๐Ÿš€๐—ง๐—ต๐—ฟ๐—ถ๐—น๐—น๐—ฒ๐—ฑ ๐˜๐—ผ ๐—ฎ๐—ป๐—ป๐—ผ๐˜‚๐—ป๐—ฐ๐—ฒ ๐˜๐—ต๐—ฎ๐˜ ๐˜๐—ต๐—ฒ ๐—ป๐—–๐—ผ๐—บ๐—ฝ๐—ฎ๐˜€๐˜€ ๐—”๐—ฃ๐—œ ๐—ถ๐˜€ ๐—ป๐—ผ๐˜„ ๐—ถ๐—ป๐˜๐—ฒ๐—ด๐—ฟ๐—ฎ๐˜๐—ฒ๐—ฑ ๐—ถ๐—ป๐˜๐—ผ ๐˜๐—ต๐—ฒ ๐—ฃ๐—ผ๐—ฟ๐˜๐—ธ๐—ฒ๐˜† ๐—ฒ๐—ฐ๐—ผ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ! Deploying AI models into production involves navigating a complex, multi-layered application stack. With this

Aditya Rajagopal (@adityaraja0) 's Twitter Profile Photo

If youโ€™re looking to run a 430K context window AI at Meta's Llama 4 Maverick via an API and not suffer any rate limits, sign up here (app.ncompass.tech) and run your first query! From what Iโ€™ve seen, it works grest for large context document processing!

Aditya Rajagopal (@adityaraja0) 's Twitter Profile Photo

๐Ÿš€ Weโ€™re live on Product Hunt ๐Ÿ˜ธ and weโ€™re trending in the Top 10 ๐Ÿ‘€ Check out our inference engine + share feedback! ๐Ÿ‘‰ producthunt.com/products/ncompโ€ฆ #AI #Inference #ProductHunt

Aditya Rajagopal (@adityaraja0) 's Twitter Profile Photo

Profiling GPU-heavy AI models shouldnโ€™t require downloading 200MB traces, switching machines, and guessing which line caused the slowdown. So we built ncprof โ€” GPU + AI profiling directly inside VSCode/Cursor. โœจ Add TorchRecord/NVTX markers without touching code โœจ Built-in

Aditya Rajagopal (@adityaraja0) 's Twitter Profile Photo

We just shipped a new ncprof feature: automatic detection of repeated kernel patterns in your GPU traces + timing stats for each group. Super helpful when you're looking at 1000s of launches but most are repeats. Hereโ€™s a quick demo. Would love your feedback on what else you'd