sid (@sid_mnk) 's Twitter Profile
sid

@sid_mnk

@pulse__ai | prev nvidia, de shaw, berkeley cs

ID: 1802588977180069888

calendar_today17-06-2024 06:27:59

84 Tweet

267 Followers

93 Following

Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

@pulse__ai team just dropped why "98% accurate" document extraction still breaks in production with 4000 errors per 1000 pages. single accuracy scores miss broken reading order, shifted table columns, and lost cross page context that silently corrupt entire datasets. we’ve

@pulse__ai team just dropped why "98% accurate" document extraction still breaks in production with 4000 errors per 1000 pages.

single accuracy scores miss broken reading order, shifted table columns, and lost cross page context that silently corrupt entire datasets.

we’ve
Y Combinator (@ycombinator) 's Twitter Profile Photo

Pulse (Pulse) just launched their state-of-the-art document extraction platform. It turns complex PDFs, scans, decks, and images into LLM-ready data. No training required. runpulse.com/blog/pulse-ope… Congrats on the launch, sid and Ritvik Pandey!

Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

The Pulse team just published "The Precision Tax" - why "99% accuracy" fails in finance. One percent error in financial document processing means broken valuations, failed covenant tests, and regulatory exposure. The real benchmark isn't accuracy, it's determinism. Same

The <a href="/Pulse__AI/">Pulse</a>  team just published "The Precision Tax" - why "99% accuracy" fails in finance.

One percent error in financial document processing means broken valuations, failed covenant tests, and regulatory exposure. The real benchmark isn't accuracy, it's determinism. Same
Pulse (@pulse__ai) 's Twitter Profile Photo

Pulse is now officially part of Cloudera's Enterprise AI ecosystem. Excited to partner with Cloudera and continue delivering the most accurate document extraction models at enterprise scale.

Pulse is now officially part of <a href="/cloudera/">Cloudera</a>'s Enterprise AI ecosystem.

Excited to partner with Cloudera and continue delivering the most accurate document extraction models at enterprise scale.
Cloudera (@cloudera) 's Twitter Profile Photo

That's a wrap on #EVOLVE25 in NYC. Some highlights from our time in the Big Apple... 👋 Welcoming Galileo, ServiceNow, Fundamental, and Pulse to the Enterprise AI Ecosystem 🤝 Showcasing customer success with AbbVie, IQVIA, and Banco do Brasil 🐾 Featuring Rain or

That's a wrap on #EVOLVE25 in NYC. Some highlights from our time in the Big Apple...

👋 Welcoming <a href="/rungalileo/">Galileo</a>, <a href="/ServiceNow/">ServiceNow</a>, Fundamental, and <a href="/Pulse__AI/">Pulse</a> to the Enterprise AI Ecosystem

🤝 Showcasing customer success with <a href="/abbvie/">AbbVie</a>, <a href="/IQVIA/">IQVIA</a>, and <a href="/BancodoBrasil/">Banco do Brasil</a> 

🐾 Featuring Rain or
Y Combinator (@ycombinator) 's Twitter Profile Photo

.Pulse just launched Ultra Nano, their new enterprise-focused document extraction model with complete self-hosting, already running across Fortune 50s, insurers, investment firms, banks, and foundational model labs. runpulse.com/blog/self-host… Congrats on the launch, sid

.<a href="/Pulse__AI/">Pulse</a> just launched Ultra Nano, their new enterprise-focused document extraction model with complete self-hosting, already running across Fortune 50s, insurers, investment firms, banks, and foundational model labs.

runpulse.com/blog/self-host…

Congrats on the launch, <a href="/sid_mnk/">sid</a>
Cloudera (@cloudera) 's Twitter Profile Photo

We recently welcomed new members to our Enterprise AI Ecosystem: ServiceNow, Galileo, Pulse, and Fundamental. In doing so, we're able to deliver complete, production-ready AI solutions to customers. Our Abhas Ricky says it best: "We’re only as good as our ecosystem and

Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

DeepSeek AI dropped a new open-source OCR model today 👀 At @pulse__ai, we tested it on financial docs, handwritten forms, and complex tables. The results showed the same issues plaguing LLM-driven OCR: - Unstable outputs - Hallucinated text - Broken table structures Reality

DeepSeek AI dropped a new open-source OCR model today 👀

At @pulse__ai, we tested it on financial docs, handwritten forms, and complex tables. The results showed the same issues plaguing LLM-driven OCR:

- Unstable outputs
- Hallucinated text
- Broken table structures

Reality
sid (@sid_mnk) 's Twitter Profile Photo

Exciting research preview to share on XLSX parsing at Pulse . Spreadsheets are deceptively hard - merged cells, multi-tab workbooks, and cross-sheet references break when you flatten them. Our team has developed and implemented a token-efficient encoder resulting in

Exciting research preview to share on XLSX parsing at <a href="/Pulse__AI/">Pulse</a> . Spreadsheets are deceptively hard - merged cells, multi-tab workbooks, and cross-sheet references break when you flatten them. 

Our team has developed and implemented a token-efficient encoder resulting in