Pulse (@pulse__ai) 's Twitter Profile
Pulse

@pulse__ai

ID: 1804369767563923456

linkhttps://www.runpulse.com calendar_today22-06-2024 04:24:15

35 Tweet

310 Followers

3 Following

Y Combinator (@ycombinator) 's Twitter Profile Photo

After processing 400M+ pages for the world's largest investment firms, AI startups, and Fortune 500s, Pulse is launching Ultra: their new hybrid reasoning model. It's the most accurate document extraction model in the industry. Live for all customers today.

Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

After processing nearly 500M pages, we discovered the biggest challenge in document AI isn't OCR accuracy - it's semantic understanding across page breaks and column boundaries. 🧵 (1/8)

After processing nearly 500M pages, we discovered the biggest challenge in document AI isn't OCR accuracy - it's semantic understanding across page breaks and column boundaries. 🧵

(1/8)
Y Combinator (@ycombinator) 's Twitter Profile Photo

Pulse (Pulse) has just launched Meridian, an AI-powered financial document processor that can automatically convert any PDF, Word doc, PowerPoint presentation, or image into a structured Excel export with charts and graphs. runpulse.com/blog/introduci… Congrats on the launch,

Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

we're super excited to be launching Meridian publicly! no more analysts having to manually copy numbers from pdfs into spreadsheets at 2 am before a board deadline if you're interested in trying it out give me a DM! 🫡

Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

the team at Pulse put bytedance's dolphin OCR to the test against complex documents that matter for real business use cases. while it shows improvements in reading order detection, we found critical limitations across key areas: - 7.7% structured data extraction from

the team at <a href="/Pulse__AI/">Pulse</a>  put bytedance's dolphin OCR to the test against complex documents that matter for real business use cases. 

while it shows improvements in reading order detection, we found critical limitations across key areas:
-  7.7% structured data extraction from
Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

@pulse__ai team just dropped why "98% accurate" document extraction still breaks in production with 4000 errors per 1000 pages. single accuracy scores miss broken reading order, shifted table columns, and lost cross page context that silently corrupt entire datasets. we’ve

@pulse__ai team just dropped why "98% accurate" document extraction still breaks in production with 4000 errors per 1000 pages.

single accuracy scores miss broken reading order, shifted table columns, and lost cross page context that silently corrupt entire datasets.

we’ve
Y Combinator (@ycombinator) 's Twitter Profile Photo

Pulse (Pulse) just launched their state-of-the-art document extraction platform. It turns complex PDFs, scans, decks, and images into LLM-ready data. No training required. runpulse.com/blog/pulse-ope… Congrats on the launch, sid and Ritvik Pandey!

sid (@sid_mnk) 's Twitter Profile Photo

@pulse__ai just launched formula recognition. trained on 10m+ formula/latex pairs from papers + handwritten notes. traditional ocr breaks on math (α, β, fractions, matrices). our model treats formulas as structured objects → clean latex. built on pulse’s production-grade

@pulse__ai just launched formula recognition.
trained on 10m+ formula/latex pairs from papers + handwritten notes.

traditional ocr breaks on math (α, β, fractions, matrices). our model treats formulas as structured objects → clean latex.

built on pulse’s production-grade
Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

Culture building is everything when you're asking engineers to solve the hardest problems for enterprises. The entire Pulse team is usually in the office 12 hours a day - everyone needs to be in one place, building together. Having an immediate feedback loop is incredibly

Culture building is everything when you're asking engineers to solve the hardest problems for enterprises. The entire <a href="/Pulse__AI/">Pulse</a>  team is usually in the office 12 hours a day - everyone needs to be in one place, building together.

Having an immediate feedback loop is incredibly
Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

The Pulse team just published "The Precision Tax" - why "99% accuracy" fails in finance. One percent error in financial document processing means broken valuations, failed covenant tests, and regulatory exposure. The real benchmark isn't accuracy, it's determinism. Same

The <a href="/Pulse__AI/">Pulse</a>  team just published "The Precision Tax" - why "99% accuracy" fails in finance.

One percent error in financial document processing means broken valuations, failed covenant tests, and regulatory exposure. The real benchmark isn't accuracy, it's determinism. Same
Pulse (@pulse__ai) 's Twitter Profile Photo

Pulse is now officially part of Cloudera's Enterprise AI ecosystem. Excited to partner with Cloudera and continue delivering the most accurate document extraction models at enterprise scale.

Pulse is now officially part of <a href="/cloudera/">Cloudera</a>'s Enterprise AI ecosystem.

Excited to partner with Cloudera and continue delivering the most accurate document extraction models at enterprise scale.
Y Combinator (@ycombinator) 's Twitter Profile Photo

.Pulse just launched Ultra Nano, their new enterprise-focused document extraction model with complete self-hosting, already running across Fortune 50s, insurers, investment firms, banks, and foundational model labs. runpulse.com/blog/self-host… Congrats on the launch, sid

.<a href="/Pulse__AI/">Pulse</a> just launched Ultra Nano, their new enterprise-focused document extraction model with complete self-hosting, already running across Fortune 50s, insurers, investment firms, banks, and foundational model labs.

runpulse.com/blog/self-host…

Congrats on the launch, <a href="/sid_mnk/">sid</a>
Ritvik Pandey (@ritvikpandey21) 's Twitter Profile Photo

DeepSeek AI dropped a new open-source OCR model today 👀 At @pulse__ai, we tested it on financial docs, handwritten forms, and complex tables. The results showed the same issues plaguing LLM-driven OCR: - Unstable outputs - Hallucinated text - Broken table structures Reality

DeepSeek AI dropped a new open-source OCR model today 👀

At @pulse__ai, we tested it on financial docs, handwritten forms, and complex tables. The results showed the same issues plaguing LLM-driven OCR:

- Unstable outputs
- Hallucinated text
- Broken table structures

Reality
sid (@sid_mnk) 's Twitter Profile Photo

Exciting research preview to share on XLSX parsing at Pulse . Spreadsheets are deceptively hard - merged cells, multi-tab workbooks, and cross-sheet references break when you flatten them. Our team has developed and implemented a token-efficient encoder resulting in

Exciting research preview to share on XLSX parsing at <a href="/Pulse__AI/">Pulse</a> . Spreadsheets are deceptively hard - merged cells, multi-tab workbooks, and cross-sheet references break when you flatten them. 

Our team has developed and implemented a token-efficient encoder resulting in