Pulse (@pulse__ai) Twitter Tweets • TwiCopy

Y Combinator

7 months ago

After processing 400M+ pages for the world's largest investment firms, AI startups, and Fortune 500s, Pulse is launching Ultra: their new hybrid reasoning model. It's the most accurate document extraction model in the industry. Live for all customers today.

256 likes, 18 replies, 33 reposts

Ritvik Pandey

@ritvikpandey21

6 months ago

After processing nearly 500M pages, we discovered the biggest challenge in document AI isn't OCR accuracy - it's semantic understanding across page breaks and column boundaries. 🧵 (1/8)

20 likes, 2 replies, 3 reposts

Y Combinator

@ycombinator

5 months ago

Pulse has just launched Meridian, an AI-powered financial document processor that can automatically convert any PDF, Word doc, PowerPoint presentation, or image into a structured Excel export with charts and graphs. runpulse.com/blog/introduci… Congrats on the launch,

303 likes, 24 replies, 26 reposts

Ritvik Pandey

@ritvikpandey21

5 months ago

we're super excited to be launching Meridian publicly! no more analysts having to manually copy numbers from pdfs into spreadsheets at 2 am before a board deadline. if you're interested in trying it out, give me a DM! 🫡

23 likes, 3 replies, 4 reposts

Ritvik Pandey

@ritvikpandey21

5 months ago

the team at Pulse put bytedance's dolphin OCR to the test against complex documents that matter for real business use cases.

while it shows improvements in reading order detection, we found critical limitations across key areas:
- 7.7% structured data extraction from

18 likes, 5 replies, 3 reposts

Ritvik Pandey

@ritvikpandey21

4 months ago

@pulse__ai team just dropped why "98% accurate" document extraction still breaks in production with 4000 errors per 1000 pages. single accuracy scores miss broken reading order, shifted table columns, and lost cross page context that silently corrupt entire datasets. we’ve

7 likes, 2 replies, 3 reposts

Y Combinator

@ycombinator

4 months ago

Pulse just launched their state-of-the-art document extraction platform. It turns complex PDFs, scans, decks, and images into LLM-ready data. No training required. runpulse.com/blog/pulse-ope… Congrats on the launch, sid and Ritvik Pandey!

279 likes, 18 replies, 27 reposts

sid

@sid_mnk

3 months ago

customer feedback last week: hours of work → minutes with @pulse__ai

12 likes, 1 reply, 2 reposts

Pulse

@pulse__ai

3 months ago

Join us!

6 likes, 0 replies, 0 reposts

sid

@sid_mnk

3 months ago

Almost a year since this

9 likes, 1 reply, 1 repost

sid

@sid_mnk

3 months ago

@pulse__ai just launched formula recognition. trained on 10m+ formula/latex pairs from papers + handwritten notes. traditional ocr breaks on math (α, β, fractions, matrices). our model treats formulas as structured objects → clean latex. built on pulse’s production-grade


31 likes, 7 replies, 6 reposts

Ritvik Pandey

@ritvikpandey21

3 months ago

Culture building is everything when you're asking engineers to solve the hardest problems for enterprises. The entire Pulse team is usually in the office 12 hours a day - everyone needs to be in one place, building together.

Having an immediate feedback loop is incredibly

5 likes, 0 replies, 2 reposts

Ritvik Pandey

@ritvikpandey21

3 months ago

The Pulse team just published "The Precision Tax" - why "99% accuracy" fails in finance.

One percent error in financial document processing means broken valuations, failed covenant tests, and regulatory exposure. The real benchmark isn't accuracy, it's determinism. Same

12 likes, 1 reply, 2 reposts

Pulse

@pulse__ai

2 months ago

Pulse is now officially part of Cloudera's Enterprise AI ecosystem.

Excited to partner with Cloudera and continue delivering the most accurate document extraction models at enterprise scale.

35 likes, 10 replies, 5 reposts

sid

@sid_mnk

2 months ago

🧢

8 likes, 1 reply, 2 reposts

Y Combinator

@ycombinator

2 months ago

.Pulse just launched Ultra Nano, their new enterprise-focused document extraction model with complete self-hosting, already running across Fortune 50s, insurers, investment firms, banks, and foundational model labs.

runpulse.com/blog/self-host…

Congrats on the launch, sid

62 likes, 11 replies, 5 reposts

Ritvik Pandey

@ritvikpandey21

a month ago

DeepSeek AI dropped a new open-source OCR model today 👀

At @pulse__ai, we tested it on financial docs, handwritten forms, and complex tables. The results showed the same issues plaguing LLM-driven OCR:
- Unstable outputs
- Hallucinated text
- Broken table structures

Reality

194 likes, 18 replies, 24 reposts

sid

@sid_mnk

a month ago

threw a screenshot of this post into Pulse ~99% accurate

try it here: platform.runpulse.com

8 likes, 1 reply, 2 reposts

sid

@sid_mnk

a month ago

Exciting research preview to share on XLSX parsing at Pulse. Spreadsheets are deceptively hard - merged cells, multi-tab workbooks, and cross-sheet references break when you flatten them.

Our team has developed and implemented a token-efficient encoder resulting in

8 likes, 1 reply, 2 reposts

sid

@sid_mnk

14 days ago

spotted

7 likes, 1 reply, 2 reposts