Pulse API
Updates
Latest updates and improvements to Pulse API
Batch Processing Endpoint
- Introduced
/batch_extract_async
endpoint for parallel document processing - Process multiple documents with customized parameters in one API call
- Support for both file URLs and file paths in batch requests
- Integrated with existing
/job/<job_id>
polling endpoint
Example Request Structure
Full documentation available at docs.runpulse.com
Cloud-Agnostic Processing & Table Improvements
- Universal presigned URL support (AWS, GCP, Azure)
- Direct support for any valid presigned URL from major cloud providers
- Support for direct PDF links without cloud storage
Table Processing Enhancements
- New
experimental_return_table
parameter (beta) - Improved cell boundary detection
- Better handling of merged cells
- Original
return_table
parameter maintained for production stability
Schema Processing Improvements
- Resolved token limit truncation in large document processing
- Optimized processing pipeline for faster schema generation
- Improved handling of nested data structures
Important Notes
- experimental_return_table is in beta
- Processing time estimates now reflect real-world processing times
- All existing endpoints and features remain fully supported
- Schema processing improvements are automatically applied
Async Processing Update
- New asynchronous processing capabilities for large documents
- Job status monitoring and management
- Cancel in-progress jobs
- All existing endpoints and features maintained
New Features
- New
/extract_async
endpoint for processing large documents - Immediate job ID response for status tracking
- Progress monitoring through
/job/<job_id>
endpoint - Cancel long-running jobs with
/cancel/<job_id>
Response Examples
Post-Thanksgiving Feature Release
- Enhanced schema processing engine - more accurate field extraction
- New domain & API endpoint: api.runpulse.com (existing endpoint still supported)
- New table extraction capabilities
- Improved layout recognition for complex documents
Direct Table Extraction
- New return_table parameter returns tables in 2D matrix format
- Perfect for financial documents, rent rolls, and structured data
- Clean, processed tables ready for data analysis
Schema & Response Changes
- Better field type handling (string, float, time)
- More accurate extraction of nested data
- Specify schema to receive “schema-json” in response
Resources & Support
- Full documentation: docs.runpulse.com
- Questions? Email our founders directly: founders@trypulse.ai
Pulse API Launch!
Key Update
- Renaming
/upload
to/extract
for documents requiring custom extraction patterns and high volume processing /convert
endpoint unchanged for quick PDF processing & URL generation
Performance Improvements
- 2x faster document processing
- Significantly improved table extraction accuracy
- Enhanced layout recognition for complex documents
- All existing parameters and schemas work exactly the same
Custom Schema Power
- Extract exactly what you need with flexible JSON schemas
- Design any extraction pattern - from simple text to complex nested structures
Coming Soon
- New parameter to receive all detected tables as separate CSV files
- More extraction capabilities rolling out weekly
Working with Tables
Need to convert extracted tables to matrix format? Here’s a helper script:
Resources & Support
- Full documentation: docs.trypulse.ai
- Questions? Email our founders directly: founders@trypulse.ai
Keep extracting amazing things!