Goal
Run a document ingestion workflow where source files and extraction results live in your cloud storage path instead of being copied through one-off local scripts.
Use This Workflow
Use this pattern for regulated intake queues, nightly backfills, and customer-controlled storage workflows.
Batch Extract From S3
Single File With A Presigned URL
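Before handing a presigned URL to a single-file extraction call, it can help to confirm that the URL really is short-lived and has not already expired. The sketch below reads the standard SigV4 query parameters `X-Amz-Date` and `X-Amz-Expires` from the URL. The helper names and the 15-minute ceiling are assumptions made for illustration, not part of the Pulse API.

```python
from datetime import datetime, timezone
from urllib.parse import parse_qs, urlsplit

# Assumed policy for this sketch: reject URLs signed for more than 15 minutes.
MAX_LIFETIME_SECONDS = 900


def presigned_seconds_left(url: str, now: datetime) -> int:
    """Seconds until a SigV4 presigned URL expires (negative if already expired)."""
    query = parse_qs(urlsplit(url).query)
    # X-Amz-Date is the signing time in UTC, e.g. 20240101T120000Z.
    signed_at = datetime.strptime(
        query["X-Amz-Date"][0], "%Y%m%dT%H%M%SZ"
    ).replace(tzinfo=timezone.utc)
    # X-Amz-Expires is the signed lifetime in seconds.
    lifetime = int(query["X-Amz-Expires"][0])
    return lifetime - int((now - signed_at).total_seconds())


def is_usable_short_lived(url: str, now: datetime) -> bool:
    """True if the URL was signed for a short lifetime and is still valid."""
    lifetime = int(parse_qs(urlsplit(url).query)["X-Amz-Expires"][0])
    return lifetime <= MAX_LIFETIME_SECONDS and presigned_seconds_left(url, now) > 0
```

A caller would typically pass `datetime.now(timezone.utc)` as `now` and regenerate the URL when the check fails, rather than submit a link that may expire before the file is fetched.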
Checks
- Use short-lived presigned URLs for single-file extraction.
- Use batch S3 prefixes for high-volume queues.
- Enable Bring Your Own Storage when artifacts must stay in your cloud account.
- Persist batch_job_id, child job IDs, source object keys, and output prefixes.
- Make downstream writes idempotent by source object key and extraction ID.
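The last two checks can be sketched together as a small tracking table keyed by source object key and extraction ID, so that a replayed webhook or a retried job does not produce a second downstream write. This is a minimal sketch using SQLite. The table layout, column names, and `record_result` helper are hypothetical, and a production system would likely use its own database.

```python
import sqlite3

# Hypothetical schema: one row per (source object key, extraction ID).
SCHEMA = """
CREATE TABLE IF NOT EXISTS extractions (
    source_key    TEXT NOT NULL,
    extraction_id TEXT NOT NULL,
    batch_job_id  TEXT NOT NULL,
    child_job_id  TEXT NOT NULL,
    output_key    TEXT NOT NULL,
    PRIMARY KEY (source_key, extraction_id)
);
"""


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open the tracking store and create the table if it is missing."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def record_result(
    conn: sqlite3.Connection,
    *,
    batch_job_id: str,
    child_job_id: str,
    source_key: str,
    extraction_id: str,
    output_key: str,
) -> bool:
    """Record a result once; return True only when a new row was written."""
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT OR IGNORE INTO extractions VALUES (?, ?, ?, ?, ?)",
            (source_key, extraction_id, batch_job_id, child_job_id, output_key),
        )
    # rowcount is 0 when the primary key already existed and the insert was ignored.
    return cur.rowcount == 1
```

Downstream processing can then run only when `record_result` returns `True`, which makes duplicate deliveries of the same extraction a no-op.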
Related
AWS S3 Setup
Configure Pulse access to your bucket.
Batch Processing
Full batch endpoint reference.
Production Webhooks
Receive completion events.