Best practices for processing large documents efficiently
Document Size | Processing Method | Response Type | Recommended Approach |
---|---|---|---|
< 50 pages | Synchronous | Direct JSON | Use /extract endpoint |
50-70 pages | Synchronous/Async | Direct JSON | Consider async for reliability |
> 70 pages | Either | S3 URL | Results delivered via URL |
> 100 pages | Async recommended | S3 URL | Use /extract_async |
> 500 pages | Async required | S3 URL | Consider page ranges |
"5"
"10-20"
"1-5,10-15,20"
"1,3,5-10,15"
Use Async for Large Files
Optimize Page Ranges
Handle S3 URLs Properly
is_url
in responsesMonitor Performance