Download Large Result
Download the full result for a large extraction. When /extract
or GET /job/{jobId} returns is_url: true, fetch the complete
result from the URL provided. The URL is single-use: after a
successful download the resource is deleted and subsequent
requests return 410 Gone.
For form jobs (/form/detect, /form/fill, /form/clear)
you don’t need this endpoint at all — GET /job/{jobId}
already returns the full FormResult inline under result,
and the pdf_url field points at
GET /results//pdf for the
binary.
Overview
Large async results may be returned as a URL instead of an inline response body. Use this endpoint to download the full completed result for a job when the job response indicates a large-result URL.GET /job/{jobId} first, then download the result URL if the job response says the result is URL-backed.Typical Flow
Related
Working With Large Documents
Poll Job
Authorizations
Path Parameters
Job identifier from the extraction response.
Response
Full extraction result (streamed JSON)
Core extraction result fields shared by the synchronous /extract endpoint and the pipeline extract step.
Primary markdown content extracted from the document. Always present in the new format.
Output from enabled extensions. Each key corresponds to an extension that was enabled in the request under extensions.*. Only keys for enabled extensions are present.
Positional bounding-box data for text, titles, headers, footers, images, and tables. Images carries chart/image visuals (with image_url when figure_processing.show_images is enabled), Tables the detected tables, and Text/Title/Footer the paragraph/title/footer regions. Additional keys (e.g. markdown_with_ids, defined_names) round-trip without being typed.
Persisted extraction ID. Present when storage is enabled (default). Use this ID with /split and /schema endpoints.
URL to view the extraction on the Pulse platform. Present when storage is enabled.
Number of pages processed.
x >= 1Billing tier and cumulative usage information. Includes total_credits_used (primary billing metric) and pages_used (legacy compatibility).
Non-fatal warnings generated during extraction. Includes deprecation notices when legacy input parameters are used, as well as processing warnings (e.g. word-level bounding box limitations).
Number of credits consumed by this request. Only present when the organization has the credit billing system enabled.