Download large extraction result

curl --request GET \
  --url https://api.runpulse.com/large_results/{jobId} \
  --header 'x-api-key: <api-key>'

{
  "markdown": "<string>",
  "extensions": {
    "chunking": {
      "semantic": [
        "<string>"
      ],
      "header": [
        "<string>"
      ],
      "page": [
        "<string>"
      ],
      "recursive": [
        "<string>"
      ]
    },
    "footnoteReferences": [
      {
        "symbol": "<string>",
        "footnoteTextId": "<string>",
        "referenceTextIds": [
          "<string>"
        ]
      }
    ],
    "altOutputs": {
      "wlbb": {
        "words": [
          {
            "id": "<string>",
            "text": "<string>",
            "page_number": 2,
            "bounding_box": [
              123
            ],
            "average_word_confidence": 123
          }
        ],
        "error": "<string>"
      },
      "html": "<string>",
      "xml": "<string>"
    }
  },
  "bounding_boxes": {
    "Images": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "image_url": "<string>",
        "description": "<string>",
        "classification": {
          "confidence": 0.5,
          "model": "<string>",
          "error": "<string>"
        },
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123,
        "excel_range": "<string>",
        "chart_type": "<string>",
        "chart_title": "<string>",
        "source_ranges": [
          "<string>"
        ],
        "render_error": "<string>",
        "description_error": "<string>"
      }
    ],
    "Tables": [
      {
        "table_info": {
          "id": "<string>",
          "dimensions": [
            123
          ],
          "excel_range": "<string>",
          "sheet_name": "<string>",
          "sheet_index": 123,
          "workbook_sheet_index": 123,
          "section_index": 123,
          "section_type": "<string>",
          "section_name": "<string>",
          "table_name": "<string>",
          "layout_type": "<string>",
          "is_chart": true,
          "chart_type": "<string>",
          "chart_title": "<string>",
          "source_ranges": [
            "<string>"
          ],
          "location": {}
        },
        "cell_data": [
          {}
        ]
      }
    ],
    "Text": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "Title": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "Footer": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "markdown_with_ids": "<string>"
  },
  "extraction_id": "3c90c3cc-0d44-4b50-8888-8dd25736052a",
  "extraction_url": "<string>",
  "page_count": 2,
  "plan_info": {
    "tier": "<string>",
    "total_credits_used": 123,
    "pages_used": 1,
    "note": "<string>"
  },
  "warnings": [
    "<string>"
  ],
  "credits_used": 123
}

Jobs, Results & Webhooks

Download Large Result

Download the full result for a large extraction. When /extract or GET /job/{jobId} returns is_url: true, fetch the complete result from the URL provided. The URL is single-use: after a successful download the resource is deleted and subsequent requests return 410 Gone.

For form jobs (/form/detect, /form/fill, /form/clear) you don’t need this endpoint at all — GET /job/{jobId} already returns the full FormResult inline under result, and the pdf_url field points at GET /results//pdf for the binary.

GET

large_results

{jobId}

Download large extraction result

curl --request GET \
  --url https://api.runpulse.com/large_results/{jobId} \
  --header 'x-api-key: <api-key>'

{
  "markdown": "<string>",
  "extensions": {
    "chunking": {
      "semantic": [
        "<string>"
      ],
      "header": [
        "<string>"
      ],
      "page": [
        "<string>"
      ],
      "recursive": [
        "<string>"
      ]
    },
    "footnoteReferences": [
      {
        "symbol": "<string>",
        "footnoteTextId": "<string>",
        "referenceTextIds": [
          "<string>"
        ]
      }
    ],
    "altOutputs": {
      "wlbb": {
        "words": [
          {
            "id": "<string>",
            "text": "<string>",
            "page_number": 2,
            "bounding_box": [
              123
            ],
            "average_word_confidence": 123
          }
        ],
        "error": "<string>"
      },
      "html": "<string>",
      "xml": "<string>"
    }
  },
  "bounding_boxes": {
    "Images": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "image_url": "<string>",
        "description": "<string>",
        "classification": {
          "confidence": 0.5,
          "model": "<string>",
          "error": "<string>"
        },
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123,
        "excel_range": "<string>",
        "chart_type": "<string>",
        "chart_title": "<string>",
        "source_ranges": [
          "<string>"
        ],
        "render_error": "<string>",
        "description_error": "<string>"
      }
    ],
    "Tables": [
      {
        "table_info": {
          "id": "<string>",
          "dimensions": [
            123
          ],
          "excel_range": "<string>",
          "sheet_name": "<string>",
          "sheet_index": 123,
          "workbook_sheet_index": 123,
          "section_index": 123,
          "section_type": "<string>",
          "section_name": "<string>",
          "table_name": "<string>",
          "layout_type": "<string>",
          "is_chart": true,
          "chart_type": "<string>",
          "chart_title": "<string>",
          "source_ranges": [
            "<string>"
          ],
          "location": {}
        },
        "cell_data": [
          {}
        ]
      }
    ],
    "Text": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "Title": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "Footer": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "markdown_with_ids": "<string>"
  },
  "extraction_id": "3c90c3cc-0d44-4b50-8888-8dd25736052a",
  "extraction_url": "<string>",
  "page_count": 2,
  "plan_info": {
    "tier": "<string>",
    "total_credits_used": 123,
    "pages_used": 1,
    "note": "<string>"
  },
  "warnings": [
    "<string>"
  ],
  "credits_used": 123
}

Overview

Large async results may be returned as a URL instead of an inline response body. Use this endpoint to download the full completed result for a job when the job response indicates a large-result URL.

Large-result links are intended for completed jobs. Poll GET /job/{jobId} first, then download the result URL if the job response says the result is URL-backed.

Typical Flow

Working With Large Documents

Plan long-running and large-output workflows.

Poll Job

Check async job status.

Authorizations

x-api-key

string

header

required

Path Parameters

jobId

string

required

Job identifier from the extraction response.

Response

Full extraction result (streamed JSON)

Core extraction result fields shared by the synchronous /extract endpoint and the pipeline extract step.

markdown

string

Primary markdown content extracted from the document. Always present in the new format.

extensions

object

Output from enabled extensions. Each key corresponds to an extension that was enabled in the request under extensions.*. Only keys for enabled extensions are present.

Show child attributes

bounding_boxes

object

Positional bounding-box data for text, titles, headers, footers, images, and tables. Images carries chart/image visuals (with image_url when figure_processing.show_images is enabled), Tables the detected tables, and Text/Title/Footer the paragraph/title/footer regions. Additional keys (e.g. markdown_with_ids, defined_names) round-trip without being typed.

Show child attributes

extraction_id

string<uuid>

Persisted extraction ID. Present when storage is enabled (default). Use this ID with /split and /schema endpoints.

extraction_url

string

URL to view the extraction on the Pulse platform. Present when storage is enabled.

page_count

integer

Number of pages processed.

Required range: x >= 1

plan_info

object

Billing tier and cumulative usage information. Includes total_credits_used (primary billing metric) and pages_used (legacy compatibility).

Show child attributes

warnings

string[]

Non-fatal warnings generated during extraction. Includes deprecation notices when legacy input parameters are used, as well as processing warnings (e.g. word-level bounding box limitations).

credits_used

number<float> | null

Number of credits consumed by this request. Only present when the organization has the credit billing system enabled.

Cancel Job Get Result Image

⌘I

​Overview

​Typical Flow

​Related

Working With Large Documents

Poll Job

Authorizations

Path Parameters

Response

Overview

Typical Flow

Related