Skip to main content
GET
/
large_results
/
{jobId}
Download large extraction result
curl --request GET \
  --url https://api.runpulse.com/large_results/{jobId} \
  --header 'x-api-key: <api-key>'
{
  "markdown": "<string>",
  "extensions": {
    "chunking": {
      "semantic": [
        "<string>"
      ],
      "header": [
        "<string>"
      ],
      "page": [
        "<string>"
      ],
      "recursive": [
        "<string>"
      ]
    },
    "footnoteReferences": [
      {
        "symbol": "<string>",
        "footnoteTextId": "<string>",
        "referenceTextIds": [
          "<string>"
        ]
      }
    ],
    "altOutputs": {
      "wlbb": {
        "words": [
          {
            "id": "<string>",
            "text": "<string>",
            "page_number": 2,
            "bounding_box": [
              123
            ],
            "average_word_confidence": 123
          }
        ],
        "error": "<string>"
      },
      "html": "<string>",
      "xml": "<string>"
    }
  },
  "bounding_boxes": {
    "Images": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "image_url": "<string>",
        "description": "<string>",
        "classification": {
          "confidence": 0.5,
          "model": "<string>",
          "error": "<string>"
        },
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123,
        "excel_range": "<string>",
        "chart_type": "<string>",
        "chart_title": "<string>",
        "source_ranges": [
          "<string>"
        ],
        "render_error": "<string>",
        "description_error": "<string>"
      }
    ],
    "Tables": [
      {
        "table_info": {
          "id": "<string>",
          "dimensions": [
            123
          ],
          "excel_range": "<string>",
          "sheet_name": "<string>",
          "sheet_index": 123,
          "workbook_sheet_index": 123,
          "section_index": 123,
          "section_type": "<string>",
          "section_name": "<string>",
          "table_name": "<string>",
          "layout_type": "<string>",
          "is_chart": true,
          "chart_type": "<string>",
          "chart_title": "<string>",
          "source_ranges": [
            "<string>"
          ],
          "location": {}
        },
        "cell_data": [
          {}
        ]
      }
    ],
    "Text": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "Title": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "Footer": [
      {
        "id": "<string>",
        "content": "<string>",
        "page_number": 2,
        "bounding_box": [
          123
        ],
        "excel_range": "<string>",
        "sheet_name": "<string>",
        "sheet_index": 123,
        "workbook_sheet_index": 123
      }
    ],
    "markdown_with_ids": "<string>"
  },
  "extraction_id": "3c90c3cc-0d44-4b50-8888-8dd25736052a",
  "extraction_url": "<string>",
  "page_count": 2,
  "plan_info": {
    "tier": "<string>",
    "total_credits_used": 123,
    "pages_used": 1,
    "note": "<string>"
  },
  "warnings": [
    "<string>"
  ],
  "credits_used": 123
}

Overview

Large async results may be returned as a URL instead of an inline response body. Use this endpoint to download the full completed result for a job when the job response indicates a large-result URL.
Large-result links are intended for completed jobs. Poll GET /job/{jobId} first, then download the result URL if the job response says the result is URL-backed.

Typical Flow

Working With Large Documents

Plan long-running and large-output workflows.

Poll Job

Check async job status.

Authorizations

x-api-key
string
header
required

Path Parameters

jobId
string
required

Job identifier from the extraction response.

Response

Full extraction result (streamed JSON)

Core extraction result fields shared by the synchronous /extract endpoint and the pipeline extract step.

markdown
string

Primary markdown content extracted from the document. Always present in the new format.

extensions
object

Output from enabled extensions. Each key corresponds to an extension that was enabled in the request under extensions.*. Only keys for enabled extensions are present.

bounding_boxes
object

Positional bounding-box data for text, titles, headers, footers, images, and tables. Images carries chart/image visuals (with image_url when figure_processing.show_images is enabled), Tables the detected tables, and Text/Title/Footer the paragraph/title/footer regions. Additional keys (e.g. markdown_with_ids, defined_names) round-trip without being typed.

extraction_id
string<uuid>

Persisted extraction ID. Present when storage is enabled (default). Use this ID with /split and /schema endpoints.

extraction_url
string

URL to view the extraction on the Pulse platform. Present when storage is enabled.

page_count
integer

Number of pages processed.

Required range: x >= 1
plan_info
object

Billing tier and cumulative usage information. Includes total_credits_used (primary billing metric) and pages_used (legacy compatibility).

warnings
string[]

Non-fatal warnings generated during extraction. Includes deprecation notices when legacy input parameters are used, as well as processing warnings (e.g. word-level bounding box limitations).

credits_used
number<float> | null

Number of credits consumed by this request. Only present when the organization has the credit billing system enabled.