Skip to main content

Goal

Parse an Excel workbook into markdown that preserves sheet structure, while controlling hidden rows, hidden sheets, display formatting, and inflated used ranges.

Use This Workflow

Use this for financial models, loss runs, claims exports, underwriting workbooks, and operational reports.

Platform Preview

Spreadsheet extraction in the Platform showing workbook tabs and markdown output Quarterly profit and loss sheet preview from the ACME workbook

Request

curl -X POST https://api.runpulse.com/extract \
  -H "x-api-key: YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "file_url": "https://your-storage.example.com/acme-financial-review.xlsx",
    "spreadsheet": {
      "include_hidden_rows": false,
      "include_hidden_cols": false,
      "include_hidden_sheets": false,
      "use_raw_values": true,
      "only_data_rows": true,
      "only_data_cols": true
    },
    "async": true,
    "storage": {"enabled": true}
  }'

Settings

SettingRecommended defaultChange it when
use_raw_valuestrue for financial dataYou want values exactly as displayed in Excel.
only_data_rowstrue for exported workbooksEmpty trailing rows are intentionally meaningful.
only_data_colstrue for exported workbooksEmpty trailing columns are intentionally meaningful.
include_hidden_sheetsfalseHidden support sheets contain auditable source data.
include_hidden_rows / include_hidden_colsfalseHidden content is part of the business record.

Checks

  • Compare totals in extracted markdown against visible workbook totals.
  • Decide whether hidden sheets are data, calculations, or irrelevant workbook machinery.
  • Use raw values for downstream calculations and display values for human-facing reports.
  • Store the workbook filename, sheet name, and extraction ID with downstream records.
  • Test on real exported workbooks; phantom rows are usually system-specific.

Spreadsheet Processing

Spreadsheet parameter guide.

Financial Tables

Table-first PDF workflows.