Preview a PR Before Merging

rocky preview runs the models a PR’s diff actually changes against a per-PR branch, leaves everything else untouched (copied from the base ref), and produces a structural + sampled row-level diff and a cost delta vs. base. This guide walks you through running it locally on a feature branch.

For the design — how Rocky picks the prune set, why CTAS today and clones tomorrow, how the sampling window works — see the How Preview Works concept page. For the full output schemas, see the rocky preview CLI reference.

Prerequisites

You’ll need:

Rocky installed and on $PATH (the Getting Started guide has install instructions).
A repo with a rocky.toml and a models/ directory.
A git working tree on a feature branch with at least one model change vs. the base ref.
The base schema’s tables already materialized — preview create copies them into the per-PR branch schema, so they need to exist. Running rocky run once on main is enough.

The walkthrough below uses --base main, but any git ref works.

Step 1: Create the preview branch

rocky preview create --base main

What this does:

Runs git diff --name-only main HEAD against the models directory to find changed model files.
Loads the working tree into the compiler and computes the prune set — every changed model plus everything that depends on a changed column.
Computes the copy set — every working-DAG model not in the prune set.
Registers a branch in the state store (mirrors rocky branch create).
Issues CREATE TABLE <branch_schema>.<model> AS SELECT * FROM <base_schema>.<model> for each copy-set model.
Calls rocky run --branch <name> with a model selector limited to the prune set.

The output is a PreviewCreateOutput JSON document:

{
  "version": "1.18.0",
  "command": "preview-create",
  "branch_name": "preview-fix-price",
  "branch_schema": "branch__preview-fix-price",
  "base_ref": "main",
  "head_ref": "HEAD",
  "prune_set": [
    { "model_name": "fct_revenue", "reason": "changed", "changed_columns": ["amount_cents"] },
    { "model_name": "rev_by_region", "reason": "downstream_of_changed" }
  ],
  "copy_set": [
    { "model_name": "stg_orders",    "source_schema": "main", "target_schema": "branch__preview-fix-price", "copy_strategy": "ctas" },
    { "model_name": "stg_customers", "source_schema": "main", "target_schema": "branch__preview-fix-price", "copy_strategy": "ctas" }
  ],
  "skipped_set": [],
  "run_id": "run-20260428-141033-002",
  "run_status": "succeeded",
  "duration_ms": 4321
}

The run_id is the handle the next two commands use to look up cost telemetry.

Step 2: Diff the branch against base

rocky preview diff --name preview-fix-price --output markdown

This combines two layers into one report:

Structural diff — column-level added/removed/type-changed, the same shape rocky ci-diff produces.
Row-level diff — per-model row delta surfaced through one of two algorithms (a discriminator on the JSON output picks which):
- kind: "sampled" (default) — LIMIT N rows ordered by primary key (default --sample-size 1000); fast, but miss rows outside the window. Carries a coverage_warning when the sample doesn’t cover the full table.
- kind: "bisection" — exhaustive checksum-bisection over a single-column integer / numeric unique_key. Walks the chunk lattice, recurses into mismatched chunks, surfaces every row-level diff. See the How Preview Works page for the algorithm. Runs only on Merge-strategy models with a single integer PK; other models stay on sampled (logged via tracing::warn with the skip reason).

--output markdown writes a PR-comment-ready snippet to stdout. Use --output json for the full PreviewDiffOutput shape (the JSON output also includes the rendered Markdown in a top-level markdown field, so you can pipe it through jq -r .markdown for the same effect).

Choosing the algorithm

# Default — sampled (fast, may miss out-of-window changes)
rocky preview diff --name preview-fix-price

# Exhaustive — checksum-bisection (covers the whole table)
rocky preview diff --name preview-fix-price --algorithm bisection

Per-model output uses a tagged algorithm discriminator:

"models": [
  {
    "model_name": "fct_revenue",
    "structural": { /* ... */ },
    "algorithm": {
      "kind": "bisection",
      "diff": { "rows_added": 0, "rows_removed": 0, "rows_changed": 1, "samples": [...] },
      "bisection_stats": {
        "chunks_examined": 64, "leaves_materialized": 1,
        "depth_max": 2, "depth_capped": false,
        "split_strategy": "int_range",
        "null_pk_rows_base": 0, "null_pk_rows_branch": 0
      }
    }
  }
]

Direct JSON consumers should read model.algorithm.kind first, then unpack the matching variant. The Dagster typed-resource layer absorbs this automatically.

Step 3: Compare cost vs. base

rocky preview cost --name preview-fix-price --output markdown

This is a diff layer over rocky cost latest. For each model in the prune set, Rocky looks up the latest base-schema RunRecord from the state store and the branch run’s RunRecord, then subtracts the per-model duration, bytes scanned, and USD cost.

The summary fields tell you:

delta_usd — total branch cost minus base cost. Positive means the PR will cost more to run on main after merge.
total_branch_duration_ms and total_branch_bytes_scanned — run-level totals used for budget projection (see below).
savings_from_copy_usd — what the preview itself saved by copying instead of re-running. This is the empirical evidence that the prune-and-copy substrate is doing work.
models_skipped_via_copy — count of models that didn’t run on the branch because they were copy-set.

Pre-merge budget projection

When the project declares a [budget] block in rocky.toml, preview cost projects breaches against the branch totals before merge so a reviewer (and the CI gate) sees this PR would breach max_usd / max_duration_ms / max_bytes_scanned if merged before the merge happens. Output field:

"projected_budget_breaches": [
  { "limit_type": "max_usd",          "limit": 2.5,   "actual": 5.0 },
  { "limit_type": "max_duration_ms",  "limit": 60000, "actual": 90000 }
]

Empty when no budget is configured or the projected totals stay within every limit. Mirrors the RunOutput.budget_breaches shape so the same downstream consumers (PR-comment templates, JSON listeners) can process both with one code path.

The Markdown rendering surfaces a “Budget projection” section only when breaches exist; framing flips between advisory (“would breach”) and “would fail the run” based on [budget].on_breach.

What the prune set means

The prune set is the set of models that re-execute against the branch. Two reasons can put a model in the prune set:

reason: "changed" — the model file itself changed in the diff (changed_columns lists which columns).
reason: "downstream_of_changed" — the model didn’t change but transitively depends on a column that did.

Models in neither bucket are either in the copy set (logically identical to base — they get CTAS’d over) or the skipped set (the column-level pruner determined they’re unaffected and not depended on). The skipped set is the empty-cost residue: nothing copies them, nothing runs them.

If the prune set is empty, your PR doesn’t change any model output (e.g. a whitespace-only edit). The branch run is a no-op and preview cost reports a zero delta.

What `coverage_warning: true` means

The default --algorithm sampled reads a fixed window (ORDER BY <pk> LIMIT N), so it can miss changes outside that window. When the row count outside the window is non-trivial, the per-model diff sets:

"algorithm": {
  "kind": "sampled",
  "sampled": { /* ... */ },
  "sampling_window": {
    "ordered_by": "<column>",
    "limit": <N>,
    "coverage": "first_n_by_order",
    "coverage_warning": true
  }
}

The aggregate summary.any_coverage_warning widens to fire on either condition: a sampled diff with coverage_warning: true or a bisection diff with bisection_stats.depth_capped: true (the recursion bottomed out at the depth cap on a pathologically skewed PK distribution before reaching leaf size). Either signals the per-model findings might be incomplete.

When you see the warning on a sampled diff, your options are:

Re-run with --algorithm bisection — covers the whole table exhaustively. Works for any model with a single-column integer / numeric unique_key.
Re-run with a larger --sample-size if a bounded sample of N more rows is enough confidence for this PR.
Inspect the changed columns directly via rocky compile --model <name> and reason about the change manually.

A clean sample with coverage_warning: true is not evidence the PR is no-op for that model.

Troubleshooting

base ref not found. rocky preview create --base <ref> requires the ref to exist locally. Run git fetch origin <ref> first if you’re working against a remote-only ref like origin/main.

preview cost reports null deltas. Cost requires a prior RunRecord for each compared model on the base schema. If the base schema has never been run end-to-end, base_run_id is null and per-model delta_usd falls back to null. Run rocky run once on main to populate the state store, then re-run preview cost.

preview cost reports null for the branch. The cost rollup uses the same adapter telemetry as rocky cost. DuckDB and unconfigured adapters report null USD by design; duration and bytes still surface. Configure [cost] in rocky.toml to get dollar amounts on Databricks / Snowflake.

Copy step is slow. The copy substrate dispatches per adapter via WarehouseAdapter::clone_table_for_branch. Databricks (SHALLOW CLONE) and BigQuery (CREATE TABLE … COPY) both ship metadata-only overrides as of engine-v1.19.1 — the per-PR branch table is effectively zero-cost at create time. DuckDB and Snowflake fall through to the portable CTAS default, which physically copies bytes; on large tables this is the dominant cost of preview create. Snowflake’s native zero-copy CLONE will land once a Snowflake consumer drives the integration test against a workspace.

The diff finds no changes but the model definitely changed. Check summary.any_coverage_warning in the JSON output. If it’s true, the sampling window missed the changed rows — see the section above.

Posting to a PR

rocky preview ships a composite GitHub Action that runs all three commands on every push to a pull request and upserts a single Markdown comment with the prune/copy/skip plan, the structural diff, and the cost delta. The action lives at .github/actions/rocky-preview/ in the rocky-data repo and is drop-in for any repo with a rocky.toml and a models/ directory.

Setting up the GitHub Action

Add the workflow below to .github/workflows/preview.yml in your own repo:

name: rocky-preview

on:
  pull_request:
    types: [opened, synchronize, reopened]

permissions:
  contents: read
  pull-requests: write

jobs:
  preview:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v6
        with:
          fetch-depth: 0  # required for git diff against the base branch

      - uses: rocky-data/rocky/.github/actions/rocky-preview@main
        with:
          base_ref: ${{ github.event.pull_request.base.ref }}
          branch_name: ${{ github.event.pull_request.head.ref }}
          github_token: ${{ github.token }}
          # working_directory: my-pipeline   # if rocky.toml lives in a subdir
          # models_dir: models               # default
          # rocky_version: latest            # or 1.17.4 / engine-v1.17.4

The first PR after wiring this in will install Rocky and post a comment with the plan, the diff, and the cost delta. Subsequent pushes update that same comment in place via the  marker — no PR-comment spam.

Action inputs

Input	Default	Description
`base_ref`	(required)	Git ref to compare against. Typically `${{ github.event.pull_request.base.ref }}`.
`branch_name`	PR head ref, slugged	Preview branch name passed to `rocky preview --name`. Pre-slug if you pass it explicitly: only `[A-Za-z0-9_-]` are preserved.
`models_dir`	`models`	Directory containing model files. Passed to `rocky preview create --models`.
`working_directory`	`.`	Directory containing `rocky.toml`. The action `cd`s here before each subcommand.
`rocky_version`	`latest`	Engine version. `latest` resolves the highest `engine-v*` tag; otherwise pass `1.17.4` or `engine-v1.17.4`.
`comment_marker`	`<!-- rocky-preview -->`	Magic-string marker used for comment upsert. Override only if you run multiple preview workflows on the same PR.
`fail_on_preview_error`	`false`	When `true`, fail the PR check if any `rocky preview` subcommand errors. The default keeps preview advisory — failures still post a section in the comment.
`github_token`	(required)	Token used to read the PR and upsert the comment. Pass `${{ github.token }}` from the workflow (or a PAT for cross-repo permissions). Required because composite actions cannot reference `${{ github.token }}` in input defaults.

Action outputs

Output	Description
`comment_url`	HTML URL of the upserted PR comment.
`prune_set_size`	Number of models in the prune set (changed + downstream-of-changed).
`delta_usd`	Total branch-vs-base USD cost delta. Empty when no paired runs exist yet (e.g. first preview against an unpopulated base).

Failure modes

The action is designed to never block a PR by default:

A rocky preview <subcommand> failure surfaces as an :x: section in the comment with the captured stderr.
A missing or unfetched base ref produces a hint to add fetch-depth: 0 to actions/checkout.
A PR that touches no model files renders a tight one-liner (This PR does not change any pipeline models.) instead of empty diff/cost tables.

Set fail_on_preview_error: true to turn any of those into a hard PR-check failure.