Skip to content

Examples & POC catalog

The playground ships 88 self-contained POCs across 8 categories. Each is a small project with its own rocky.toml, models/, and a single run.sh that takes it end-to-end. Most run on local DuckDB with no credentials, so you can read the code and run it in one step.

These complement the Quickstart and Playground walkthroughs: those teach the workflow, and the POCs show one capability each, in isolation.

Terminal window
git clone https://github.com/rocky-data/rocky.git
cd rocky/examples/playground
# Run any POC end-to-end (DuckDB, no credentials)
./pocs/02-performance/01-incremental-watermark/run.sh

The demos from the README, each a single run.sh:

What it showsPOC
Schema drift detected and recovered on the next run02-performance/06-schema-drift-recover
Data contracts enforced at compile time (E010 / E013)01-quality/01-data-contracts-strict
Named branches, content-addressed run records, and column lineage00-foundations/06-branches-replay-lineage
Column-level lineage on a branching DAG06-developer-experience/01-lineage-column-level
AI model generation with the compile-verify loop03-ai/01-model-generation
PR-time blast radius with rocky lineage-diff06-developer-experience/11-lineage-diff
Classify columns, mask by environment, gate CI04-governance/05-classification-masking-compliance
Incremental loads with persistent watermark state02-performance/01-incremental-watermark
One run, three views: trace + cost + replay06-developer-experience/17-trace-replay-cost-combo

Each link opens the category folder on GitHub, where every POC has a README that states what it shows, the exact command, and the expected output.

DSL syntax, materialization basics, branches and replay, file ingest, per-tenant routing, and the plan/apply deployment workflow. Credential-free (DuckDB).

Contracts, inline checks, anomaly detection, local testing, SCD-2 snapshots, the standalone quality pipeline, and freshness SLAs. Credential-free (DuckDB).

Incremental, merge, partition checksums, drift recovery, ephemeral CTEs, delete+insert, adaptive concurrency, cost and budgets, and EXPLAIN-based estimation. Credential-free (DuckDB).

Model generation, intent extraction, schema-change sync, test generation, schema-grounded validation, and MCP data-grounding. Most need an ANTHROPIC_API_KEY.

Unity Catalog grants, schema patterns, workspace isolation, tagging, classification and masking, retention, and cross-team contracts. Some need a Databricks workspace; classification/masking and retention run on DuckDB.

Shell hooks, webhook presets, remote state, checkpoint/resume, the Valkey cache, Dagster DAG mode, the circuit breaker, and idempotency keys. DuckDB, with a few using Docker.

Column lineage, the HTTP API, dbt import, shadow mode, CI, trace Gantt views, portability lint, PR preview and data diff, and lineage-diff. Credential-free (DuckDB).

Snowflake dynamic tables, Databricks materialized views, Fivetran discovery, a custom process adapter, BigQuery, a Rust-native adapter skeleton, and Trino via Docker. Most need the matching warehouse; the process-adapter and Rust-skeleton POCs are credential-free.

The complete, always-current list with a one-line description per POC lives in examples/playground/README.md. It also documents the benchmark suite comparing Rocky against dbt Core, dbt Fusion, and PySpark.