Orchestration overview

Building blocks
Design principles
Permissions
Next steps

Orchestration controls when and how pipelines run. You move from manual execution to repeatable operations by attaching schedules, file or cloud event triggers, signed webhooks, and dependencies between pipelines.

Building blocks

Schedules

Run pipelines on cron expressions with explicit time zones.

Triggers

Start runs when objects land in S3, GCS, Azure Blob, or related event feeds.

Webhooks

Invoke pipelines from external systems over HTTPS with optional signatures.

Pipeline chaining

Model multi-step workflows with trigger nodes and dependencies.

Design principles

Idempotency: Assume retries. Design loads so a second run does not corrupt facts (merge keys, dedupe stages).
SLA clarity: Match schedule cadence to business deadlines; faster is not always better if cost or contention rises.
Blast radius: Isolate experimental pipelines on separate triggers so a noisy upstream does not block critical loads.

Permissions

Creating or editing orchestration rules requires roles that can execute the target pipeline and read the connections those pipelines use. Service principals used for cloud triggers need least-privilege policies on bucket notifications or queue subscriptions.

Orchestration metadata (who triggered a run, from which webhook or schedule) appears in run history for audit.

Next steps

Choose a trigger style

Pick schedule vs event vs webhook based on how upstream systems signal readiness.

Prototype runs

Execute manual test runs, then attach automation once outputs stabilize.

Add monitoring

Wire alerts for failures and duration anomalies.

Webhook streams Schedules

Getting started

Platform

Managed Lakehouse

Connections

Pipelines

Nodes

Reverse ETL

AI Copilot

Streaming & CDC

Orchestration

Data governance

Observability

Notebooks

Troubleshooting

Connectors

Integrations

Orchestration overview

Building blocks

Schedules

Triggers

Webhooks

Pipeline chaining

Design principles

Permissions

Next steps

Getting started

Platform

Managed Lakehouse

Connections

Pipelines

Nodes

Reverse ETL

AI Copilot

Streaming & CDC

Orchestration

Data governance

Observability

Notebooks

Troubleshooting

Connectors

Integrations

Documentation Index

​Building blocks

Schedules

Triggers

Webhooks

Pipeline chaining

​Design principles

​Permissions

​Next steps

Building blocks

Design principles

Permissions

Next steps