Skip to main content
The data catalog is your searchable inventory of tables, streams, pipelines, and related assets. Teams document purpose, freshness, ownership, and sensitivity so newcomers and adjacent teams stop relying on tribal knowledge.

Enterprise feature

Full catalog capabilities—automated harvesting from connections, curated business glossary, and team-scoped visibility—ship with Enterprise plans. Lower tiers may expose read-only subsets or manual entries only. Ask your admin if harvesting jobs are enabled for your warehouse and lake connections.

Table discovery

Planasonix can crawl connected systems on a schedule you define:
  • Schemas, tables, and columns with inferred types
  • Row count or profile snapshots where permitted
  • Tags suggested from naming patterns (you approve before promotion)
Harvest jobs use read-only credentials. Large estates should scope crawls to approved databases or schemas to control cost.
Register Snowflake, BigQuery, Redshift, Databricks SQL, and similar endpoints; choose databases to include.
Use keyword search across names, descriptions, column names, and tags. Filters narrow by domain, owner, freshness, certification status, or sensitivity. Save frequent searches for onboarding checklists.
Standardize tags (domain:finance, pii:email) early so search stays usable as the catalog grows.

Documentation

Each asset supports markdown descriptions, column-level notes, and links to runbooks or dbt model pages. Certified assets display a badge when stewards approve accuracy.
Mark tables as deprecated with a replacement link and sunset date; impact analysis highlights downstream pipelines still referencing legacy names.
Tag regulated fields; integrate with access reviews so catalog truth matches actual grants.

Data contracts

Enforce expectations on cataloged assets.

Lineage

See how catalog tables connect to pipelines.