Stream processing nodes sit between your sources and destinations to parse payloads, filter noise, enrich records, and aggregate over time. You build a graph the same way you build batch pipelines, but nodes operate on unbounded input with explicit state and window semantics.

Stream nodes

Common node categories include:
  • Ingress / parser – Deserialize Avro, Protobuf, JSON, or CSV; validate envelopes and headers.
  • Filter and route – Drop test traffic, split by tenant, or branch poison messages to a DLQ path.
  • Map and enrich – Join reference data from a low-latency cache or warehouse snapshot table (with staleness you accept).
  • Aggregate – Compute counts, sums, or distinct estimates over windows.
  • Sink – Write to warehouses, lakes, queues, or APIs with batching tuned for the destination.
Each node exposes metrics (throughput, error rate, state size) you can chart in Observability.
Keep expensive enrichment off the critical path when possible: precompute dimension tables on a schedule and stream only key lookups.
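A map/enrich node following this pattern can be sketched as below. The dictionary names, `account_id` field, and `enrich` function are illustrative, not platform APIs; the point is that the hot path does only an in-memory key lookup against a snapshot refreshed off the critical path.

```python
# Hypothetical map/enrich node: join events against a precomputed
# dimension snapshot so the hot path does only key lookups.
DIMENSIONS = {  # refreshed on a schedule; staleness is accepted
    "acct-1": {"tier": "gold", "region": "eu"},
    "acct-2": {"tier": "free", "region": "us"},
}

def enrich(event, dimensions=DIMENSIONS):
    """Attach reference attributes by key; tag misses instead of failing."""
    ref = dimensions.get(event["account_id"])
    if ref is None:
        return {**event, "enriched": False}
    return {**event, **ref, "enriched": True}

events = [{"account_id": "acct-1", "amount": 12.5},
          {"account_id": "acct-9", "amount": 3.0}]
out = [enrich(e) for e in events]
```

Tagging misses rather than raising keeps a single unknown key from stalling the node; downstream filters can route untagged records to review.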

Windowing

Windows bound unbounded streams into finite chunks for aggregation:
Tumbling windows are fixed, non-overlapping intervals (for example, 1-minute buckets). Every event belongs to exactly one window.
Choose event time when business logic depends on when something happened in the real world, and configure watermarks and lateness allowances so out-of-order events still land in the correct window. Choose processing time only when approximate, low-latency counts are acceptable.
Late data arriving beyond your lateness allowance is either dropped or diverted to a side output. Document that behavior for compliance reviews.

Real-time transforms

Stateful transforms should be deterministic so replays reproduce the same results, and aggregations should be associative so they can be computed incrementally:
  • Prefer idempotent sinks with natural keys so at-least-once delivery does not duplicate business facts.
  • Use side outputs for bad records instead of failing the whole job when a single malformed event arrives.
  • Cap state growth – Evict TTL'd keys from enrichment caches; avoid unbounded distinct-count state unless you deliberately switch to approximate algorithms.
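The idempotent-sink point can be illustrated with a minimal sketch. `UpsertSink` and the `order_id` natural key are hypothetical; the idea is that redelivery under at-least-once semantics overwrites the same row instead of appending a duplicate:

```python
# Hypothetical idempotent sink: upsert by natural key so replays under
# at-least-once delivery do not duplicate business facts.
class UpsertSink:
    def __init__(self):
        self.rows = {}  # natural key -> record

    def write(self, record):
        # Last-write-wins keyed on order_id; a production sink would also
        # compare versions or timestamps before overwriting.
        self.rows[record["order_id"]] = record

sink = UpsertSink()
sink.write({"order_id": "o-1", "total": 10})
sink.write({"order_id": "o-1", "total": 10})  # redelivery is harmless
```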
Joining two high-volume streams requires keyed state and retention policies. Size state from peak key cardinality, not average.
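A keyed stream-stream join with retention can be sketched as follows. The class, side labels, and retention constant are illustrative; note that state holds one entry per live key per side, which is why sizing must use peak cardinality:

```python
RETENTION = 300  # seconds of keyed state retained per side

class KeyedJoin:
    """Sketch of a two-stream inner join with time-based state eviction."""
    def __init__(self):
        self.state = {"l": {}, "r": {}}  # side -> key -> (event_time, payload)

    def _evict(self, now):
        # Drop keys older than the retention policy to cap state growth.
        for store in self.state.values():
            for k in [k for k, (ts, _) in store.items() if now - ts > RETENTION]:
                del store[k]

    def process(self, side, key, ts, payload):
        """Buffer on one side; emit a pair if the other side holds the key."""
        self._evict(ts)
        self.state[side][key] = (ts, payload)
        other = self.state["r" if side == "l" else "l"].get(key)
        return (payload, other[1]) if other else None

j = KeyedJoin()
first = j.process("l", "k1", 0, "click")       # no match yet
pair = j.process("r", "k1", 10, "purchase")    # joins with the buffered click
```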
Patterns across multiple event types may need sequence matchers; test with recorded topics before production cutover.
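A per-key sequence matcher can be sketched minimally. The `view → cart → buy` pattern and event shape are made up for illustration, and this version keeps partial progress on non-matching events rather than resetting, which a real matcher would make configurable:

```python
# Hypothetical per-key sequence matcher: fire when a key emits
# "view" -> "cart" -> "buy" in order.
PATTERN = ["view", "cart", "buy"]

def match_sequences(events, pattern=PATTERN):
    progress = {}  # key -> index of next expected pattern step
    fired = []
    for key, etype in events:
        i = progress.get(key, 0)
        if etype == pattern[i]:
            i += 1
        if i == len(pattern):
            fired.append(key)  # full sequence observed for this key
            i = 0
        progress[key] = i
    return fired

hits = match_sequences([("u1", "view"), ("u2", "view"),
                        ("u1", "cart"), ("u1", "buy")])
```

Replaying recorded topics through a matcher like this before cutover surfaces ordering assumptions that synthetic tests miss.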

Testing and promotion

Replay captured samples in a staging environment, compare outputs to a reference batch pipeline for the same day, then promote. Pair stream jobs with alerts on lag and error percentage.
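The replay-and-compare check can be expressed as a sketch. Both functions and the sample records are hypothetical; the promotion gate is simply that the incremental streaming logic and the batch reference agree on the same day's input:

```python
# Hypothetical promotion check: run a captured sample through the
# streaming transform and compare against a batch reference.
def stream_daily_counts(events):
    counts = {}
    for e in events:  # incremental, one record at a time, as a stream job would
        counts[e["key"]] = counts.get(e["key"], 0) + 1
    return counts

def batch_daily_counts(events):
    # Whole-day scan, as the reference batch pipeline would compute it.
    return {k: sum(1 for e in events if e["key"] == k)
            for k in {e["key"] for e in events}}

sample = [{"key": "a"}, {"key": "b"}, {"key": "a"}]
assert stream_daily_counts(sample) == batch_daily_counts(sample)
```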

CDC sources

Database and bus ingestion options.

Dead letter queue

Handle persistently bad stream records.