Medallion Architecture

DataFlow AI builds and enforces the Medallion Data Architecture natively. Bronze holds raw historical ingestion, Silver holds strictly typed, validated data, and Gold holds aggregated metrics ready for BI queries.

The Three Pillars of Reliability

The Medallion Architecture, a pattern pioneered by Databricks, organizes lakehouse storage into a strict progression of logical zones. Each zone builds only on the one before it, which keeps computation reproducible and data quality consistent.

Many tools treat this architecture as an optional guideline. DataFlow AI builds the pattern directly into its core execution engine, which removes the risk of recursive dependency failures between layers.
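
To illustrate why a strict layer progression rules out recursive dependencies, here is a minimal sketch. It is not DataFlow AI's engine code: the `Layer` enum, `check_dependency`, and `validate_pipeline` are hypothetical names, and the sketch assumes a table may only read from a strictly earlier layer.

```python
from enum import IntEnum


class Layer(IntEnum):
    """Hypothetical layer ordering: lower values are earlier zones."""
    BRONZE = 1
    SILVER = 2
    GOLD = 3


def check_dependency(upstream: Layer, downstream: Layer) -> None:
    """Raise if a table reads from a layer that is not strictly earlier."""
    if upstream >= downstream:
        raise ValueError(
            f"{downstream.name} table may not read from {upstream.name}"
        )


def validate_pipeline(tables: dict, reads: list) -> bool:
    """Check every (source, target) read in the pipeline.

    `tables` maps table name to Layer; `reads` is a list of
    (source_table, target_table) pairs. If every edge moves strictly
    forward, no chain of reads can loop back on itself.
    """
    for source, target in reads:
        check_dependency(tables[source], tables[target])
    return True
```

Because every accepted edge increases the layer value, a cycle would require at least one backward or same-layer edge, and the check rejects both. A real engine might allow reads within a layer; that case is deliberately left out here.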

Bronze Base

The unopinionated raw staging layer. Data arriving from Kafka CDC streams or periodic JSON API snapshots is written here as append-only, immutable Parquet files.

  • Original records are retained indefinitely.
  • No destructive transformations are permitted.
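
The append-only rule can be sketched with the standard library alone. This example is an assumption-laden stand-in: it writes JSON Lines instead of Parquet to avoid third-party dependencies, and `append_batch` and its file naming scheme are hypothetical, not part of DataFlow AI.

```python
import json
from pathlib import Path


def append_batch(bronze_dir: Path, source: str, records: list) -> Path:
    """Write one batch as a new file; never modify an existing one."""
    bronze_dir.mkdir(parents=True, exist_ok=True)
    # Number the new file after the batches already stored for this source.
    existing = len(list(bronze_dir.glob(f"{source}-*.jsonl")))
    path = bronze_dir / f"{source}-{existing:06d}.jsonl"
    # Mode "x" raises FileExistsError if the file already exists,
    # so a stored batch can never be silently overwritten.
    with open(path, "x", encoding="utf-8") as fh:
        for record in records:
            fh.write(json.dumps(record) + "\n")
    return path
```

Each call adds a file and leaves earlier files untouched, so the full ingestion history stays available for replay. This sketch is not safe for concurrent writers; a production store would need a coordinated naming scheme.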

Silver Cleansed

The organization's central source of truth. At this intermediate layer, strict YAML Data Contracts drop null rows, resolve casting errors, and mask PII.

  • Filtered, columnar datasets.
  • Enforces Data Quality tests.
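
As a rough sketch of what a contract check at this layer might do, the example below applies null checks, type casts, and PII masking to a list of rows. The contract layout is hypothetical: it is written as the Python dict a YAML file could parse into, and the real DataFlow AI contract schema is not shown in this page. The sketch also assumes that a null in any contracted column rejects the row.

```python
import hashlib

# Hypothetical contract: column name -> target type and optional PII flag.
CONTRACT = {
    "columns": {
        "order_id": {"type": int},
        "amount": {"type": float},
        "email": {"type": str, "pii": True},
    }
}


def mask(value: str) -> str:
    """Replace a PII value with a short one-way hash."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()[:12]


def apply_contract(rows, contract):
    """Return (clean_rows, rejected_rows) for the given contract."""
    clean, rejected = [], []
    for row in rows:
        out = {}
        try:
            for name, spec in contract["columns"].items():
                value = row.get(name)
                if value is None:
                    raise ValueError(f"{name} is null")
                value = spec["type"](value)  # cast; raises on bad input
                out[name] = mask(value) if spec.get("pii") else value
        except (ValueError, TypeError):
            rejected.append(row)  # keep the original row for inspection
            continue
        clean.append(out)
    return clean, rejected
```

Returning the rejected rows alongside the clean ones is a design choice of this sketch: the rows are still excluded from Silver, but they remain visible for debugging instead of disappearing silently.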

Gold Aggregation

Business-level star schemas tuned for executive dashboards in tools such as Tableau, Microsoft Power BI, or DataFlow AI's own AI Copilot module.

  • Denormalized metrics.
  • Fast, parallelized reads.
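
A Gold table can be thought of as a pre-computed, flat answer to a dashboard question. The sketch below aggregates cleaned order rows into one row per date and region. The function name `build_daily_revenue` and the column names (`order_date`, `region`, `amount`) are assumptions made for illustration only.

```python
from collections import defaultdict


def build_daily_revenue(silver_orders):
    """Aggregate cleaned orders into one flat row per (order_date, region)."""
    totals = defaultdict(lambda: {"order_count": 0, "revenue": 0.0})
    for order in silver_orders:
        key = (order["order_date"], order["region"])
        totals[key]["order_count"] += 1
        totals[key]["revenue"] += order["amount"]
    # Flatten the grouping keys into each row so a BI tool needs no joins.
    return [
        {"order_date": day, "region": region, **metrics}
        for (day, region), metrics in sorted(totals.items())
    ]
```

Because the grouping keys and metrics live in the same row, a dashboard can read the result directly without joining back to Silver tables.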

© 2026 DataFlow AI Docs