
What is the Medallion Architecture? Can you explain?

🟡 Medium · Conceptual · Mid level
Times asked: 1
Last seen: Jun 2026
First seen: Jun 2026

💡 Model Answer

The Medallion Architecture is a layered design pattern for data lakes and lakehouses in which data quality improves step by step as data moves from one layer to the next, supporting governance and reusability. It consists of three primary layers:

  1. Bronze (Raw) – Ingested data in its native format (JSON, Parquet, CSV, etc.) with minimal transformation. This layer preserves the original source for auditability and debugging.
  2. Silver (Cleansed) – Data is cleaned, deduplicated, and enriched: business rules are applied, missing values are handled, and a schema is enforced. The goal is to produce a reliable, query‑ready dataset.
  3. Gold (Business‑Ready) – Aggregated, curated, and optimized for specific analytics or reporting use cases. This layer often contains aggregate tables or materialized views that feed dashboards, reports, and machine‑learning training sets.
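
The three layers above can be sketched in a few lines of plain Python. This is only an illustrative toy under stated assumptions: the sample order records, the field names, and the helper functions `to_silver` and `to_gold` are hypothetical, and a real pipeline would normally run on an engine such as Spark rather than on in-memory lists.

```python
from collections import defaultdict

# Bronze: raw records kept exactly as ingested (hypothetical sample data).
bronze = [
    {"order_id": "1", "customer": " alice ", "amount": "10.50"},
    {"order_id": "1", "customer": " alice ", "amount": "10.50"},  # duplicate
    {"order_id": "2", "customer": "Bob", "amount": None},         # missing amount
    {"order_id": "3", "customer": "alice", "amount": "4.50"},
]


def to_silver(raw_rows):
    """Deduplicate on order_id, enforce types, and drop rows missing an amount."""
    seen = set()
    silver_rows = []
    for row in raw_rows:
        if row["order_id"] in seen or row["amount"] is None:
            continue
        seen.add(row["order_id"])
        silver_rows.append({
            "order_id": int(row["order_id"]),
            "customer": row["customer"].strip().lower(),
            "amount": float(row["amount"]),
        })
    return silver_rows


def to_gold(silver_rows):
    """Aggregate revenue per customer for reporting."""
    totals = defaultdict(float)
    for row in silver_rows:
        totals[row["customer"]] += row["amount"]
    return dict(totals)


silver = to_silver(bronze)  # cleaned, typed, deduplicated
gold = to_gold(silver)      # business-level aggregate
print(gold)                 # {'alice': 15.0}
```

Note that `bronze` is never modified: the raw records stay available for audits or for rebuilding Silver and Gold if the cleansing rules change.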

Key benefits include:

  • Incremental Processing – Each layer can be refreshed independently, and usually only new or changed records need to be processed, which reduces compute costs.
  • Data Lineage – Clear traceability from raw to business‑ready data.
  • Governance – Policies can be applied at each stage (e.g., restricting access to raw data in Bronze, masking sensitive fields and enforcing quality rules in Silver).
  • Reusability – Multiple downstream consumers can share the same Silver or Gold datasets.
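
As a rough illustration of the incremental processing benefit, the sketch below refreshes a Silver list using a simple watermark, so each run touches only Bronze rows that arrived since the previous run. The `ingested_at` field, the `incremental_refresh` helper, and the sample rows are assumptions made for this example, not part of any particular framework.

```python
def incremental_refresh(bronze_rows, silver_rows, watermark):
    """Append only Bronze rows ingested after the watermark; return the new watermark."""
    new_rows = [r for r in bronze_rows if r["ingested_at"] > watermark]
    silver_rows.extend({"id": r["id"], "value": r["value"].strip()} for r in new_rows)
    return max((r["ingested_at"] for r in new_rows), default=watermark)


bronze = [
    {"id": 1, "value": " a ", "ingested_at": 1},
    {"id": 2, "value": " b ", "ingested_at": 2},
]
silver = []

# First run: both rows are newer than the initial watermark of 0.
watermark = incremental_refresh(bronze, silver, watermark=0)

# A new raw row arrives; the second run processes only that row.
bronze.append({"id": 3, "value": " c ", "ingested_at": 3})
watermark = incremental_refresh(bronze, silver, watermark)
```

After the second run, `silver` holds three cleaned rows and `watermark` is 3, even though rows 1 and 2 were processed only once.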

Implementations typically pair a compute engine such as Apache Spark with a table format such as Delta Lake (or a similar technology) that supports ACID transactions, schema evolution, and time travel across layers.
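
To make the idea of time travel concrete, here is a deliberately simplified toy in plain Python. The `VersionedTable` class is hypothetical and is not the Delta Lake API. Real table formats track versions through a transaction log rather than by copying full snapshots, but the reader-facing behavior is similar in spirit: each commit produces a new version, and older versions remain readable.

```python
class VersionedTable:
    """Toy append-only table: every commit stores a full snapshot."""

    def __init__(self):
        self._snapshots = [[]]  # version 0 is the empty table

    def commit(self, rows):
        """Publish a new version containing the previous rows plus `rows`."""
        self._snapshots.append(self._snapshots[-1] + list(rows))
        return len(self._snapshots) - 1

    def read(self, version=None):
        """Read the latest version, or an earlier one when `version` is given."""
        snapshot = self._snapshots[-1] if version is None else self._snapshots[version]
        return list(snapshot)


table = VersionedTable()
v1 = table.commit([{"id": 1}])
v2 = table.commit([{"id": 2}])

latest = table.read()             # both rows
as_of_v1 = table.read(version=v1)  # only the first row
```

Reading an older version like this is what makes it possible to debug a bad load or reproduce a report as it looked at an earlier point in time.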

This answer was generated by AI for study purposes. Use it as a starting point — personalize it with your own experience.
