What is an Iceberg table?
💡 Model Answer
Apache Iceberg is an open table format for huge analytic datasets. It stores data in a columnar format (Parquet, ORC, Avro) and keeps a separate metadata layer that tracks schema, partitioning, and snapshots. This design enables ACID transactions, schema evolution, and efficient pruning of data during queries. Iceberg tables are often used with engines like Spark, Hive, Presto, or Trino, and they integrate with cloud storage such as S3 or ADLS. The metadata is stored in a manifest list that can be queried quickly, allowing fast reads and writes even for petabyte‑scale tables. Because Iceberg separates data files from metadata, it supports features like time‑travel, incremental loads, and concurrent writes without locking the entire table.
This answer was generated by AI for study purposes. Use it as a starting point — personalize it with your own experience.
🎤 Get questions like this answered in real-time
Assisting AI listens to your interview, captures questions live, and gives you instant AI-powered answers — invisible to screen sharing.
Get Assisting AI — Starts at ₹500