Back to glossaryDefinition

What is Data Lineage?

The tracking of data's origins, movements, and transformations throughout its lifecycle.

Data lineage tracks how data flows from its original source through various transformations, systems, and processes to its final destination. It answers critical questions: Where did this data come from? How was it transformed? Who changed it? What downstream systems depend on it? Lineage is essential for regulatory compliance (proving data handling), impact analysis (understanding what breaks when a source changes), debugging (tracing data quality issues to their root cause), and trust (validating that analytics are built on reliable foundations).

Put this into practice

Assess your maturity, discover initiatives, and build your transformation roadmap.

Start free assessment