All the data in an organization has a story; data lineage is about telling the story of that data as it travels through the various systems and platforms. So, data lineage is metadata over time, in that it reveals the “who, what, where, when, why, and how” of data. This course examines the core concepts of data lineage, including: definitions, types of data lineage, why organizations need to be concerned about data lineage, important use cases, getting started, and more.
References
-
-
A guide to data lineage best practices and processes.🔗Segment
-
Learn about data lineage and how companies are using it to improve business insights.🔗ibm.com
-
Data lineage is a must-have feature of the modern data stack, yet we're struggling to derive value from it. Here's why and how we can fix this.🔗Monte Carlo Data
-
Data lineage is the process of understanding and visualizing data flows from source to current location and tracking changes made to the data on its journey.🔗Qlik
-
Data lineage records data flows through your company, helping you improve data quality, security, compliance, and decision-making.🔗Starburst
-
Learn what data lineage is and how lineage tools help simplify data governance and data quality processes by tracking data flows and changes to data sets.🔗Data Management