Data lineage tools for AWS Glue Data Catalog

Data lineage tools are software that allows to extract, view and analyze data lineage. Data lineage is the process of understanding and visualizing data flow from the source to different destinations. It allows to create a map of the data journey through the entire ecosystem.

Dataedo

Dataedo allows you to extract lineage automatically or design flows manually and visualize how data moves through the system with interactive diagrams. Dataedo supports object and column-level data lineage.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: No
Dataedo Data Lineage
Dataedo Data Lineage Tableau
Dataedo Data Lineage Tableau 2
Dataedo Data Lineage Snowflake
Dataedo Data Lineage PowerBI

erwin Data Catalog

erwin Data Catalog automates enterprise metadata management, data mapping, code generation, and data lineage for faster time to value and greater accuracy for data movement and deployment projects. It lets you generate on-demand lineage down to the column level and visualize data flows from source systems all the way to the reporting layers, including all transformations. Fully configurable and navigable lineage diagrams provide high-level business views as well as detailed technical depictions.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Collibra Data Lineage

Collibra Data Lineage automatically maps relationships between data to show how data flows from system to system and how data sets are built, aggregated, sourced and used, providing complete, end-to-end lineage visualization.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes

Octopai

Octopai Data Lineage XD is a complete, in-depth, and trustworthy automated lineage tool. With 3 different types of lineage, you’ll find everything you need in one easy-to-use platform. The 3 linage types include Cross-System Lineage (provides end-to-end lineage at the system level from the entry point into the BI landscape, all the way to reporting and analytics),
End-to-End Column Lineage (view column to column-level lineage between systems from the entry point into the BI landscape, all the way through to reporting and analytics), and Inner-System Lineage (details the column-level lineage within an ETL process, report, or database object).

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

Data lineage forms the foundation for accurate data analytics and management. The core features of data lineage focuses on:

• Identifying data quality issues.
• Performing root cause analysis.
• Enabling to understand which data sources are outdated or which datasets are relevant.
• Minimizing the risk of migration projects.
• Providing transparency over the life cycle of data.

Data lineage tools map the data flow and help you understand where the data originated, how it flows and transforms. To help you find the right tool for your company, we prepared a list that includes some of the best data lineage tools.