Data lineage tools

IBM InfoSphere Information Governance Catalog

IBM InfoSphere Information Governance Catalog is a web-based tool that allows you to explore, understand, and analyze information. It lets you run data lineage to create trusted information that supports data governance and compliance efforts. You can perform lineage analysis to understand where data comes from or goes to by using shared table information, job design information, or operational metadata from job runs.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

Kylo

Kylo is an open source enterprise-ready data lake management software platform. It lets you search and explore data and metadata, view lineage, and profile statistics. Visual process lineage and provenance provide confidence in the origin of data. Automatic data profiling provides capabilities for data scientists and assurance in data quality.

BI Tools lineage: No
Commercial: Free
Data migration tools lineage: No
Data warehouses lineage: No
ETLs: No
Free edition: Yes
Hadoop: Yes
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

Keboola

Keboola is a cloud-based data integration platform that helps clients combine, enhance, and publish crucial information for their internal analytics projects and data products in a quick and easy fashion. It collects all kinds of operational metadata, describing user activity, job activity, data flow, schema evolution, data pipeline performance, compliance with a client’s security rules, etc. Based on the metadata, we are able to build data lineage on the fly and automatically. This makes it possible to understand where the data is coming from and how it is used, both for analytical and regulatory purposes.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: Yes
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes