Data lineage tools

Collibra Data Lineage

Collibra Data Lineage automatically maps relationships between data to show how data flows from system to system and how data sets are built, aggregated, sourced and used, providing complete, end-to-end lineage visualization.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes

MANTA

MANTA is a data lineage platform that automatically scans your data environment to build a powerful map of all data flows and deliver it through a native UI and other channels to both technical and non-technical users. It automatically scans every nook and cranny to get immediate, accurate, and up-to-date lineage.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: No
Pipelines lineage: Yes
RDBMS: Yes

SQLFlow

SQLFlow is a SQL data lineage tool that provides a visual representation of the overall flow of data. It offers automated SQL data lineage analysis across database, ETL, business intelligence, cloud, and Hadoop environments by parsing SQL scripts and stored procedures. It enables granular impact analysis, with drill-down into table-, column-, and query-level lineage.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: No
Pipelines lineage: No
RDBMS: Yes
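
To make the idea of deriving lineage by parsing SQL (as described for SQLFlow above) more concrete, here is a minimal sketch in Python. It is not SQLFlow's implementation or API: the function name `table_lineage` and the sample table names are hypothetical, and the regular expressions only cover simple `INSERT INTO ... SELECT` and `CREATE TABLE ... AS SELECT` statements. A real tool would rely on a full SQL parser to handle subqueries, CTEs, quoting, and dialect differences.

```python
import re
from typing import Dict, Set

# Hypothetical helper: table-level lineage from simple SQL statements.
TARGET_RE = re.compile(r"\b(?:insert\s+into|create\s+table)\s+([\w.]+)", re.I)
SOURCE_RE = re.compile(r"\b(?:from|join)\s+([\w.]+)", re.I)


def table_lineage(sql: str) -> Dict[str, Set[str]]:
    """Map each written table to the set of tables it reads from."""
    lineage: Dict[str, Set[str]] = {}
    # Naive statement split; assumes no semicolons inside string literals.
    for statement in sql.split(";"):
        target = TARGET_RE.search(statement)
        if target is None:
            continue  # not a statement that writes to a table
        sources = set(SOURCE_RE.findall(statement))
        lineage.setdefault(target.group(1), set()).update(sources)
    return lineage


script = """
INSERT INTO sales_summary
SELECT c.region, SUM(o.amount)
FROM orders o JOIN customers c ON o.customer_id = c.id
GROUP BY c.region;

CREATE TABLE report AS SELECT * FROM sales_summary;
"""

for table, sources in table_lineage(script).items():
    print(table, "<-", sorted(sources))
```

Chaining the resulting mappings (here `report` depends on `sales_summary`, which depends on `orders` and `customers`) is what allows a change in a source table to be traced to everything built from it.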

ASG Data Intelligence

ASG Data Intelligence (ASG DI) is the solution for data distrust. It is a metadata-driven platform that makes technical data “smarter” with end-to-end views of the data and its movements (data lineage) combined with business meaning and usage guardrails. It lets you visualize data flows mapped to business context, and it uniquely traces lineage by parsing code from data sources, applications, tools, and source code.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Global IDs

Global IDs Data Lineage provides automated analysis of the actual flow of data through your enterprise, enabling you to understand, in real time, where data originates, how it flows through the ecosystem, and how it is transformed en route.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Alteryx Connect

Alteryx Connect uses powerful search capabilities to find and reuse information contained in data files, databases, visualizations, dashboards, workflows, analytic apps, and more. It lets you automatically capture and visualize data lineage between assets, improving the overall quality and reliability of shared information between data, process, and people. You can get technical data lineage by loading metadata from source and target systems and interpreting Alteryx workflows.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

openAudit

By combining data lineage, audit log analysis, and other techniques, openAudit shows on a single screen the operational sources of each data item and its end uses: who is viewing the data, how, and when. By jointly analyzing different data processing technologies, openAudit makes it possible to understand end-to-end data flows across multiple technologies and to zoom in on the underlying code. Breaks in lineage caused by views and dynamic procedures are handled.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: No
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

Informatica Metadata Manager

Informatica Metadata Manager is a web-based metadata management tool. You can view data lineage for objects in the Metadata Manager warehouse. Data lineage shows the origin of the data, describes the path, and shows how it arrives at the target. Use data lineage to analyze data flow and troubleshoot data transformation errors.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

MetaCenter

MetaCenter automates data lineage analysis across Databases, ETL, Business Intelligence, Cloud, and Hadoop environments. It lets you reduce data management costs by automating data lineage and impact analysis documentation.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Azure Purview

Azure Purview is a unified data governance solution that offers automated data discovery, lineage identification, and data classification. It lets you understand the origin of your data with interactive data lineage visualization.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: No
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes

OvalEdge

OvalEdge offers a comprehensive lineage solution that shows the complete data cycle. OvalEdge algorithms parse various kinds of source code to build the lineage automatically, and experts then enrich it with proper descriptions.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes

Truedat

Truedat is an open source data governance tool that gives you an end-to-end view of your data from both a business and a technical point of view. Its data lineage module visualizes the information life cycle and the interconnections between the organization's systems, providing complete traceability of the data as well as impact analysis for possible changes in data structures or processes.

BI Tools lineage: Yes
Commercial: Free
Data migration tools lineage: No
Data warehouses lineage: No
ETLs: No
Free edition: Yes
Hadoop: Yes
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

ER/Studio

ER/Studio is an enterprise data modeling, architecture, and governance tool. It comes with a data lineage tab that is used primarily to document ETL processes from scratch. With visual data lineage support, you can document source/target mappings and sourcing rules for data movement across systems.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Dremio

Dremio is a SQL Lakehouse Platform built from the ground up to deliver high-performing BI dashboards and interactive analytics directly on the data lake. It offers effective data lineage support, as the relationships between your data sources, virtual datasets, and all your queries are maintained in Dremio’s data graph, telling you exactly where each dataset came from.

BI Tools lineage: Yes
Commercial: Free
Data migration tools lineage: No
Data warehouses lineage: No
ETLs: No
Free edition: Yes
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes
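
The idea of a data graph that tells you where each dataset came from can be illustrated with a small sketch. This is not Dremio's API or internal representation: the `parents` mapping, the function name `upstream_sources`, and the dataset names are all hypothetical. The sketch assumes lineage is stored as a mapping from each dataset to the datasets it is built from, and walks that mapping breadth-first to collect every direct and indirect source.

```python
from collections import deque
from typing import Dict, List, Set


def upstream_sources(parents: Dict[str, List[str]], dataset: str) -> Set[str]:
    """Return every dataset that `dataset` is derived from, directly or indirectly."""
    seen: Set[str] = set()
    queue = deque(parents.get(dataset, []))
    while queue:
        node = queue.popleft()
        if node in seen:
            continue  # already visited; also guards against cycles
        seen.add(node)
        queue.extend(parents.get(node, []))
    return seen


# Hypothetical lineage graph: each key is built from the datasets in its list.
graph = {
    "dashboard_view": ["sales_clean", "customers"],
    "sales_clean": ["raw_sales"],
}

print(sorted(upstream_sources(graph, "dashboard_view")))
```

Reversing the edges of the same mapping and running the same traversal gives the downstream view, which is the basis of impact analysis.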

erwin Data Catalog

erwin Data Catalog automates enterprise metadata management, data mapping, code generation, and data lineage for faster time to value and greater accuracy for data movement and deployment projects. It lets you generate on-demand lineage down to the column level and visualize data flows from source systems all the way to the reporting layers, including all transformations. Fully configurable and navigable lineage diagrams provide high-level business views as well as detailed technical depictions.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes