Data lineage tools for Amazon Aurora

Data lineage tools are software that allows to extract, view and analyze data lineage. Data lineage is the process of understanding and visualizing data flow from the source to different destinations. It allows to create a map of the data journey through the entire ecosystem.

Dataedo

Dataedo allows you to extract lineage automatically or design flows manually and visualize how data moves through the system with interactive diagrams. Dataedo supports object and column-level data lineage. It improves transparency, supports impact analysis, and ensures data integrity across an organization.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: No
Sysem-level Data Lineage
Data Lineage
Data Lineage 2
Column flow details

Tokern

Tokern Lineage Engine is a fast and easy to use platform to collect, visualize and analyze column-level data lineage in databases, data warehouses and data lakes in AWS and GCP. You can use the API or library to access column-level lineage and automate data quality triage, scan and tag PII/PHI/sensitive data, programmatically monitor and manage ACLs, data and ETL pipeline cleanup, and impact analysis.

BI Tools lineage: No
Commercial: Free
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: Yes
Hadoop: No
NoSQL: No
Pipelines lineage: Yes
RDBMS: Yes

Informatica Metadata Management

Informatica Metadata Manager is a web-based metadata management tool. You can view data lineage for objects in the Metadata Manager warehouse. Data lineage shows the origin of the data, describes the path, and shows how it arrives at the target. Use data lineage to analyze data flow and troubleshoot data transformation errors.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Secoda

Secoda is a data discovery tool that offers an intuitive, collaborative, and easy to implement data discovery built. It automatically extracts queries to generate data lineage. Currently, it is supporting table lineage for Snowflake, dbt, Redshift, and BigQuery, with support for Postgres, MySQL, and Microsoft SQL Server coming soon. Secoda data lineage can help data teams identify the downstream and upstream dependencies of a table easily. On each dependency, you will be able to see how many levels away a particular table is, with the ability to view the data in a visual form coming soon.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: No
Free edition: No
Hadoop: No
NoSQL: No
Pipelines lineage: No
RDBMS: Yes
Secoda data lineage tool

Alteryx Connect

Alteryx Connect uses powerful search capabilities to find and reuse information contained in data files, databases, visualizations, dashboards, workflows, analytic apps, and more. It lets you automatically capture and visualize data lineage between assets, improving the overall quality and reliability of shared information between data, process, and people. You can get technical data lineage by loading metadata from source and target systems and interpreting Alteryx workflows.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Talend Data Catalog

Talend Data Catalog gives your organization a single, secure point of control for your data. Its data flow lineage feature allows you to narrow in on specific objects and shows you how these objects are related to each other, within a model, an external metadata repository, or a configuration. The data flow lineage is based upon connection definitions to data stores and physical transformation rules which transform and move the data.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: No
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes

Atlan

Atlan provides effortless data lineage & governance by letting you auto-construct data lineage & deploy best-in-class data access governance without compromising on data democratization. It automatically parses through your SQL query logs in your data warehouses and BI tools to create a visual view of data lineage.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: No
Pipelines lineage: Yes
RDBMS: Yes

erwin Data Catalog

erwin Data Catalog automates enterprise metadata management, data mapping, code generation, and data lineage for faster time to value and greater accuracy for data movement and deployment projects. It lets you generate on-demand lineage down to the column level and visualize data flows from source systems all the way to the reporting layers, including all transformations. Fully configurable and navigable lineage diagrams provide high-level business views as well as detailed technical depictions.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Data lineage forms the foundation for accurate data analytics and management. The core features of data lineage focuses on:

• Identifying data quality issues.
• Performing root cause analysis.
• Enabling to understand which data sources are outdated or which datasets are relevant.
• Minimizing the risk of migration projects.
• Providing transparency over the life cycle of data.

Data lineage tools map the data flow and help you understand where the data originated, how it flows and transforms. To help you find the right tool for your company, we prepared a list that includes some of the best data lineage tools.