Data lineage tools for Amazon RDS

Data lineage tools are software for extracting, viewing, and analyzing data lineage. Data lineage is the process of understanding and visualizing how data flows from its source to its various destinations. It lets you build a map of the data's journey through the entire ecosystem.

Dataedo

Dataedo allows you to extract lineage automatically or design flows manually and visualize how data moves through the system with interactive diagrams. Dataedo supports object and column-level data lineage.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: No

MANTA

MANTA is a data lineage platform that automatically scans your data environment to build a map of all data flows and deliver it through a native UI and other channels to both technical and non-technical users. The automated scanning is meant to keep the lineage accurate and up to date.

BI Tools lineage: Yes
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: Yes
NoSQL: No
Pipelines lineage: Yes
RDBMS: Yes

Truedat

Truedat is an open source data governance tool that gives you an end-to-end view of your data from both a business and a technical point of view. The Truedat data lineage module visualizes the information life cycle and the interconnections between the organization's systems. This provides complete traceability of the data and supports impact analysis when data structures or processes change.

BI Tools lineage: Yes
Commercial: Free
Data migration tools lineage: No
Data warehouses lineage: No
ETLs: No
Free edition: Yes
Hadoop: Yes
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

Tokern

Tokern Lineage Engine is a fast, easy-to-use platform for collecting, visualizing, and analyzing column-level data lineage in databases, data warehouses, and data lakes in AWS and GCP. You can use the API or library to access column-level lineage and to automate data quality triage, scanning and tagging of PII/PHI/sensitive data, programmatic monitoring and management of ACLs, cleanup of data and ETL pipelines, and impact analysis.

BI Tools lineage: No
Commercial: Free
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: Yes
Hadoop: No
NoSQL: No
Pipelines lineage: Yes
RDBMS: Yes

Data lineage forms the foundation for accurate data analytics and management. The core features of data lineage focus on:

• Identifying data quality issues.
• Performing root cause analysis.
• Showing which data sources are outdated and which datasets are relevant.
• Minimizing the risk of migration projects.
• Providing transparency over the life cycle of data.
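To make root cause analysis and impact analysis concrete, here is a minimal Python sketch that treats column-level lineage as a directed graph and walks it in either direction. It does not use the API of any tool listed here; the column names, the `EDGES` list, and the `upstream` and `downstream` helpers are all hypothetical and exist only for illustration.

```python
from collections import defaultdict, deque

# Hypothetical column-level lineage edges: (source column, target column).
# A real tool would extract these from SQL, ETL jobs, or BI metadata.
EDGES = [
    ("rds.orders.amount", "staging.orders.amount"),
    ("rds.orders.customer_id", "staging.orders.customer_id"),
    ("staging.orders.amount", "warehouse.daily_revenue.total"),
    ("staging.orders.customer_id", "warehouse.customer_stats.customer_id"),
    ("warehouse.daily_revenue.total", "bi.revenue_dashboard.total"),
]


def build_graph(edges, reverse=False):
    """Build an adjacency map; reverse=True flips every edge."""
    graph = defaultdict(set)
    for src, dst in edges:
        if reverse:
            src, dst = dst, src
        graph[src].add(dst)
    return graph


def reachable(graph, start):
    """Breadth-first search: every node reachable from start."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def downstream(edges, column):
    """Impact analysis: columns affected if `column` changes."""
    return reachable(build_graph(edges), column)


def upstream(edges, column):
    """Root cause analysis: columns that feed into `column`."""
    return reachable(build_graph(edges, reverse=True), column)


if __name__ == "__main__":
    print(sorted(downstream(EDGES, "rds.orders.amount")))
    print(sorted(upstream(EDGES, "bi.revenue_dashboard.total")))
```

Calling `downstream` on a source column lists everything a schema change or migration could break, while calling `upstream` on a report column lists the candidates to inspect when that report shows bad data.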

Data lineage tools map the data flow and help you understand where the data originated and how it flows and is transformed. To help you find the right tool for your company, we prepared a list of some of the best data lineage tools.