SQL Lineage Tools for Amazon Aurora

Dataedo

Dataedo provides SQL lineage tracking by visualizing how data flows through queries, transformations, and databases. This enhances transparency, supports troubleshooting, and ensures data accuracy across SQL-based environments.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: On-premises
Free edition: No
Metadata management: Yes
Version control integration: Yes
SQL Lineage

Secoda

Secoda provides end-to-end data lineage across your entire data stack. It automates column and table level data lineage. In additional, Secoda also brings in tests, events, and ETL into data lineage. All of Secoda's lineage is automated, but users can also manually contribute to lineage using Secoda's API.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: Yes
Metadata management: Yes
Version control integration: Yes

Tokern

Tokern is a simple to use open source data lineage engine. It automates data engineering tasks with column-level data lineage. You can use the API or library to access column-level lineage and automate data quality triage, scan and tag PII/PHI/sensitive data, programmatically monitor and manage ACLs, data and ETL pipeline cleanup, and impact analysis.

Automatic discovery: Yes
Data flow visualization: No
Environment: On-premises
Free edition: Yes
Metadata management: Yes
Version control integration: No

Informatica Data Lineage

Informatica Data Lineage tool provides automated end-to-end data lineage with detailed and summary views of data movement across data pipelines. With Informatica, you can derive lineage from code in SQL scripts, stored procedures and AI/ML code. It streamlines tracking data flow from system- to column-level for detailed impact analysis.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: Yes

Fivetran

Fivetran is the automated data movement platform moving data out of, into and across your cloud data platforms. It allows you to monitor data movement, logs, and status from connector extract to successful warehouse load through modeling — all in one data lineage graph (DLGs). DLGs show the dependencies between your dbt models so that you can track the flow of data from your connectors to your destination.

Automatic discovery: No
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: No
Version control integration: No