SQL Lineage Tools for Apache Parquet

Dataedo

Dataedo provides SQL lineage tracking by visualizing how data flows through queries, transformations, and databases. This enhances transparency, supports troubleshooting, and ensures data accuracy across SQL-based environments.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: On-premises
Free edition: No
Metadata management: Yes
Version control integration: Yes
SQL Lineage

Informatica Data Lineage

Informatica Data Lineage tool provides automated end-to-end data lineage with detailed and summary views of data movement across data pipelines. With Informatica, you can derive lineage from code in SQL scripts, stored procedures and AI/ML code. It streamlines tracking data flow from system- to column-level for detailed impact analysis.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: Yes

Secoda

Secoda provides end-to-end data lineage across your entire data stack. It automates column and table level data lineage. In additional, Secoda also brings in tests, events, and ETL into data lineage. All of Secoda's lineage is automated, but users can also manually contribute to lineage using Secoda's API.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: Yes
Metadata management: Yes
Version control integration: Yes