SQL Lineage Tools for Apache Spark SQL

Dataedo

Dataedo lets you extract lineage automatically or design flows manually, then visualize how data moves through the system with interactive diagrams.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: On-premises
Free edition: No
Metadata management: Yes
Version control integration: Yes

Informatica Data Lineage

The Informatica Data Lineage tool provides automated end-to-end data lineage, with detailed and summary views of data movement across data pipelines. With Informatica, you can derive lineage from code in SQL scripts, stored procedures, and AI/ML code. It streamlines tracking of data flow from the system level down to the column level for detailed impact analysis.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: Yes