SQL Lineage Tools for Apache Parquet
Dataedo
Dataedo provides SQL lineage tracking by visualizing how data flows through queries, transformations, and databases. This enhances transparency, supports troubleshooting, and ensures data accuracy across SQL-based environments.
| Automatic discovery: | 
                             | 
                    
|---|---|
| Data flow visualization: | 
                             | 
                    
| Environment: | On-premises | 
| Free edition: | 
                             | 
                    
| Metadata management: | 
                             | 
                    
| Version control integration: | 
                             | 
                    
Informatica Data Lineage
Informatica Data Lineage tool provides automated end-to-end data lineage with detailed and summary views of data movement across data pipelines. With Informatica, you can derive lineage from code in SQL scripts, stored procedures and AI/ML code. It streamlines tracking data flow from system- to column-level for detailed impact analysis.
| Automatic discovery: | 
                             | 
                    
|---|---|
| Data flow visualization: | 
                             | 
                    
| Environment: | Online | 
| Free edition: | 
                             | 
                    
| Metadata management: | 
                             | 
                    
| Version control integration: | 
                             | 
                    
Secoda
Secoda provides end-to-end data lineage across your entire data stack. It automates column and table level data lineage. In additional, Secoda also brings in tests, events, and ETL into data lineage. All of Secoda's lineage is automated, but users can also manually contribute to lineage using Secoda's API.
| Automatic discovery: | 
                             | 
                    
|---|---|
| Data flow visualization: | 
                             | 
                    
| Environment: | Online | 
| Free edition: | 
                             | 
                    
| Metadata management: | 
                             | 
                    
| Version control integration: | 
                             | 
                    
                                
                                Amazon Redshift
                            
                                
                                Azure SQL Database
                            
                                
                                DBT
                            
                                
                                Google Big Query
                            
                                
                                IBM DB2
                            
                                
                                MariaDB
                            
                                
                                SAP HANA
                            
                                
                                Snowflake
                            
                                
                                SQLite
                            
                                
                                Teradata