SQL Lineage Tools for Looker

Collibra Data Lineage

Collibra Data Lineage automatically extracts and maintains lineage from source systems, SQL dialects, ETL tools, and BI tools. It provides detailed technical lineage at the table, column, transformation, and SQL query levels, so you can quickly assess the impact of potential changes. Its interactive lineage diagram also summarizes lineage from source to destination.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: No
Version control integration: No

Alvin

Alvin provides highly accurate column-level lineage and usage data across your entire data stack, along with a host of features that help your team deliver measurable business impact. It lets you search and filter your assets by usage, last used, tags, data sensitivity, parent, platform, type, and pretty much anything else you can think of.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: No

OpenMetadata

OpenMetadata is a single place to discover your data, collaborate on it, and get it right. It supports comprehensive lineage for all data assets by capturing the relationships between entities. It traces the path of data across tables, pipelines, and dashboards, and its manual lineage feature lets users augment the lineage captured from machine metadata with their own knowledge.

Automatic discovery: No
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: Yes

Monte Carlo

Monte Carlo is a data observability platform that offers field-level data lineage, making it faster and easier to conduct root cause and impact analysis for critical data issues. With field-level lineage fully automated, data engineers and analysts can confidently make changes to tables without losing trust in, or visibility into, their data at each stage of its life cycle.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: No

Secoda

Secoda provides end-to-end data lineage across your entire data stack. It automates column- and table-level data lineage. In addition, Secoda brings tests, events, and ETL into data lineage. All of Secoda's lineage is automated, but users can also contribute to lineage manually using Secoda's API.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: Yes
Metadata management: Yes
Version control integration: Yes

Dremio

Dremio is a SQL Lakehouse Platform that offers effective data lineage support. The relationships between your data sources, virtual datasets, and all your queries are maintained in Dremio's data graph, which tells you exactly where each dataset came from. It provides self-service analytics with unified data access, a modern and intuitive UI, and a semantic layer, and it is built for SQL.

Automatic discovery: Yes
Data flow visualization: No
Environment: Online
Free edition: No
Metadata management: No
Version control integration: No

Microsoft Purview (formerly Azure Purview)

Microsoft Purview provides a unified data governance solution that offers many capabilities, one of which is showing lineage between datasets created by data processes. It supports automated asset-level lineage for datasets and processes, while manual lineage lets you document lineage metadata, without writing any code, for sources where automation isn't yet supported. Metadata collected in Microsoft Purview from enterprise data systems is stitched together to show end-to-end data lineage.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: Yes

Informatica Data Lineage

The Informatica Data Lineage tool provides automated end-to-end data lineage with detailed and summary views of data movement across data pipelines. With Informatica, you can derive lineage from code in SQL scripts, stored procedures, and AI/ML code. It streamlines tracking data flow from the system level down to the column level for detailed impact analysis.

Automatic discovery: Yes
Data flow visualization: Yes
Environment: Online
Free edition: No
Metadata management: Yes
Version control integration: Yes
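
Several of the tools above describe impact analysis and root cause analysis on top of column-level lineage. As a rough illustration of the idea only, and not of any vendor's implementation or API, lineage can be pictured as a directed graph of columns: impact analysis walks downstream from a column, and root cause analysis walks upstream. The column names, the EDGES list, and the helper functions in this sketch are all hypothetical.

```python
from collections import defaultdict, deque

# Hypothetical column-level lineage edges: (upstream column, downstream column).
# These names are made up for illustration and come from no real system.
EDGES = [
    ("raw.orders.amount", "staging.orders.amount_usd"),
    ("raw.orders.currency", "staging.orders.amount_usd"),
    ("staging.orders.amount_usd", "marts.revenue.total_usd"),
    ("marts.revenue.total_usd", "looker.revenue_dashboard.total"),
]


def build_graph(edges, reverse=False):
    """Build an adjacency map; reverse=True points edges upstream."""
    graph = defaultdict(set)
    for upstream, downstream in edges:
        if reverse:
            graph[downstream].add(upstream)
        else:
            graph[upstream].add(downstream)
    return graph


def reachable(graph, start):
    """Breadth-first search: every node reachable from start, excluding start."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append(neighbor)
    return seen


def impact_of(column, edges=EDGES):
    """Impact analysis: which columns could change if this column changes?"""
    return reachable(build_graph(edges), column)


def root_causes_of(column, edges=EDGES):
    """Root cause analysis: which upstream columns feed this column?"""
    return reachable(build_graph(edges, reverse=True), column)


if __name__ == "__main__":
    print(sorted(impact_of("raw.orders.currency")))
    print(sorted(root_causes_of("marts.revenue.total_usd")))
```

In this toy graph, a change to raw.orders.currency would flag the staging column, the revenue mart, and the Looker dashboard field as potentially affected. Real tools derive the edges automatically by parsing SQL and metadata, which this sketch does not attempt.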