SQL Lineage Tools
Dataedo
Dataedo allows you to extract lineage automatically or design flows manually and visualize how data moves through the system with interactive diagrams.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | On-premises |
Free edition: | |
Metadata management: | |
Version control integration: |
SQLFlow
SQLFlow is an online SQL data lineage tool that visually represents the overall flow of data. It offers automated SQL data lineage analysis across Databases, ETL, Business Intelligence, Cloud, and Hadoop environments by parsing SQL Script and stored procedure. It enables impact analysis at a granular level, drilling down into table, column, and query-level lineage.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Collibra Data Lineage
Collibra Data Lineage extracts and maintains lineage automatically from source systems, SQL dialects, ETL tools, and BI tools. It provides detailed technical lineage at the table, column, transformation, and SQL query levels to quickly understand the impact of potential changes. Moreover, its interactive lineage diagram shows a summary of lineage from source to destination.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Alvin
Alvin provides column-level highly accurate lineage and usage data across your entire data stack, and a host of features that help your team deliver measurable business impact. It lets you search and filter your assets by usage, last used, tags, data sensitivity, parent, platform, type, any pretty much anything else you can think of.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Octopai
Octopai Data Lineage XD is the most complete, in-depth, and automated lineage tool that provides end-to-end lineage at the system level from the entry point into the data landscape, all the way to reporting and analytics. Its cross-system lineage reflects data flows and dependencies using automated and augmented methods to provide the most extensive cross-system view of the entire data landscape.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
OvalEdge
OvalEdge provides a powerful and automated way to visualize your Data Flow Effortlessly. Its algorithms build the lineage from the most accurate source, source code. It crawls the source code from various connectors. After that, it parses its language like SQL, PL/SQL, XML, etc., and then builds the lineage.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Aggua
Aggua Complete end-to-end mapping of your data pipeline automatically from source to target with column-level dependencies. Its lineage not only shows the flow but also the important events in the flow. Track and map the transformation (calculations, alias), see the latest version changes, and identify PII on a column level.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
OpenMetadata
OpenMetadata is a single place to discover, collaborate and get your data right. It supports a comprehensive lineage for all data assets by capturing the relation between entities. It traces the path of data across tables, pipelines, and dashboards, while the manual lineage helps augment the lineage captured from machine metadata with user knowledge.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Datafold
Datafold provides plug and play column-level lineage for the modern data stack. It analyzes every SQL statement in your data warehouse and produces a graph of dependencies. It provides a high-level overview of your pipelines, zoom in on particular tables, trace flow on a columnal level, and see the SQL statements for each step.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Monte Carlo
Monte Carlo is a data observability platform that offers field-level data lineage functionality, making it faster and easier to conduct root cause and impact analysis for critical data issues. With field-level lineage fully automated, data engineers and analysts can confidently make changes to tables without losing trust and visibility in their data at each stage of its life cycle.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Engrafo
Engrafo automatically creates data catalogs, data flow diagrams, and data lineage. It can extract data lineage from all common SQL dialects and generate flow charts to create overview and information about critical ETL flows. Moreover, it extracts metadata from your BI Tools and automatically creates data lineage back to the data catalogs.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Secoda
Secoda provides end-to-end data lineage across your entire data stack. It automates column and table level data lineage. In additional, Secoda also brings in tests, events, and ETL into data lineage. All of Secoda's lineage is automated, but users can also manually contribute to lineage using Secoda's API.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
Dremio
Dremio is a SQL Lakehouse Platform that offers effective data lineage support, as the relationships between your data sources, virtual datasets, and all your queries are maintained in Dremio’s data graph, telling you exactly where each dataset came from. It provides self-service analytics with unified data access, modern and intuitive U/I, semantic layer, and built for SQL.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | Online |
Free edition: | |
Metadata management: | |
Version control integration: |
MANTA
Manta offers a powerful scanner for the Microsoft SQL Server technology. Once configured, Manta can automatically connect to the MS SQL resource for extracting and analyzing the pertinent metadata within the selected databases. This metadata includes but is not limited to tables, views, indexes, SQL procedures, schemas, databases, columns, and triggers. Then, Manta can parse all the SQL programming code and logic stored within. This allows Manta to generate lineage down to the column level while showing all transformation logic associated with individual column elements.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | On-premises |
Free edition: | |
Metadata management: | |
Version control integration: |
Tokern
Tokern is a simple to use open source data lineage engine. It automates data engineering tasks with column-level data lineage. You can use the API or library to access column-level lineage and automate data quality triage, scan and tag PII/PHI/sensitive data, programmatically monitor and manage ACLs, data and ETL pipeline cleanup, and impact analysis.
Automatic discovery: | |
---|---|
Data flow visualization: | |
Environment: | On-premises |
Free edition: | |
Metadata management: | |
Version control integration: |