Data lineage tools for MySQL

Data lineage tools are software that allows to extract, view and analyze data lineage. Data lineage is the process of understanding and visualizing data flow from the source to different destinations. It allows to create a map of the data journey through the entire ecosystem.

Secoda

Secoda is a data discovery tool that offers an intuitive, collaborative, and easy to implement data discovery built. It automatically extracts queries to generate data lineage. Currently, it is supporting table lineage for Snowflake, dbt, Redshift, and BigQuery, with support for Postgres, MySQL, and Microsoft SQL Server coming soon. Secoda data lineage can help data teams identify the downstream and upstream dependencies of a table easily. On each dependency, you will be able to see how many levels away a particular table is, with the ability to view the data in a visual form coming soon.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: No
Free edition: No
Hadoop: No
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

erwin Data Catalog

erwin Data Catalog automates enterprise metadata management, data mapping, code generation, and data lineage for faster time to value and greater accuracy for data movement and deployment projects. It lets you generate on-demand lineage down to the column level and visualize data flows from source systems all the way to the reporting layers, including all transformations. Fully configurable and navigable lineage diagrams provide high-level business views as well as detailed technical depictions.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: Yes
Data warehouses lineage: Yes
ETLs: Yes
Free edition: No
Hadoop: No
NoSQL: Yes
Pipelines lineage: No
RDBMS: Yes

Kylo

Kylo is an open source enterprise-ready data lake management software platform. It lets you search and explore data and metadata, view lineage, and profile statistics. Visual process lineage and provenance provide confidence in the origin of data. Automatic data profiling provides capabilities for data scientists and assurance in data quality.

BI Tools lineage: No
Commercial: Free
Data migration tools lineage: No
Data warehouses lineage: No
ETLs: No
Free edition: Yes
Hadoop: Yes
NoSQL: No
Pipelines lineage: No
RDBMS: Yes

Keboola

Keboola is a cloud-based data integration platform that helps clients combine, enhance, and publish crucial information for their internal analytics projects and data products in a quick and easy fashion. It collects all kinds of operational metadata, describing user activity, job activity, data flow, schema evolution, data pipeline performance, compliance with a client’s security rules, etc. Based on the metadata, we are able to build data lineage on the fly and automatically. This makes it possible to understand where the data is coming from and how it is used, both for analytical and regulatory purposes.

BI Tools lineage: No
Commercial: Commercial
Data migration tools lineage: No
Data warehouses lineage: Yes
ETLs: Yes
Free edition: Yes
Hadoop: Yes
NoSQL: Yes
Pipelines lineage: Yes
RDBMS: Yes

Data lineage forms the foundation for accurate data analytics and management. The core features of data lineage focuses on:

• Identifying data quality issues.
• Performing root cause analysis.
• Enabling to understand which data sources are outdated or which datasets are relevant.
• Minimizing the risk of migration projects.
• Providing transparency over the life cycle of data.

Data lineage tools map the data flow and help you understand where the data originated, how it flows and transforms. To help you find the right tool for your company, we prepared a list that includes some of the best data lineage tools.