Metadata Management tools for Databricks

List of metadata management tools

Metadata management tool is a solution that supplements the data stored by the enterprise environment with useful information. Proper metadata management is a crucial factor to make information searchable, easy to locate and understand. Such tools add meaningful context to raw data, making it convenient to discover even by non-IT members of an organization.

Dataedo

Dataedo centralizes metadata management by organizing, documenting, and categorizing data assets across multiple sources. Key features include Auto AI Documentation, Business Glossary, Data Classification, Data Lineage, and Data Profiling.

Business Glossary: Yes
Change history: Yes
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: On premises
Rating of assets: Yes
Reference data: Yes
Support for workflow: No
Data Dictionary
Data Lineage
Data Profiling
Data Classification
Business Glossary
ERD

Informatica Metadata Management

Informatica’s metadata management approach is designed to help enterprises fully harness the value of all their data with active metadata. It scans the metadata from all of the enterprise’s data systems including databases and filesystems, integration tools and processes, and analytics and data science tools. It discovers, classifies, and documents key data elements and provides detailed metadata and lineage to bridge technical and business context for data governance.

Business Glossary: Yes
Change history: Yes
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: Cloud
Rating of assets: Yes
Reference data: No
Support for workflow: No

Data3Sixty Govern

Infogix Data360 Govern is an enterprise metadata management, data governance, and data catalog solution that discovers the quality, value, and trustworthiness of your data sets. It enables you to quickly crawl, profile, score and manage complex metadata. You’ve then built a single, searchable inventory of data assets for future use.

Business Glossary: Yes
Change history: No
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: On premises
Rating of assets: Yes
Reference data: No
Support for workflow: Yes

erwin Data Catalog

erwin Data Catalog (erwin DC) is metadata management software that helps organizations learn what data they have and where it’s located, including data at rest and in motion. It tells you the data and metadata available for a certain topic so those particular sources and assets can be found quickly for analysis and decision-making.

Business Glossary: Yes
Change history: Yes
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: On premises
Rating of assets: No
Reference data: Yes
Support for workflow: Yes

Ataccama Metadata Management & Data Catalog

Ataccama Metadata Management & Data Catalog is an AI-powered metadata management module. It’s a central storage for all of your metadata—imported from other sources, crowdsourced, or automatically captured in continuous data discovery processes.

Business Glossary: Yes
Change history: Yes
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: On premises
Rating of assets: No
Reference data: No
Support for workflow: No

Octopai

Octopai automates metadata management and analysis, enabling enterprise BI groups to quickly, easily and accurately find and understand their data for improved operations, data quality and data governance.

Business Glossary: Yes
Change history: No
Data Classification: No
Data Lineage: Yes
Data Profiling: No
On premises/cloud: Cloud
Rating of assets: -
Reference data: No
Support for workflow: No

Alteryx Connect

Alteryx provides metadata management that natively extracts and refreshes field names, schemas, and more to speed integration and flex with change. It automatically loads field names, types, objects, schemas, and relationships from common sources to speed analytics and cataloging. 

Business Glossary: Yes
Change history: No
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: Cloud
Rating of assets: Yes
Reference data: Yes
Support for workflow: Yes

Atlan

Atlan ML driven metadata management approach automatically scans, discovers and tags data assets across your data ecosystem and constructs relationships via SQL parsing, and a bot based ecosystem. This knowledge graph engine, then manifests in an easy to use data discovery tool, data catalog and business glossary that allows data consumers to find, understand and trust data in one collaborative workspace.

Business Glossary: Yes
Change history: Yes
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: Cloud
Rating of assets: Yes
Reference data: Yes
Support for workflow: Yes

Alation Data Catalog

Alation increases the value of your metadata with machine learning, automation, and human knowledge. Hundreds of organizations use Alation to power metadata solutions, such as analytics, data governance, privacy, risk & compliance, and cloud migration.

Business Glossary: Yes
Change history: Yes
Data Classification: No
Data Lineage: Yes
Data Profiling: Yes
On premises/cloud: Cloud
Rating of assets: Yes
Reference data: Yes
Support for workflow: Yes

Select Star

Select Star automatically catalogs & documents your database tables and BI dashboards. You can find out where your data is coming from, which dashboards are built on top of it, who is using the data, and how they are using it.

Business Glossary: Yes
Change history: Yes
Data Classification: Yes
Data Lineage: Yes
Data Profiling: No
On premises/cloud: Cloud
Rating of assets: Yes
Reference data: Yes
Support for workflow: No

Metadata management tools are usually multifunctional programs that provide a wide spectrum of usability. They include functionalities such as:
• Data catalog,
• compatibility with multiple connectors, making the tool a single source of truth about the data from different repositories,
• Business glossary,
• Data lineage,
• Data profiling,
• Impact analysis,
• Metadata ingestion and translation.

From the organization's point of view, the ability to export the created documentation into user-friendly formats is also important. What is more, some of the tools offer a community module, which facilitates the information flow.

Metadata management solutions oversee data across its entire lifecycle. This typically covers four primary areas: data analysis, data value, data governance, risk and compliance.

 Proper metadata management implementation allows for standardized metadata definitions, management, and maintenance of information across the organization for greater business efficiency. 

 Today, as data becomes increasingly important to the growth of company performance, choosing the right way to manage it is equally essential.

To help you find the right tool for your organization, we have put together this list of best metadata management solutions.