Data catalogs

List of data catalog tools

A data catalog is a structured collection of data used by an organization. It is a kind of data library where data is indexed, well organized, and securely stored. Most data catalog tools contain information about the source of the data, its usage, the relationships between entities, and data lineage. Data lineage describes the origin of the data and tracks the changes it undergoes on the way to its final form.
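To make the idea of data lineage concrete, here is a minimal sketch of how a catalog could record where an asset comes from and walk back to its origins. The CatalogEntry class, its fields, the trace_lineage function, and the sample asset names are all hypothetical and do not reflect how any tool listed here stores lineage.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    """One cataloged data asset (all field names are illustrative)."""
    name: str
    source: str  # system the asset lives in
    derived_from: list[str] = field(default_factory=list)  # direct upstream assets


def trace_lineage(catalog: dict[str, CatalogEntry], name: str) -> list[str]:
    """Return every upstream asset of `name`, nearest first (breadth-first)."""
    seen: list[str] = []
    queue = list(catalog[name].derived_from)
    while queue:
        current = queue.pop(0)
        if current in seen:
            continue  # already visited, e.g. shared upstream asset
        seen.append(current)
        if current in catalog:
            queue.extend(catalog[current].derived_from)
    return seen


# Made-up sample data: a report built from a cleaned table built from a raw table.
catalog = {
    "raw_orders": CatalogEntry("raw_orders", "erp_db"),
    "clean_orders": CatalogEntry("clean_orders", "warehouse", ["raw_orders"]),
    "sales_report": CatalogEntry("sales_report", "bi_tool", ["clean_orders"]),
}

print(trace_lineage(catalog, "sales_report"))  # ['clean_orders', 'raw_orders']
```

Real tools typically also record the transformations applied at each step; this sketch keeps only the dependency links.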

Collibra Catalog

Collibra Catalog empowers business users to quickly discover, understand, contribute, and govern the data that matters so they can generate impactful insights that drive business value. It also allows data stewards to certify datasets so that business users can trust the data that they use in their analysis.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: MS Excel
Free edition: No
Rating of assets: Yes

Alation

Alation is the data catalog tool where everyone in your organization can find the data they need to collaborate. Alation's data catalog uses AI to capture the rich context of enterprise data, including relationships between data sets, analyst usage & trusted insights. In short, Alation is a complete repository for all the data assets & data knowledge in your organization.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: -
Free edition: No
Rating of assets: -

Dataedo

Dataedo enables you to catalog, document and understand your data with data dictionary, business glossary and ERDs. Dataedo will help you document your existing relational databases. It reads your schema and lets you easily describe each data element (tables and columns - Data Dictionary) and other database objects (like triggers, stored procedures, etc.).

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: No
Data Profiling: No
Export: HTML,MS Excel,PDF
Free edition: No
Rating of assets: No

Informatica Enterprise Data Catalog

Informatica Enterprise Data Catalog is a machine learning-based data catalog that lets you classify and organize data assets across any environment to maximize data value and reuse, and provides a metadata system of record for the enterprise. It automatically scans and catalogs data across the enterprise, indexing it for enterprise-wide discovery using simple, Google-like search.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: CSV,Plain text
Free edition: No
Rating of assets: Yes

Lumada Data Catalog

Lumada Data Catalog software leverages AI, machine learning, and patented fingerprinting technology to automate the discovery, classification, and management of your enterprise data. It simplifies access and promotes collaboration, allowing an organization to use its data more intelligently.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: JSON,XML
Free edition: No
Rating of assets: Yes

Ataccama Metadata Management & Data Catalog

Ataccama ONE Data Catalog is an AI-powered metadata management module. It is a central store for all of your metadata, whether imported from other sources, crowdsourced, or automatically captured in continuous data discovery processes.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: CSV
Free edition: No
Rating of assets: No

OvalEdge

OvalEdge is a data catalog tool that automatically organizes and catalogs your data using machine learning and advanced algorithms. You can organize data using tags, usage statistics, user names, and other markers, so it is easily retrievable with everyday language.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: MS Excel
Free edition: No
Rating of assets: No

Alteryx Connect

Alteryx Connect is a social data cataloging and data exploration platform for the enterprise. The powerful data cataloging provided by Alteryx Connect centralizes business terms and definitions, metrics, and information assets for maximum consistency, discoverability, and collaboration.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: CSV,MS Excel,PDF
Free edition: No
Rating of assets: Yes

Truedat

Truedat is an open-source data cataloging and governance tool that lets you quickly unify and explore combined metadata from different sources in a single interface. It lets you organize and enrich information through configurable workflows and monitor data governance activity.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: No
Commercial: Free
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: CSV
Free edition: Yes
Rating of assets: No

Io-Tahoe

Io-Tahoe is an enterprise smart data discovery and AI-driven data catalog product that enables enterprises to accelerate to next-generation data management practices, radically improving data governance and regulatory compliance. Population of the data catalog is automated by using artificial intelligence and leveraging the discovery functionality and natural language analysis to automatically tag data.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: CSV,JSON,MS Excel,Plain text,XML
Free edition: No
Rating of assets: Yes

Dawizz MYDATACATALOGUE

MYDATACATALOGUE is a data catalog and mapping tool that uses smart algorithms to list your information system’s data, giving you a better understanding of your data assets. It accurately identifies the location of your data, and by cataloging your data sources, applications, and processes, it lets you intuitively search for the information you need.

Automated Cataloging: Yes
Business Glossary: No
Commenting/Community: No
Commercial: Commercial
Data Classification: No
Data Lineage: No
Data Profiling: Yes
Export: -
Free edition: No
Rating of assets: No

Azure Data Catalog

Azure Data Catalog is an enterprise-wide metadata catalog that makes data asset discovery straightforward. It’s a fully-managed cloud service that lets any user (analyst, data scientist, or developer) register, enrich, discover, understand, and consume data sources.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: No
Data Lineage: No
Data Profiling: Yes
Export: JSON
Free edition: Yes
Rating of assets: No

Qlik Data Catalyst

Qlik Data Catalyst is a metadata driven data catalog that has technical and business descriptions, data profiles, data lineage, and data tags that make data search and delivery simple. It builds a secure, enterprise-scale catalog of all the data your organization has available for analytics, no matter where it is.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: CSV,JSON,XML
Free edition: No
Rating of assets: Yes

erwin Data Catalog

erwin Data Catalog (erwin DC) automates the processes involved in harvesting, integrating, activating and governing enterprise data according to business requirements. To summarize, erwin DC creates and maintains a sustainable metadata foundation for data preparation, management, governance and consumption, automating manual tasks to increase efficiencies, quality and time to value for data development and deployment.

Automated Cataloging: Yes
Business Glossary: Yes
Commenting/Community: Yes
Commercial: Commercial
Data Classification: Yes
Data Lineage: Yes
Data Profiling: Yes
Export: MS Excel,PDF
Free edition: No
Rating of assets: No

Data catalogs are a category of data management tools. They enable automatic metadata management and present metadata in a user-friendly form that makes data easy to understand even for non-IT members of the organization.

The key feature of a data catalog is that it provides metadata context in a way that allows different teams within the organization (both IT and non-IT) to discover and understand relevant data.

From the organization's perspective, other important functions of data catalog tools include:
• storage of data resources from different repositories and different database engines, with connectors for multiple systems,
• automation of data management processes,
• advanced resource search by name, type, date of change, owner, etc.,
• data lineage,
• automated data classification,
• discovery of relationships and dependencies between objects,
• a business glossary that unifies nomenclature and definitions of terms,
• data profiling.
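As an illustration of resource search by name, type, owner, and date of change, the following sketch filters a small in-memory list of assets. The Asset class, the search function, its parameter names, and the sample records are assumptions made for this example only; actual products expose search through their own interfaces.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import date
from typing import Iterable, Optional


@dataclass
class Asset:
    """A cataloged resource; the field names are illustrative assumptions."""
    name: str
    asset_type: str   # e.g. "table" or "report"
    owner: str
    changed_on: date  # date of the last change


def search(
    assets: Iterable[Asset],
    name_contains: Optional[str] = None,
    asset_type: Optional[str] = None,
    owner: Optional[str] = None,
    changed_since: Optional[date] = None,
) -> list[Asset]:
    """Return the assets that match every filter that was supplied."""
    results = []
    for asset in assets:
        if name_contains is not None and name_contains.lower() not in asset.name.lower():
            continue
        if asset_type is not None and asset.asset_type != asset_type:
            continue
        if owner is not None and asset.owner != owner:
            continue
        if changed_since is not None and asset.changed_on < changed_since:
            continue
        results.append(asset)
    return results


# Made-up sample records.
assets = [
    Asset("customer_orders", "table", "sales", date(2023, 5, 1)),
    Asset("customer_churn", "report", "analytics", date(2024, 1, 15)),
    Asset("product_prices", "table", "sales", date(2024, 3, 2)),
]

hits = search(assets, name_contains="customer", changed_since=date(2024, 1, 1))
print([a.name for a in hits])  # ['customer_churn']
```

Filters left as None are ignored, so the same function covers a search by a single attribute or by any combination of them.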

Data stewards, business teams, and data analysts often struggle to work out what specific data means, where it comes from, and which elements it is directly related to. These are just a few of the problems that data catalog tools were created to solve. Based on the imported repositories, data catalogs automate the cataloging and organizing of data, removing the need for time-consuming manual querying of those resources.

To avoid misunderstandings, data catalog tools provide a business glossary that systematizes nomenclature. It contains business terms along with their definitions, their relationships to each other, and their location in the hierarchy of all data assets.
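A business glossary of this kind can be pictured as a set of terms, each with a definition, links to related terms, and an optional parent that places it in a hierarchy. The sketch below is only one possible representation; the Term class, the hierarchy_path function, and the sample terms and definitions are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Term:
    """A glossary entry; the structure is an illustrative assumption."""
    name: str
    definition: str
    parent: Optional[str] = None  # broader term, if any
    related: list[str] = field(default_factory=list)  # non-hierarchical links


def hierarchy_path(glossary: dict[str, Term], name: str) -> list[str]:
    """Return the path from the top-level term down to `name`."""
    path = [name]
    while glossary[path[-1]].parent is not None:
        path.append(glossary[path[-1]].parent)
    return list(reversed(path))


# Made-up sample glossary.
glossary = {
    "Customer": Term("Customer", "A party that buys goods or services."),
    "Active customer": Term(
        "Active customer",
        "A customer with at least one order in the last 12 months.",
        parent="Customer",
        related=["Churned customer"],
    ),
    "Churned customer": Term(
        "Churned customer",
        "A customer with no orders in the last 12 months.",
        parent="Customer",
        related=["Active customer"],
    ),
}

print(hierarchy_path(glossary, "Active customer"))  # ['Customer', 'Active customer']
```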

There are many data catalog applications on the market. We have listed comprehensive data cataloging software that also covers data profiling, data lineage, and data classification, as well as open-source data catalog tools.