Data catalog tools for Datastax
List of data catalogs tools
Data catalog is a structured collection of data used by an organization. It is a kind of data library where data is indexed, well-organized, and securely stored. Most data catalog tools contain information about the source, data usage, relationships between entities as well as data lineage. This provides a description of the origin of the data and tracks changes in the data to its final form.
IBM Watson Knowledge Catalog
IBM Watson® Knowledge Catalog is an open and intelligent data catalog for managing enterprise data and AI model governance, quality and collaboration. It enables you to organize, define and manage enterprise data to provide the right context to drive value across imperatives like regulatory compliance to data monetization.
Automated Cataloging: | |
---|---|
Business Glossary: | |
Commenting/Community: | |
Commercial: | Commercial |
Data Classification: | |
Data Lineage: | |
Data Profiling: | |
Export: | JSON,MS Excel,XML |
Free edition: | |
Rating of assets: |
Talend Data Catalog
Talend Data Catalog automatically crawls, profiles, organizes, links, and enriches all your metadata. It makes easy to search and access data, then verify its validity before sharing with peers. Up to 80% of the information associated with the data is documented automatically and kept up-to-date through smart relationships and machine learning, continually delivering the most meaningful data to the user.
Automated Cataloging: | |
---|---|
Business Glossary: | |
Commenting/Community: | |
Commercial: | Commercial |
Data Classification: | |
Data Lineage: | |
Data Profiling: | |
Export: | CSV,JSON,MS Excel,XML |
Free edition: | |
Rating of assets: |
Data catalogs are part of data management tools. They enable automatic metadata management with user-friendly form that makes data easy to understand even for non-IT members of the organisation.
The key feature of data catalogs is to provide metadata context to the user in a way that allows different teams within the organization (both IT and Non-IT) to discover and understand relevant data.
From the organization's perspective, the important functions of data catalog tools are also:
• storage of data resources from different repositories as well as from different engine systems - compatibility with multiple connectors,
• automation of data management processes,
• advanced resource search by name, type, date of change, owner, etc.
• data lineage,
• automated data Classification,
• Discovering data relationship and dependencies between objects,
• Business Glossary, unifying nomenclature and definitions of terms,
• Data Profiling,
Data stewards, business teams, and data analysts often struggle with the problem of what specific data means, where it comes from, and which elements it is directly related to. These are just a few problems for which Data catalog tools have been created. Based on the imported repositories, data catalogs enable automated cataloging and organizing of data, solving the problem of time-consuming querying of the resources.
To avoid misunderstandings data catalog tools provide a Business Glossary, through which the nomenclature is systematized. It contains business terms along with their definition, relationship to each other, as well as its location in the hierarchy of all data assets.
There are many apps for data catalog tasks on the market. We have listed complex data cataloging software that can also solve data profiling, data lineage, and data classification problems, as well as open-source data catalog tools.