Data dictionary tools for Apache Parquet

List of tools that enable design and building of data dictionaries .

Data Dictionary is a set of important information about data used within an organization (metadata). This information includes names, definitions, and attributes about data, owners, and creators of assets. Data Dictionary tools provide insights into meaning and purposes of data elements. They add useful aliases about the scope and characteristics of data elements, as well as the rules for their usage and application.

Dataedo

Dataedo allows you to connect and scan metadata from multiple sources and build data dictionary automatically in a couple of minutes.

Desktop/Cloud: Desktop
ER Diagram: Yes
Export: HTML,MS Excel,PDF
Metadata stored in: Documentation repository/file
Commercial: Commercial
Free edition: No
Notable features: ER diagrams, metadata repository, schema change tracking, organizing with modules, documenting missing FKs, custom fields, description suggestions, documentation progress tracking, rich text with images
Runs on: (for desktop): Mac OS,Windows
Dataedo Data Catalog
Dataedo Data Profiling
Dataedo Data Search
Dataedo ERD

Key functionality of Data Dictionary tools is to give users the ability to document data. Moreover, very important is the possibility to create a collection of multiple repositories, based on different system engines. For a better understanding of the data, some tools allow visualization of the data structure using ERD (Entity-Relationship Diagrams).

From the organization's point of view, a community module within a data dictionary tool proves to be useful. It facilitates the proper information flow, as well as provides sharing opinions on specific objects among the members of an organization.

Nowadays, data discovery and understanding becomes crucial for proper organization performance. There are many benefits to using Data Dictionaries, such as:
• helps avoid data inconsistencies problems,
• it allows introducing unified nomenclature used in the project,
• Make data searchable, and understandable,
• Create a single source of truth about the data from different repositories,

The prepared list includes simple, open-source data dictionaries as well as more advanced software.