Data discovery tools for Amazon S3

Data enables companies to make informed decisions and understand needs of the customer and the business. In order to leverage data, data discovery tools help organizations visualize, analyze and get insights from it. They start by collecting data from multiple sources to then consolidate, classify, and catalog it into a single repository to simplify its evaluation.

Dataedo

Dataedo is a powerful data governance & data catalog solution. With its intuitive interface, Dataedo enables organizations to locate, understand, and leverage their data assets efficiently. By promoting seamless data discovery, Dataedo enhances collaboration and innovation, empowering users to make informed decisions effortlessly.

Business Glossary: Yes
Data Lineage: Yes
Data Profiling: Yes
Data visualization: No
Free edition: No
Metadata management: Yes
Notifications: Yes
Dataedo Data Catalog
Dataedo Sensitive Data Discovery
Dataedo Business Glossary
Dataedo Data Lineage
Dataedo Data Profiling
Dataedo Data Search

OneTrust

OneTrust offers automated data discovery in unstructured file shares, structured databases, Big Data storage, SaaS applications, and other cloud solutions. It leverages advanced ML-based classification to label and tag data with both out-of-the-box and custom classifiers. Moreover, it identifies at-risk data, implements security controls, and monitors over time with advanced analytics.

Business Glossary: Yes
Data Lineage: Yes
Data Profiling: Yes
Data visualization: Yes
Free edition: No
Metadata management: Yes
Notifications: No

Zeenea

Zeenea provides an intelligent data discovery platform that provides a multi-dimensional search engine, able to retrieve the right information whether you know what you are looking for (high intent) or not (low intent). It automatically indexes and updates your data from all your data sources.

Business Glossary: Yes
Data Lineage: Yes
Data Profiling: No
Data visualization: Yes
Free edition: No
Metadata management: Yes
Notifications: Yes

Datapine

Datapine is a modern business intelligence platform that provides powerful features for data discovery and to extract more insights from the data. It provides one central place to view all data with features like fast & easy data connectors, powerful & interactive BI dashboards, and modern predictive analytics tools.

Business Glossary: No
Data Lineage: No
Data Profiling: No
Data visualization: Yes
Free edition: No
Metadata management: No
Notifications: Yes

Domo

Domo provides AI-powered and automated data discovery with a simple drag-and-drop interface, making complex datasets consumable with a few clicks. Moreover, it allows to create customizable, interactive dashboards to fit any business scenario with Variables. Variables are custom-defined fields that users can adjust inside dashboards.

Business Glossary: No
Data Lineage: Yes
Data Profiling: No
Data visualization: Yes
Free edition: Yes
Metadata management: No
Notifications: Yes

Ohalo Data X-Ray

Ohalo Data X-Ray provides unstructured data discovery by acting as a centralized tool that offers native and bespoke connectors to support modern and legacy data sources. It can scan across multiple unstructured data repositories simultaneously in seconds and identify data assets at risk. Moreover, it consistently integrates and auto-populates data catalogs to build an inventory of data and metadata for diverse teams to use.

Business Glossary: No
Data Lineage: No
Data Profiling: No
Data visualization: Yes
Free edition: No
Metadata management: Yes
Notifications: No

Dataiku

Dataiku interactively explores data and creates statistical analyses, charts, and dashboards to share insights with the broader team. It provides automatic profiling of columns, including the counts and distribution of values, top values, outliers, invalids, and summary statistics. Moreover, it includes a broad range of statistical tests and analyses, including univariate / bivariate / multivariate analyses, fit curves and distributions, location and distribution tests, and time series-specific analyses.

Business Glossary: No
Data Lineage: Yes
Data Profiling: Yes
Data visualization: Yes
Free edition: No
Metadata management: Yes
Notifications: No

erwin Data Intelligence

erwin Data Intelligence combines automation with market-leading data cataloging, data stewardship, and self-service data discovery in a single software suite so your enterprise can discover data, harvest data, structure and deploy data sources, analyze metadata, assess and manage data quality, and do a lot more. With erwin Data Intelligence, you can automate the discovery, assessment, and governance of enterprise data assets.

Business Glossary: Yes
Data Lineage: Yes
Data Profiling: Yes
Data visualization: Yes
Free edition: No
Metadata management: Yes
Notifications: Yes

Ketch

Ketch is a dynamic, integrated data discovery and classification toolset. It is always on and constantly scanning internal data systems and classifying data sets using responsive machine learning models, giving you a complete and growing picture of data across your organization. Ketch’s user interface gives your administrators secure, centralized data control across your various data systems.

Business Glossary: No
Data Lineage: Yes
Data Profiling: No
Data visualization: No
Free edition: No
Metadata management: No
Notifications: No

Data enables companies to make informed decisions and understand needs of the customer and the business. In order to leverage data, data discovery tools help organizations visualize, analyze and get insights from it. They start by collecting data from multiple sources to then consolidate, classify, and catalog it into a single repository to simplify its evaluation.

Collecting, cleaning and preparing data in order to gain insights from it is crucial for any business. Data discovery tools enable organizations to look deeper into data, get important insights and share them across the whole company. But at the same time, they help prevent exposure of sensitive data, and enable the organization to implement appropriate security measures.

Other features data discovery tools provide with are:

Machine learning capabilities, including predictive analytics

In-memory analytics, enabling faster query response times

Data preparation and tools to improve data quality

Metadata management

In order to help you find the right data discovery tool for your organization, we’ve listed some of the best softwares that will empower your team to detect informative patterns and extract valuable insights.