Data quality tools

Dataedo

Dataedo is a metadata management and data catalog tool that helps you ensure data quality while documenting data. Its data profiling feature uses sample data to show what is stored in your data assets and whether it is of good quality. Its data community module lets everyone share observations and feedback on data through comments, ratings, questions, to-dos, and warnings.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No

Global IDs Data Quality Suite

Global IDs Data Quality Suite ensures data quality by establishing control points and read-only quality controls at the database level. It continuously monitors quality metrics and automates control generation for critical data elements across all kinds of data sources.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Trillium Quality

Trillium Quality rapidly transforms high-volume, disconnected data into trusted and actionable business insights with scalable enterprise data quality. It is designed to run natively in cloud or on-premises big data environments, ensuring your business information is integrated, fit for purpose, and accessible across the enterprise, regardless of volume.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Talend Open Studio for Data Quality

Talend Open Studio for Data Quality profiles your data and provides a graphical drill-down of the details. It is a free data quality tool that provides advanced data profiling, including fraud pattern detection using Benford's Law, advanced statistics with indicator thresholds, column set analysis, advanced matching analysis, and time column correlation analysis.

Commercial: -
Data cleansing: -
Data Discovery & Search: -
Data Profiling: -
Data standardization: -
Free edition: -
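
The Benford's Law check mentioned in the description above can be illustrated with a short sketch. This is not Talend's implementation; the function names are hypothetical and the chi-square statistic is just one common way to compare observed leading digits against the expected distribution log10(1 + 1/d).

```python
import math
from collections import Counter
from typing import Iterable, Optional


def benford_expected(digit: int) -> float:
    """Expected share of leading digit `digit` (1-9) under Benford's Law."""
    return math.log10(1 + 1 / digit)


def leading_digit(value: float) -> Optional[int]:
    """First significant digit of a non-zero finite number, else None."""
    number = abs(float(value))
    if number == 0 or not math.isfinite(number):
        return None
    # Scientific notation puts the first significant digit first,
    # e.g. 0.3 is formatted as "3.000000e-01".
    return int(f"{number:e}"[0])


def benford_chi_square(values: Iterable[float]) -> float:
    """Chi-square distance between observed leading digits and Benford's Law."""
    digits = [d for d in map(leading_digit, values) if d is not None]
    if not digits:
        raise ValueError("no non-zero values to test")
    counts = Counter(digits)
    total = len(digits)
    return sum(
        (counts.get(d, 0) - total * benford_expected(d)) ** 2
        / (total * benford_expected(d))
        for d in range(1, 10)
    )


if __name__ == "__main__":
    # Hypothetical inputs: powers of two track Benford's Law closely,
    # evenly spread leading digits do not.
    print(benford_chi_square([2 ** k for k in range(100)]))
    print(benford_chi_square(list(range(1, 10)) * 100))
```

With nine digits the statistic has 8 degrees of freedom, so values above roughly 15.5 would be unusual at the 5% level if the data really followed Benford's Law. A high score is a prompt for review, not proof of fraud.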

SAS Data Quality

SAS Data Quality is a comprehensive tool that meets all the data quality requirements of a business. It makes it easy to profile and identify problems, preview data, and set up repeatable processes to maintain a high level of data quality.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Open Source Data Quality and Profiling

Open Source Data Quality and Profiling is an open source project dedicated to data quality and data preparation. The project is developing a high-performance integrated data management platform intended to cover data integration, data profiling, data quality, data preparation, dummy data creation, metadata discovery, anomaly discovery, data cleansing, reporting, and analytics.

Commercial: Free
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: Yes

Talend Data Quality

As an integral part of Talend Data Fabric, Talend Data Quality profiles, cleans, and masks data in real time. It lets you quickly identify data quality issues, discover hidden patterns, and spot anomalies through summary statistics and graphical representations.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

DQLabs

DQLabs.ai is an augmented data quality platform for managing your entire data quality life cycle. With ML and self-learning capabilities, organizations can measure, monitor, remediate, and improve data quality across any type of data, all in one agile, innovative self-service platform.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No

Trifacta

Trifacta is an open and interactive cloud platform for data engineers and analysts to collaboratively profile, prepare, and pipeline data for analytics and machine learning. It automatically generates visual profiles of data based on its content. Every profile is interactive: selecting elements of the profile prompts transformation suggestions.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No

DataCleaner

DataCleaner is a premier open source data quality solution. At its heart is a strong data profiling engine for discovering and analyzing the quality of your data. It finds patterns, missing values, character sets, and other characteristics of your data values.

Commercial: Free
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: Yes
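
To make the profiling terms in the description above concrete (patterns, missing values, character sets), here is a minimal sketch of a column profile. It does not use DataCleaner's API; `value_pattern` and `profile_column` are hypothetical helpers, and the pattern notation (A for uppercase, a for lowercase, 9 for digits) is an assumed convention.

```python
from collections import Counter
from typing import Any, Dict, Iterable


def value_pattern(value: str) -> str:
    """Reduce a value to its shape: 'A' uppercase, 'a' lowercase, '9' digit."""
    shape = []
    for ch in value:
        if ch.isdigit():
            shape.append("9")
        elif ch.isalpha():
            shape.append("A" if ch.isupper() else "a")
        else:
            shape.append(ch)  # keep punctuation and spaces as they are
    return "".join(shape)


def profile_column(values: Iterable[Any]) -> Dict[str, Any]:
    """Summarize missing values, distinct values, patterns, and non-ASCII use."""
    values = list(values)
    # Treat None and blank strings as missing.
    present = [str(v) for v in values if v is not None and str(v).strip() != ""]
    patterns = Counter(value_pattern(v) for v in present)
    return {
        "count": len(values),
        "missing": len(values) - len(present),
        "distinct": len(set(present)),
        "top_patterns": patterns.most_common(3),
        "non_ascii": sum(1 for v in present if not v.isascii()),
    }


if __name__ == "__main__":
    # Hypothetical sample column of product codes.
    sample = ["AB-123", "CD-456", None, "", "ef-78", "Zoë-1"]
    for key, value in profile_column(sample).items():
        print(f"{key}: {value}")
```

A dominant pattern with a few outliers usually points at formatting problems, while a non-zero `non_ascii` count flags values worth checking for encoding issues.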

BigID

BigID offers actionable data intelligence by empowering you to find, classify, catalog, profile, and get context for all your data, in the cloud or data center, at rest or in motion, structured or unstructured, at petabyte scale. It lets you identify sensitive data using hundreds of pre-built NLP, deep learning, and pattern classifiers.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No

Ataccama ONE

Ataccama ONE offers self-driving data quality management by letting you quickly understand the state of your data, validate and improve it, prevent bad data from entering your systems, and continuously monitor data quality.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: Yes

Infogix Data360 DQ+

Infogix Data360 DQ+ is an enterprise data quality solution that automates data quality checks across the entire data supply chain, from the moment information enters your organization through the rest of its journey. By leveraging machine learning algorithms, you can score the likelihood of possible inaccuracies based on historical data characteristics and issue reconciliations.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: No
Free edition: No
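
The idea of scoring possible inaccuracies against historical data characteristics, described above, can be sketched with a simple statistical stand-in. This is not Infogix's method and uses no machine learning: it is a plain z-score of a current metric (for example a daily row count) against its history, with a hypothetical `anomaly_score` function and made-up numbers.

```python
import statistics
from typing import Sequence


def anomaly_score(history: Sequence[float], current: float) -> float:
    """How many standard deviations `current` lies from the historical mean."""
    if len(history) < 2:
        raise ValueError("need at least two historical observations")
    mean = statistics.mean(history)
    spread = statistics.stdev(history)  # sample standard deviation
    if spread == 0:
        # A perfectly constant history: any change is maximally surprising.
        return 0.0 if current == mean else float("inf")
    return abs(current - mean) / spread


if __name__ == "__main__":
    # Hypothetical daily row counts for one feed.
    daily_rows = [100, 102, 98, 101, 99]
    print(anomaly_score(daily_rows, 100))  # in line with history
    print(anomaly_score(daily_rows, 130))  # far outside the usual range
```

A threshold on this score (3 is a common rule of thumb) could decide which loads get flagged for review. A production system would likely use richer features and a trained model.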

Aperture Data Studio

Aperture Data Studio combines self-service data quality with globally curated data sets into a single data quality platform. This empowers modern data practitioners to build a consistent, accurate, and holistic view of their consumer data quickly and effortlessly. It lets you set custom workflows for data profiling, cleansing, validation, transformation, enrichment, and deduplication.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No
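
As a rough illustration of what a cleansing and deduplication workflow like the one described above does, the sketch below standardizes contact records and then removes duplicates. It is unrelated to Aperture Data Studio's own workflow engine; the record fields, the `standardize` and `deduplicate` helpers, and the choice of email as the match key are all assumptions.

```python
from typing import Dict, Iterable, List

Record = Dict[str, str]


def standardize(record: Record) -> Record:
    """Normalize whitespace and casing so equivalent values compare equal."""
    return {
        # Collapse repeated spaces, then title-case the name.
        "name": " ".join(record["name"].split()).title(),
        # Emails are compared case-insensitively here.
        "email": record["email"].strip().lower(),
    }


def deduplicate(records: Iterable[Record]) -> List[Record]:
    """Keep the first standardized record seen for each email address."""
    unique: Dict[str, Record] = {}
    for record in records:
        clean = standardize(record)
        unique.setdefault(clean["email"], clean)
    return list(unique.values())


if __name__ == "__main__":
    # Hypothetical consumer records with inconsistent formatting.
    contacts = [
        {"name": "  ada   lovelace", "email": "ADA@Example.com "},
        {"name": "Ada Lovelace", "email": "ada@example.com"},
        {"name": "alan turing", "email": "alan@example.com"},
    ]
    for contact in deduplicate(contacts):
        print(contact)
```

Standardizing before matching is the key design choice: without it, the first two records would look like different people.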

TIBCO Clarity

TIBCO Clarity is a data preparation, profiling, and cleansing tool. You can use it to discover, profile, cleanse, and standardize raw data collected from disparate sources, and to provide good-quality data for accurate analysis and intelligent decision-making.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No