Data quality tools

Dataedo

Dataedo is a metadata management and data catalog tool that helps you ensure data quality while documenting data. Its data profiling feature uses sample data to show what is stored in your data assets and whether it is of good quality. It also has a data community module, which lets everyone share observations and feedback on data through comments, ratings, questions, to-dos, and warnings.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No

Global IDs Data Quality Suite

Global IDs Data Quality Suite ensures data quality by establishing control points and read-only quality controls at the database level. It continuously monitors quality metrics and automates control generation for critical data elements across all kinds of data sources.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Trillium Quality

Trillium Quality rapidly transforms high-volume, disconnected data into trusted and actionable business insights with scalable enterprise data quality. It has been designed to run natively in cloud or on-premises big data environments, ensuring your business information is integrated, fit-for-purpose, and accessible across the enterprise, regardless of volume.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Talend Open Studio for Data Quality

Talend Open Studio for Data Quality profiles your data and provides a graphical drill-down of the details. It is a free data quality tool that provides advanced data profiling, including fraud pattern detection using Benford's Law, advanced statistics with indicator thresholds, column set analysis, advanced matching analysis, and time column correlation analysis.

Commercial: -
Data cleansing: -
Data Discovery & Search: -
Data Profiling: -
Data standardization: -
Free edition: -
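
To illustrate the Benford's Law check mentioned above: the law predicts that in many naturally occurring numeric datasets the leading digit d appears with probability log10(1 + 1/d), so large deviations can flag values worth reviewing. The following is a minimal Python sketch of that idea, not Talend's implementation; the function names are hypothetical.

```python
import math
from collections import Counter


def benford_expected():
    """Expected leading-digit probabilities under Benford's Law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(value):
    """Return the leading non-zero digit of a number, or None for zero."""
    # Scientific notation puts the leading digit first, e.g. '1.234...e+03'.
    digit = int(f"{abs(value):.15e}"[0])
    return digit if digit != 0 else None


def benford_deviation(values):
    """Observed minus expected frequency for each leading digit 1-9."""
    digits = [d for d in map(first_digit, values) if d is not None]
    if not digits:
        raise ValueError("no non-zero values to analyze")
    counts = Counter(digits)
    expected = benford_expected()
    return {d: counts.get(d, 0) / len(digits) - expected[d] for d in range(1, 10)}
```

Digits with a large positive deviation occur more often than the law predicts. A deviation alone is not proof of fraud, only a prompt for closer inspection.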

SAS Data Quality

SAS Data Quality is a comprehensive tool that meets all the data quality requirements of a business. It makes it easy to profile and identify problems, preview data, and set up repeatable processes to maintain a high level of data quality.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Oracle Data Profiling and Data Quality for Data Integrator

Oracle Data Quality for Data Integrator is a comprehensive, award-winning data quality platform that meets even the most complex data quality requirements. Oracle Data Quality addresses the enterprise data quality needs of all projects, including data warehousing and business intelligence, master data management, data integration, migration, and service-oriented integration processes.

Commercial: Free
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: Yes

Open Source Data Quality and Profiling

The Open Source Data Quality and Profiling tool is an open source project dedicated to data quality and data preparation. The project is developing a high-performance, integrated data management platform intended to handle data integration, data profiling, data quality, data preparation, dummy data creation, metadata discovery, anomaly discovery, data cleansing, reporting, and analytics.

Commercial: Free
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: Yes

IBM InfoSphere QualityStage

IBM InfoSphere QualityStage is designed to support your data quality and information governance initiatives. It enables you to investigate, cleanse and manage your data, helping you maintain consistent views of key entities. It provides capabilities including data profiling, standardization, probabilistic matching, and data enrichment.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Talend Data Quality

As an integral part of Talend Data Fabric, Talend Data Quality profiles, cleans, and masks data in real time. It lets you quickly identify data quality issues, discover hidden patterns, and spot anomalies through summary statistics and graphical representations.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

DQLabs

DQLabs.ai is an augmented data quality platform to manage your entire data quality life cycle. With ML and self-learning capabilities, organizations can measure, monitor, remediate and improve data quality across any type of data – all in one agile, innovative self-service platform.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No

Trifacta

Trifacta is an open and interactive cloud platform for data engineers and analysts to collaboratively profile, prepare, and pipeline data for analytics and machine learning. It automatically generates a visual profile of the data based on its content. Every profile is interactive: selecting elements of the profile prompts transformation suggestions.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No

WinPure Clean & Match

WinPure Clean & Match is a complete data quality, cleansing, matching, and de-duplication software suite for your mailing lists, databases, spreadsheets, CRMs, etc. It helps you fix data quality issues quickly by scanning each data list and providing over 30 different statistics, ranging from the percentage of filled and empty cells to the most common values and their counts. It also uses red and amber coloring to highlight potential data quality issues.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: Yes
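
As a rough illustration of the kind of column statistics described above (percentage of filled and empty cells, most common values and counts), here is a minimal Python sketch. It is not WinPure's implementation; `profile_column` and its output keys are hypothetical names, and a cell is assumed to be empty when it is `None` or a blank string.

```python
from collections import Counter


def profile_column(values, top_n=3):
    """Compute simple fill-rate and frequency statistics for one column."""
    total = len(values)
    # Assumption: None and whitespace-only strings count as empty cells.
    filled = [v for v in values if v is not None and str(v).strip() != ""]
    pct_filled = 100.0 * len(filled) / total if total else 0.0
    return {
        "count": total,
        "pct_filled": pct_filled,
        "pct_empty": 100.0 - pct_filled if total else 0.0,
        "distinct": len(set(filled)),
        "most_common": Counter(filled).most_common(top_n),
    }
```

Calling `profile_column(["a", "b", "a", "", None])` reports 60% filled, 40% empty, two distinct values, and `"a"` as the most common value.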

DataCleaner

DataCleaner is a premier open source data quality solution. The heart of DataCleaner is a strong data profiling engine for discovering and analyzing the quality of your data. Find the patterns, missing values, character sets, and other characteristics of your data values.

Commercial: Free
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: Yes
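
Pattern discovery of the kind mentioned above is often done by reducing each value to a character-class mask and counting the masks. The sketch below shows that general technique in Python; it is an assumption about how such profiling can work, not DataCleaner's actual engine, and the function names are hypothetical.

```python
from collections import Counter


def value_pattern(value):
    """Map a string to a mask: digits -> 9, upper -> A, other letters -> a."""
    mask = []
    for ch in value:
        if ch.isdigit():
            mask.append("9")
        elif ch.isalpha():
            mask.append("A" if ch.isupper() else "a")
        else:
            mask.append(ch)  # keep punctuation and spaces as-is
    return "".join(mask)


def pattern_counts(values):
    """Count how many values share each mask."""
    return Counter(value_pattern(v) for v in values)
```

For example, `"AB-1234"` and `"CD-5678"` both reduce to `AA-9999`, so rare masks in a column stand out as candidates for inspection.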

BigID

BigID offers actionable data intelligence by empowering you to find, classify, catalog, profile, and get context for all your data, whether in the cloud or in the data center, at rest or in motion, structured or unstructured, at petabyte scale. It lets you identify sensitive data using hundreds of pre-built NLP, deep learning, and pattern classifiers.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No

DQ*Plus

DQ*Plus data quality software cleanses data, verifies addresses, and finds and consolidates duplicate records in batch and in real time. It accurately matches duplicate records and intelligently merges them into a single best record.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: No
Data standardization: No
Free edition: No
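
The match-and-merge step described above can be sketched as follows. This is a deliberately simple Python illustration, not the DQ*Plus algorithm: it assumes records are dictionaries, treats two records as duplicates when their normalized key fields are identical, and builds the "best record" by keeping the first non-empty value for each field. All names are hypothetical.

```python
def normalize(text):
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    cleaned = "".join(c if c.isalnum() else " " for c in text.lower())
    return " ".join(cleaned.split())


def merge_duplicates(records, key_fields):
    """Group records by normalized key fields and merge each group."""
    groups = {}
    for rec in records:
        key = tuple(normalize(str(rec.get(f, ""))) for f in key_fields)
        groups.setdefault(key, []).append(rec)

    merged = []
    for group in groups.values():
        best = {}
        for rec in group:
            for field, value in rec.items():
                # Keep the first non-empty value seen for each field.
                if value not in (None, "") and best.get(field) in (None, ""):
                    best[field] = value
        merged.append(best)
    return merged
```

Real matching engines typically use fuzzy or probabilistic comparison rather than exact key equality; the exact match here only keeps the example short.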