Data quality tools

Dataedo

Dataedo is a metadata management and data catalog tool that helps you ensure data quality while documenting data. Its data profiling feature uses sample data to show what is stored in your data assets and whether it is of good quality. It also has a data community module, which lets everyone share observations and feedback on data through comments, ratings, questions, to-dos, and warnings.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No
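
As a rough illustration of what data profiling on sample data involves, the sketch below computes a few basic statistics per column. It is not Dataedo's implementation. The function names (profile_column, profile_table), the choice of statistics, and the rule that None and empty strings count as missing are all assumptions made for this example.

```python
def profile_column(values):
    """Summarize one column from a list of sampled values.

    Assumption: None and "" both count as missing, and the remaining
    values in a column are of one comparable type.
    """
    non_null = [v for v in values if v is not None and v != ""]
    return {
        "rows": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(set(non_null)),
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
    }


def profile_table(rows):
    """Profile every column of a sample given as a list of dicts."""
    columns = sorted({key for row in rows for key in row})
    return {
        col: profile_column([row.get(col) for row in rows])
        for col in columns
    }


if __name__ == "__main__":
    # Hypothetical sample rows, used only to show the call.
    sample = [
        {"id": 1, "city": "Oslo"},
        {"id": 2, "city": ""},
        {"id": 3, "city": "Oslo"},
    ]
    for column, stats in profile_table(sample).items():
        print(column, stats)
```

A high null count or an unexpectedly low distinct count in such a profile is the kind of signal that prompts a closer look at a column.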

Global IDs Data Quality Suite

Global IDs Data Quality Suite ensures data quality by establishing control points and read-only quality controls at the database level. It continuously monitors quality metrics and automates control generation for critical data elements across all kinds of data sources.

Commercial: Commercial
Data cleansing: No
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Datactics

Datactics is a data quality tool that provides a comprehensive set of data quality operations to profile, measure, clean, de-duplicate, and match multiple data types. It lets you gain deep insights into the quality of data through rich visualizations in off-the-shelf tools.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Trillium Quality

Trillium Quality rapidly transforms high-volume, disconnected data into trusted and actionable business insights with scalable enterprise data quality. It is designed to run natively in cloud or on-premises big data environments, ensuring your business information is integrated, fit for purpose, and accessible across the enterprise, regardless of volume.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Talend Open Studio for Data Quality

Talend Open Studio for Data Quality profiles your data and provides a graphical drill-down of the details. It is a free data quality tool that provides advanced data profiling, including fraud pattern detection using Benford's law, advanced statistics with indicator thresholds, column set analysis, advanced matching analysis, and time column correlation analysis.

Commercial: -
Data cleansing: -
Data Discovery & Search: -
Data Profiling: -
Data standardization: -
Free edition: -
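
The Benford's law check mentioned for Talend Open Studio rests on a simple formula: in many naturally occurring numeric data sets, the leading digit d appears with probability log10(1 + 1/d). The sketch below compares observed leading-digit frequencies against that expectation. It is a generic illustration, not Talend's implementation, and the function names are assumptions made for this example. It also assumes finite numeric input.

```python
import math
from collections import Counter
from decimal import Decimal


def benford_expected():
    """Expected share of each leading digit 1..9 under Benford's law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def leading_digit(value):
    """First significant digit of a nonzero, finite number."""
    # Decimal's digit tuple carries no leading zeros for nonzero values.
    return Decimal(str(abs(value))).as_tuple().digits[0]


def benford_deviation(values):
    """Observed minus expected share for each leading digit.

    Zeros are skipped because they have no leading significant digit.
    """
    digits = [leading_digit(v) for v in values if v != 0]
    if not digits:
        raise ValueError("need at least one nonzero value")
    counts = Counter(digits)
    expected = benford_expected()
    return {d: counts.get(d, 0) / len(digits) - expected[d] for d in range(1, 10)}
```

Large deviations are only a hint that the figures deserve a closer look. Data that spans a narrow range, or values assigned by people (such as prices ending in 99), often departs from Benford's law without any fraud involved.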

SAS Data Quality

SAS Data Quality is a comprehensive tool that meets all the data quality requirements of a business. It makes it easy to profile and identify problems, preview data, and set up repeatable processes to maintain a high level of data quality.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Oracle Data Profiling and Data Quality for Data Integrator

Oracle Data Quality for Data Integrator is a comprehensive, award-winning data quality platform that meets even the most complex data quality requirements. Oracle Data Quality addresses the enterprise data quality needs of all projects, including data warehousing and business intelligence, master data management, data integration, migration, and service-oriented integration processes.

Commercial: Free
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: Yes

Innovative Systems Enlighten

Enlighten is an integrated data quality suite that offers customizable and comprehensive solutions for organizations of any size. It provides a wide range of capabilities, including data profiling and standardization, cleansing, linking and deduplicating records, monitoring data quality over time, validating addresses, and geocoding.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Open Source Data Quality and Profiling

The Open Source Data Quality and Profiling tool is an open source project dedicated to data quality and data preparation. The project is developing a high-performance integrated data management platform intended to handle data integration, data profiling, data quality, data preparation, dummy data creation, metadata discovery, anomaly discovery, data cleansing, reporting, and analytics.

Commercial: Free
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: Yes

Uniserv Data Cleansing

Data Cleansing is the Uniserv product that helps you ensure data quality by performing highly efficient cleansing during batch processing. For example, the tool can extract the required data from source systems through connectors, cleanse postal data, enrich data with supplementary information, and identify duplicates, including complex cases that require consolidation.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: No
Data standardization: Yes
Free edition: No

IBM InfoSphere QualityStage

IBM InfoSphere QualityStage is designed to support your data quality and information governance initiatives. It enables you to investigate, cleanse and manage your data, helping you maintain consistent views of key entities. It provides capabilities including data profiling, standardization, probabilistic matching, and data enrichment.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

Talend Data Quality

As an integral part of Talend Data Fabric, Talend Data Quality profiles, cleans, and masks data in real time. It lets you quickly identify data quality issues, discover hidden patterns, and spot anomalies through summary statistics and graphical representations.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: No
Data Profiling: Yes
Data standardization: Yes
Free edition: No

DataMatch Enterprise

Data Ladder’s DataMatch Enterprise is a full-fledged data quality solution that enables organizations to perform key data management operations, including data profiling, data cleansing, data preparation, data standardization, and, most importantly, data matching.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No
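
To give a feel for what data matching means in practice, the sketch below flags likely duplicate records by string similarity, using Python's standard difflib module. This is a deliberately simple stand-in and not how DataMatch Enterprise or any other tool listed here works. The function names, the normalization rule, and the 0.85 threshold are assumptions made for this example.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def similarity(a, b):
    """Similarity ratio between 0.0 and 1.0 after normalization."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def find_duplicates(records, threshold=0.85):
    """Return (i, j, score) for every pair of records at or above the threshold.

    Compares all pairs, so this only suits small samples.
    """
    pairs = []
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            score = similarity(records[i], records[j])
            if score >= threshold:
                pairs.append((i, j, score))
    return pairs


if __name__ == "__main__":
    # Hypothetical company names, used only to show the call.
    names = ["Acme Corp", "ACME  corp", "Globex"]
    print(find_duplicates(names))
```

Comparing every pair grows quadratically with the number of records, which is one reason dedicated matching tools typically group candidate records first and compare only within each group.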

DQLabs

DQLabs.ai is an augmented data quality platform to manage your entire data quality life cycle. With ML and self-learning capabilities, organizations can measure, monitor, remediate and improve data quality across any type of data – all in one agile, innovative self-service platform.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: No
Free edition: No

Trifacta

Trifacta is an open and interactive cloud platform for data engineers and analysts to collaboratively profile, prepare, and pipeline data for analytics and machine learning. It automatically generates a visual profile of the data based on its content. Every profile is fully interactive: selecting elements of the profile prompts transformation suggestions.

Commercial: Commercial
Data cleansing: Yes
Data Discovery & Search: Yes
Data Profiling: Yes
Data standardization: Yes
Free edition: No