Data profiling tools

Dataedo

Dataedo is a metadata management & data catalog tool with a data profiling feature. It allows you to use sample data to learn what data is stored in your data assets. You can browse min, max, average and median values, see top values, as well as value and row distribution to understand the data better before using it.

Access control: Yes
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: Yes
Free edition: No
Metadata identification: Yes
NoSQL sources: Yes
Runs on: (for desktop): Windows
Sensitive data discovery: Yes
SQL sources: Yes
Statistics of data: Avg,Max,Min,Stdev
Tagging data: -
Data Profiling - Dataedo Web
Data Profiling - Dataedo Desktop
Database Web Table Diagram

Aperture Data Studio

Aperture Data Studio is a powerful and easy-to-use data management suite that helps you quickly and easily profile data to understand deficiencies as an essential first step to cleansing, joining, and validating data. It profiles the complete data set and audits every step in readiness for statutory reporting and enhanced transparency of data and processes, de-risking compliance initiatives.

Access control: No
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: Yes
Free edition: No
Metadata identification: No
NoSQL sources: Yes
Runs on: (for desktop): Windows
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: Avg,Stdev
Tagging data: Yes

Toad Data Point

Toad Data Point is a multi-platform database query, data prep, and reporting tool. It lets you visually profile and sample database tables and data sets for patterns, unique values, duplicates, missing information, min./max. values and more.

Access control: No
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: Yes
Free edition: No
Metadata identification: Yes
NoSQL sources: Yes
Runs on: (for desktop): Windows
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: Avg,Max,Min,Stdev
Tagging data: Yes

Trillium Discovery

Trillium Discovery provides industry-leading data profiling at scale, designed specifically to meet the challenges presented by today’s data environments, with native connectivity to cloud and big data sources to execute data profiling tasks. It lets you visually assess the quality of your data and support data governance with comprehensive profiling, customized to your business

Access control: No
Commercial: Commercial
Desktop/Cloud: Cloud
Excel workbooks: No
Flat files: Yes
Free edition: No
Metadata identification: Yes
NoSQL sources: Yes
Runs on: (for desktop): -
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: -
Tagging data: No

Astera Centerprise

Astera Centerprise is an end-to-end data integration software that enables you to integrate, cleanse, and transform data in a code-free environment. Its built-in data profiling feature lets you easily examine your source data and get detailed information about its structure, quality, and integrity. Custom data integration and quality rules can also be defined to validate incoming data and identify missing or invalid records.

Access control: Yes
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: Yes
Free edition: No
Metadata identification: No
NoSQL sources: No
Runs on: (for desktop): Windows
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: Avg,Max,Min
Tagging data: No

CloverDX

CloverDX Data Profiler is a CloverDX module that lets you perform various analyses of your data. It is a part of CloverDX Designer and helps to do various profiling tasks, such as finding the maximum value, median, the most unique value, and many others.

Access control: No
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: Yes
Free edition: No
Metadata identification: Yes
NoSQL sources: Yes
Runs on: (for desktop): Linux,Mac OS,Windows
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: Avg,Max,Min,Stdev
Tagging data: No

MIOvantage

MIOvantage is a single solution platform that lets you profile data, run rules, deduplicate data, identify entities, generate reports, and more. From entity resolution to complex deduplication, MIOvantage builds a better, clearer picture from your data.

Access control: No
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: No
Free edition: No
Metadata identification: No
NoSQL sources: No
Runs on: (for desktop): Windows
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: -
Tagging data: No

SAS Data Quality

SAS Data Quality gives you a single interface to manage the entire data quality life cycle: profiling, standardizing, matching, and monitoring. It lets you validate data against standard measures and customized business rules. Uncover relationships across tables, databases, and source applications. Verify that the data in your tables matches the appropriate description. Establish trends and commonalities in business information and examine numerical trends via mean, median, mode, and standard deviation.

It makes it easy to profile and identify problems, preview data, and set up repeatable processes to maintain a high level of data quality.

Access control: Yes
Commercial: Commercial
Desktop/Cloud: Cloud
Excel workbooks: No
Flat files: No
Free edition: No
Metadata identification: Yes
NoSQL sources: No
Runs on: (for desktop): -
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: Avg,Stdev
Tagging data: No

Alation Data Catalog

Alation’s data profiling capabilities help reduce the time spent in the data exploration phase. With Alation’s data profile, data consumers have the metrics they need to easily discern the quality of any data object. Alation displays important characteristics, statistics, and numerical graphs about the data — enabling data scientists and data engineers to quickly take action. The data profiling now also includes new charts and customizations.

Access control: No
Commercial: Commercial
Desktop/Cloud: Cloud
Excel workbooks: No
Flat files: Yes
Free edition: No
Metadata identification: Yes
NoSQL sources: Yes
Runs on: (for desktop): -
Sensitive data discovery: Yes
SQL sources: Yes
Statistics of data: -
Tagging data: Yes