Data profiling tools for Amazon RDS

Data Profiling tools allow analyzing, monitoring, and reviewing data from existing databases in order to provide critical insights. Data profiling can help organizations improve data quality and decision-making process by identifying problems and addressing them before they arise.

Dataedo

Dataedo is a metadata management & data catalog tool with a data profiling feature. It allows you to use sample data to learn what data is stored in your data assets. You can browse min, max, average and median values, see top values, as well as value and row distribution to understand the data better before using it.

Access control: Yes
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: Yes
Free edition: No
Metadata identification: Yes
NoSQL sources: Yes
Runs on: (for desktop): Windows
Sensitive data discovery: Yes
SQL sources: Yes
Statistics of data: Avg,Max,Min,Stdev
Tagging data: -
Dataedo Data Profiling
Data Profiling in Desktop

Ataccama ONE

Ataccama One lets you discover, analyze, understand critical patterns in your data. You can see data domains and data quality highlights for each attribute. Re-profile data in one click and check whether the problems you identified were fixed. You can select as many tables in a data source as you need and profile them all in one click.

Access control: No
Commercial: Commercial
Desktop/Cloud: Desktop
Excel workbooks: Yes
Flat files: Yes
Free edition: Yes
Metadata identification: Yes
NoSQL sources: No
Runs on: (for desktop): Windows
Sensitive data discovery: Yes
SQL sources: Yes
Statistics of data: -
Tagging data: Yes

Idera SQL Data Profiler

SQL Data Profiler analyzes and summarizes data to produce valuable insights into data patterns. It lets you profile data in SQL server tables, analyze subsets of data types at a time, adjust profiling thresholds to customize the analysis, display summary of data in selected table and its columns, receive recommendations based on data per column, view summary of value distribution per column, do many other functions.

Access control: No
Commercial: Free
Desktop/Cloud: Desktop
Excel workbooks: No
Flat files: No
Free edition: Yes
Metadata identification: No
NoSQL sources: No
Runs on: (for desktop): Windows
Sensitive data discovery: No
SQL sources: Yes
Statistics of data: -
Tagging data: No

Alation Data Catalog

Alation’s data profiling capabilities help reduce the time spent in the data exploration phase. With Alation’s data profile, data consumers have the metrics they need to easily discern the quality of any data object. Alation displays important characteristics, statistics, and numerical graphs about the data — enabling data scientists and data engineers to quickly take action. The data profiling now also includes new charts and customizations.

Access control: No
Commercial: Commercial
Desktop/Cloud: Cloud
Excel workbooks: No
Flat files: Yes
Free edition: No
Metadata identification: Yes
NoSQL sources: Yes
Runs on: (for desktop): -
Sensitive data discovery: Yes
SQL sources: Yes
Statistics of data: -
Tagging data: Yes

The use of data profiling tools can lead to higher-quality, more reliable data or eliminating errors that add costs to data-driven projects. Eliminating these costly errors involve processes such as:

• Collecting descriptive statistics.
• Collecting data types, length and recurring patterns.
• Tagging data with keywords, descriptions or categories.
• Performing data quality assessment.
• Discovering metadata and assessing its accuracy.

The most efficient way of handling the data profiling process is to automate it with a data management solution. We prepared a list of open-source data profiling tools that help you carry out the analysis of your data and identify the issues.