Data profiling tools for Excel
Data profiling tools let you analyze, monitor, and review data from existing sources to surface critical insights. Data profiling can help organizations improve data quality and decision-making by identifying problems early and addressing them before they affect downstream work.
Dataedo
Dataedo is a data governance platform with a data profiling feature. It allows you to use sample data to learn what data is stored in your data assets. You can browse min, max, average and median values, see top values, as well as value and row distribution to understand the data better before using it.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg,Max,Min,Stdev |
Tagging data: | - |
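To make "top values" and "value distribution" concrete, here is a minimal Python sketch of how such figures can be computed for one column. It is not how Dataedo implements the feature; the function name `top_values` and the sample country codes are hypothetical.

```python
from collections import Counter


def top_values(values, n=3):
    """Return the n most frequent values as (value, count, share of rows)."""
    counts = Counter(values)
    total = len(values)
    # most_common() yields nothing for an empty column, so no division by zero.
    return [(value, count, count / total) for value, count in counts.most_common(n)]


# Hypothetical sample column, e.g. a "Country" column read from a worksheet.
countries = ["PL", "US", "PL", "DE", "PL", "US"]
for value, count, share in top_values(countries, n=2):
    print(f"{value}: {count} rows ({share:.0%})")
```

A distribution like this is often enough to spot a column dominated by a single value or polluted by a handful of misspelled variants.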
DataCleaner
The heart of DataCleaner is a strong data profiling engine for discovering and analyzing the quality of your data. Find the patterns, missing values, character sets and other characteristics of your data values.
Access control: | |
---|---|
Commercial: | Free |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux,Mac OS,Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
Atlan
Atlan automatically profiles your data to identify missing values, outliers & other data anomalies. Data profiles are fully configurable, and admins can schedule data profile updates, run profiles on random/stratified samples or custom filters. Atlan's data profile is an open ecosystem, allowing teams to import data quality metrics from external ecosystems like data pipeline tools for key metrics, such as timeliness, or other internal tools or frameworks.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Cloud |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | - |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg,Stdev |
Tagging data: | |
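The ideas named in the Atlan description (missing values, outliers, profiling a stratified sample) can be illustrated with a short Python sketch. This is a generic illustration under stated assumptions, not Atlan's implementation: the 1.5 × IQR outlier rule is one common choice among several, and the function names `count_missing`, `find_outliers`, and `stratified_sample` are hypothetical.

```python
import random
import statistics


def count_missing(values):
    """Count entries that are None or an empty string."""
    return sum(1 for v in values if v is None or v == "")


def find_outliers(numbers, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (assumed rule, not a standard)."""
    if len(numbers) < 2:
        return []
    q1, _, q3 = statistics.quantiles(numbers, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [x for x in numbers if x < low or x > high]


def stratified_sample(rows, key, per_group, seed=0):
    """Pick up to per_group rows from each group defined by key(row)."""
    rng = random.Random(seed)  # fixed seed so repeated profile runs are comparable
    groups = {}
    for row in rows:
        groups.setdefault(key(row), []).append(row)
    sample = []
    for members in groups.values():
        sample.extend(rng.sample(members, min(per_group, len(members))))
    return sample


# Hypothetical order amounts with one suspicious value.
print(find_outliers([10, 12, 11, 13, 12, 11, 95]))
```

Sampling per group rather than over the whole table keeps small categories represented in the profile, which matters when the anomalies are concentrated in a rarely used segment.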
Ataccama ONE
Ataccama ONE lets you discover, analyze, and understand critical patterns in your data. You can see data domains and data quality highlights for each attribute. Re-profile data in one click and check whether the problems you identified were fixed. You can select as many tables in a data source as you need and profile them all in one click.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
Trifacta
Trifacta is an open and interactive cloud platform for data engineers and analysts to collaboratively profile, prepare, and pipeline data for analytics and machine learning. For ease of data profiling, Trifacta automatically identifies dataset formats, schemas, specific attributes, and relationships across attributes and datasets, along with associated metadata for each dataset.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Cloud |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | - |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg,Max,Min,Stdev |
Tagging data: | |
Datiris Profiler (discontinued)
Datiris Profiler is an intuitive data profiling tool. Its key features include cross-table analysis, domain validation, pattern analysis, conditional profiling, command-line interface, and many more. Besides that, with features such as batch profiling, you can queue up and profile data quickly and spend your time analyzing instead of gathering it.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
Melissa Data Profiler
Melissa Data Profiler analyzes data before it’s merged into your warehouse, then helps ensure consistent data quality once it’s there. It lets you identify data quality issues, monitor improvements over time, and utilize reference data to determine if your input is consistent with expected data. It can also determine if the input data is consistently fielded using the data contained in the entire record to analyze the context of data.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux,Mac OS,Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg,Stdev |
Tagging data: | |
Global IDs Data Profiling Suite
Global IDs Data Profiling Suite is a data discovery and profiling tool that automates the discovery of data assets, automates data profiling, and provides an active inventory of all data assets.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
StarDQ
StarDQ is a powerful enterprise solution for profiling, cleansing, augmenting, and standardizing the data to significantly improve returns on corporate intelligence initiatives.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Cloud |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | - |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
MIOvantage
MIOvantage is a single solution platform that lets you profile data, run rules, deduplicate data, identify entities, generate reports, and more. From entity resolution to complex deduplication, MIOvantage builds a better, clearer picture from your data.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
OpenDQ
OpenDQ integrates data profiling, standardization, enhancement, fuzzy matching, and de-duplication components with enterprise-class data extraction, transformation, and loading software, to create a comprehensive and complete view of enterprise data. It lets you identify your data’s current state, resolve missing values/erroneous values, discover formats and patterns, reveal hidden business rules, report on column minimums, averages, and maximums, measure business rule compliance across data sets, and provide point in time data profiling history.
Access control: | |
---|---|
Commercial: | Free |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux,Mac OS,Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg,Max,Min |
Tagging data: | |
CloverDX
CloverDX Data Profiler is a CloverDX module that lets you perform various analyses of your data. It is a part of CloverDX Designer and helps to do various profiling tasks, such as finding the maximum value, median, the most unique value, and many others.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux,Mac OS,Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg,Max,Min,Stdev |
Tagging data: | |
Experian Pandora for Data Profiling
Experian Pandora for Data Profiling helps you focus on fixing data errors by enabling business users to conduct profiling analysis and relationship discovery quickly. It automatically discovers broken keys, orphaned records, and thousands of content quality issues using the fault detection features of Experian's data management platform.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
Astera Centerprise
Astera Centerprise is an end-to-end data integration software that enables you to integrate, cleanse, and transform data in a code-free environment. Its built-in data profiling feature lets you easily examine your source data and get detailed information about its structure, quality, and integrity. Custom data integration and quality rules can also be defined to validate incoming data and identify missing or invalid records.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg,Max,Min |
Tagging data: | |
WinPure Clean & Match
The Data Profiling / Statistics module within WinPure Clean & Match is a user-friendly and powerful data profiling tool that can help your business to discover patterns and meaning in your data and to check the quality of your data by analyzing formats, types, completeness, and value counts. It presents you with a complete set of statistics that you can use to help clean and correct your data, and to prepare it better for data matching.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
Using data profiling tools can lead to higher-quality, more reliable data and eliminate errors that add cost to data-driven projects. Eliminating these costly errors involves processes such as:
• Collecting descriptive statistics.
• Collecting data types, length and recurring patterns.
• Tagging data with keywords, descriptions or categories.
• Performing data quality assessment.
• Discovering metadata and assessing its accuracy.
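The first two processes in the list above (descriptive statistics, and data types, lengths and recurring patterns) can be sketched in a few lines of Python. This is an illustration only, not the method of any tool in this list. It assumes the column has already been read from the workbook into a list of strings, and the names `profile_column` and `value_pattern` are hypothetical.

```python
import re
import statistics
from collections import Counter


def value_pattern(text):
    """Map letters to 'A' and digits to '9', keeping other characters as-is."""
    return re.sub(r"[0-9]", "9", re.sub(r"[A-Za-z]", "A", text))


def profile_column(values):
    """Return a small profile of one column given as a list of strings.

    Empty strings and None count as missing. A column is treated as numeric
    only if every present value parses with float() (a simplification:
    strings such as "nan" or "1e5" also parse).
    """
    present = [v for v in values if v not in (None, "")]
    numbers = []
    for v in present:
        try:
            numbers.append(float(v))
        except ValueError:
            pass

    is_number = bool(present) and len(numbers) == len(present)
    profile = {
        "rows": len(values),
        "missing": len(values) - len(present),
        "inferred_type": "number" if is_number else "text",
        "min_length": min((len(v) for v in present), default=0),
        "max_length": max((len(v) for v in present), default=0),
        "patterns": Counter(value_pattern(v) for v in present).most_common(3),
    }
    if is_number:
        profile.update(
            min=min(numbers),
            max=max(numbers),
            avg=statistics.mean(numbers),
            median=statistics.median(numbers),
            stdev=statistics.stdev(numbers) if len(numbers) > 1 else 0.0,
        )
    return profile


# Hypothetical product codes: the pattern counts expose the malformed entry.
print(profile_column(["AB-123", "CD-456", "x"])["patterns"])
```

Pattern counts are a cheap way to find values that break a column's expected format, while the numeric statistics correspond to the Avg, Max, Min and Stdev entries in the "Statistics of data" rows above.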
The most efficient way to handle data profiling is to automate it with a data management solution. The tools listed here, both free and commercial, can help you analyze your data and identify its issues.