Data profiling tools
Talend Data Fabric
Talend Data Fabric combines data integration, integrity, and governance in a single, unified platform. It lets you extract and process data from virtually any source, load it into your data warehouse, and profile it along the way. Data profiling helps you quickly identify data quality issues, discover hidden patterns, and spot anomalies through summary statistics and graphical representations.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Mac OS, Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
IBM InfoSphere Information Analyzer
IBM InfoSphere Information Analyzer provides data profiling and analysis to accurately evaluate the content and structure of your data for consistency and quality. It uses a reusable rules library and supports multi-level evaluations by rule, record, and pattern. It also supports managing exceptions to established rules, making it easier to identify data inconsistencies, redundancies, and anomalies and to infer the best choices for data structure.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux, Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg, Stdev |
Tagging data: | |
DataRobot Data Prep
DataRobot Data Prep enables both novice and expert users to quickly and interactively explore, profile, clean, enrich and shape diverse data into AI assets ready for machine learning model development and deployment. It offers a visually interactive user interface that presents data in familiar tabular or spreadsheet style with no coding required. DataRobot provides profiles for every record and feature, including how many values are unique or missing and the statistical mean, standard deviation, median, minimum value, and maximum value.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Cloud |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | - |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg, Max, Min, Stdev |
Tagging data: | |
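The kind of per-feature profile described for DataRobot Data Prep above (unique and missing counts plus mean, standard deviation, median, minimum, and maximum) can be sketched in a few lines of standard-library Python. This is only an illustration of what such a profile contains, not how DataRobot computes it; the function name `profile_column`, the use of `None` to mark a missing value, and the sample data are all assumptions made for the example.

```python
import statistics


def profile_column(values):
    """Summarize one numeric column; None marks a missing value (assumption)."""
    present = [v for v in values if v is not None]
    profile = {
        "count": len(values),
        "missing": len(values) - len(present),
        "unique": len(set(present)),
    }
    if present:
        profile.update(
            mean=statistics.mean(present),
            # Sample standard deviation needs at least two data points.
            stdev=statistics.stdev(present) if len(present) > 1 else 0.0,
            median=statistics.median(present),
            min=min(present),
            max=max(present),
        )
    return profile


# Hypothetical sample column with one missing value.
ages = [34, 29, None, 41, 29, 52]
print(profile_column(ages))
```

Running the same function over every column of a dataset gives a simple table of summary statistics, which is roughly the shape of the "Statistics of data" row in the comparison tables.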
Informatica Data Profiling
Informatica’s data profiling solution, Data Explorer, is available in two editions, Standard and Advanced. Both scan every data record from any source to find anomalies and hidden relationships, regardless of how complex the data is or how the data sources relate to one another.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Cloud |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | - |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | Avg, Max, Min, Stdev |
Tagging data: | |
DataCleaner
The heart of DataCleaner is a strong data profiling engine for discovering and analyzing the quality of your data. It finds patterns, missing values, character sets, and other characteristics of your data values.
Access control: | |
---|---|
Commercial: | Free |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux, Mac OS, Windows |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |
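Pattern and missing-value discovery of the sort mentioned for DataCleaner above can be illustrated with a small sketch. It is not DataCleaner's implementation; it assumes one common convention in which each digit becomes `9`, each uppercase letter `A`, each other letter `a`, and every other character is kept as-is. The function names and sample values are hypothetical.

```python
from collections import Counter


def value_pattern(value):
    """Reduce a string to its shape: digits -> 9, uppercase -> A, other letters -> a."""
    shape = []
    for ch in value:
        if ch.isdigit():
            shape.append("9")
        elif ch.isalpha():
            shape.append("A" if ch.isupper() else "a")
        else:
            shape.append(ch)  # keep punctuation and spaces unchanged
    return "".join(shape)


def profile_patterns(values):
    """Count missing values (None or blank) and the shapes of the remaining ones."""
    present = [v for v in values if v is not None and v.strip() != ""]
    return {
        "missing": len(values) - len(present),
        "patterns": Counter(value_pattern(v) for v in present),
    }


# Hypothetical product codes with inconsistent formatting.
codes = ["AB-1234", "CD-5678", "ef-9012", "", None, "GH5678"]
result = profile_patterns(codes)
print(result["missing"])
print(result["patterns"].most_common())
```

Values whose shape is rare compared with the dominant one are candidates for a data quality review.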
Global IDs Data Profiling Suite
Global IDs Data Profiling Suite is a data discovery and profiling tool that automates both the discovery and the profiling of data assets and provides an active inventory of all of them.
Access control: | |
---|---|
Commercial: | Commercial |
Desktop/Cloud: | Desktop |
Excel workbooks: | |
Flat files: | |
Free edition: | |
Metadata identification: | |
NoSQL sources: | |
Runs on (for desktop): | Linux |
Sensitive data discovery: | |
SQL sources: | |
Statistics of data: | - |
Tagging data: | |