We have hosted the application dataprofiler in order to run this application in our online workstations with Wine or directly.


Quick description about dataprofiler:

DataProfiler is an AI-powered tool for automatic data analysis and profiling, designed to detect patterns, anomalies, and schema inconsistencies in structured and unstructured datasets. The DataProfiler is a Python library designed to make data analysis, monitoring, and sensitive data detection easy. Loading Data with a single command, the library automatically formats & loads files into a DataFrame. Profiling the Data, the library identifies the schema, statistics, entities (PII / NPI), and more. Data Profiles can then be used in downstream applications or reports.

Features:
  • Automatically detects schema, types, and distributions in datasets
  • Supports structured (CSV, SQL) and unstructured (text, logs) data
  • Identifies Personally Identifiable Information (PII)
  • Provides statistical summaries and data quality metrics
  • Works with large-scale datasets efficiently
  • Open-source with Python API integration


Programming Language: Python.
Categories:
Natural Language Processing (NLP)

Page navigation:

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.