Readme

Name: VerityDotNet
Author: Technik Interlytics

Overview

VerityDotNet is a .NET 6.0 library by Technik Interlytics, built on the principle that Better Data = Better Decisions. It is a data inspector, remediator, and normalizer for large structured data sets. It combines curated human expert knowledge with Machine Learning (ML) in software components that accelerate development and significantly lower the effort needed to make data ready for high-quality Data Science, Machine Learning, and predictive models.

The libraries contain expert algorithms developed from in-depth investigations, forensic tracing, and specialized remediation on large data systems across many fields, always with successful outcomes. Our human experts discovered where to look, what to look for, and what to do about problems, especially deeply buried ones missed by traditional tools. These algorithms were then tuned and tested on a variety of data sets to refine their performance and to determine the characteristics and statistics most useful for faster and more accurate data processing.

Features

Inspect: deep analysis and characterization of data for structural and value consistency, anomalies, and errors, especially deeply buried problems that common DataOps tools do not detect. Some examples are:
- data variations (types, formats, encoding) introduced by import/export between different systems, especially spreadsheets and legacy mainframes
- special characters that are not visible to users but cause downstream problems
- a small number of anomalies buried within very large data sets that overwhelm tools
- mismatched joint field values, such as geographic location codes and names
- long strings of digits used as codes (e.g. accounting, ERP) cast into number format, which strips digits and corrupts the codes
- records broken across multiple lines, causing misaligned fields, partial records, and incorrect values
- open source data with embedded information-only records (e.g. IRS USA Migration demographics, COVID disease census) that users are unaware of
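
Two of the problems listed above, invisible characters and digit codes mangled by a numeric cast, can be illustrated with a short C# sketch. This is not VerityDotNet's API and does not reproduce its algorithms: the `FieldChecks` class, its method names, and the `expectedLength` parameter are hypothetical, and the code uses only the .NET base class library.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;
using System.Linq;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration only; NOT part of the VerityDotNet API.
public static class FieldChecks
{
    // Matches values such as "1.23457E+15", the form a long digit code
    // often takes after a spreadsheet has cast it to a number.
    private static readonly Regex ScientificNotation =
        new Regex(@"^\d(\.\d+)?[eE][+-]?\d+$", RegexOptions.Compiled);

    // Returns the zero-based positions of control and format characters
    // in a field value (for example U+200B zero width space or U+FEFF).
    // Tabs and line breaks are control characters and are reported too,
    // on the assumption that they do not belong inside a single field.
    public static List<int> FindInvisibleCharacters(string value)
    {
        var positions = new List<int>();
        for (int i = 0; i < value.Length; i++)
        {
            UnicodeCategory category = char.GetUnicodeCategory(value[i]);
            if (category == UnicodeCategory.Control ||
                category == UnicodeCategory.Format)
            {
                positions.Add(i);
            }
        }
        return positions;
    }

    // Flags a code field that looks damaged by a numeric cast: either it is
    // written in scientific notation, or it is all ASCII digits but shorter
    // than the expected fixed length (leading zeros were probably stripped).
    public static bool LooksLikeCorruptedCode(string value, int expectedLength)
    {
        if (ScientificNotation.IsMatch(value))
        {
            return true;
        }
        bool allDigits = value.Length > 0 && value.All(c => c >= '0' && c <= '9');
        return allDigits && value.Length < expectedLength;
    }
}
```

For example, `FieldChecks.FindInvisibleCharacters("AB\u200BC")` reports position 2, and `FieldChecks.LooksLikeCorruptedCode("12345", 8)` returns `true` while the zero-padded `"00012345"` passes. Both checks are deliberately simple heuristics; real data would need field-specific rules.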

Remediate: make data accurate in structure (field positions, data types, formats) and in values (numeric ranges, codes, lists of allowed values). Detect and fix parsing errors, including challenging multi-line records. Some examples are:

gmalafsky/VerityDotNet v1.1.8
