⚠ **Deprecated:** the package namespace was refactored to better match its features.
Suggested alternative: **Parquet.Data.Ado**
A high-performance ADO.NET provider for Parquet files enabling seamless integration with .NET applications. Implements standard DbConnection/DbCommand/DbDataReader patterns for working with Parquet data through familiar ADO.NET abstractions. Features include SQL query support with filtering and projection, parallel reading of row groups, virtual column support, batch processing capabilities, and full async/await compatibility. Ideal for data analytics, ETL operations, and big data processing within the .NET ecosystem.
```
$ dotnet add package Parquet.Data.Reader
```

A .NET library that provides ADO.NET support for Parquet files, enabling seamless integration of Parquet data into .NET applications through familiar ADO.NET abstractions. This library is part of SQLFlow, a data automation framework.
Parquet.Data.Ado bridges the gap between the Parquet file format and .NET applications by implementing standard ADO.NET interfaces. This allows developers to work with Parquet files using the same patterns they use for traditional database access.
```
dotnet add package Parquet.Data.Ado
```
```csharp
// Connect to a Parquet file
using var connection = new ParquetConnection("path/to/file.parquet");
connection.Open();

// Create a command
using var command = connection.CreateCommand();
command.CommandText = "SELECT * FROM data";

// Execute and read the data
using var reader = command.ExecuteReader();
while (reader.Read())
{
    // Access data by column index or name
    var value = reader["column_name"];
    // Process the data...
}
```
```csharp
using var connection = new ParquetConnection("path/to/file.parquet");
connection.Open();

// Create an SQL command
using var sqlCommand = connection.CreateSqlCommand(
    "SELECT column1, column2 FROM data WHERE column3 > 100");

// Execute the query and get a DataTable
var dataTable = await ParquetExtensions.ToDataTableAsync(sqlCommand.ExecuteReader());
```
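The result of `ToDataTableAsync` is a standard `System.Data.DataTable`, so it can be consumed with the usual ADO.NET table API. A minimal sketch of the iteration pattern, using an in-memory stand-in table (the column names `column1`/`column2` are illustrative, not part of the library):

```csharp
using System;
using System.Data;

// Stand-in for the DataTable produced by the query above.
var dataTable = new DataTable("data");
dataTable.Columns.Add("column1", typeof(int));
dataTable.Columns.Add("column2", typeof(string));
dataTable.Rows.Add(1, "a");
dataTable.Rows.Add(2, "b");

// Iterate the materialized rows with the standard DataTable API.
foreach (DataRow row in dataTable.Rows)
{
    int c1 = (int)row["column1"];
    string c2 = (string)row["column2"];
    Console.WriteLine($"{c1}: {c2}");
}
```

Because the data is fully materialized at this point, the `DataTable` can also be bound to UI grids or re-exported without holding the Parquet file open.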
```csharp
// Create a virtual column definition
var virtualColumn = new VirtualColumn("ComputedColumn", typeof(decimal), 0.0m);

// Open a reader with the virtual column
using var reader = ParquetDataReaderFactory.CreateWithVirtualColumns(
    "path/to/file.parquet",
    new[] { virtualColumn });

// Now the reader includes the virtual column
while (reader.Read())
{
    decimal computedValue = reader.GetDecimal(reader.GetOrdinal("ComputedColumn"));
    // Use the virtual column value...
}
```
```csharp
// Create a batch reader for large files
using var batchReader = new ParquetBatchReader("path/to/large_file.parquet");

// Process batches asynchronously
await foreach (var batch in batchReader.ReadAllAsync())
{
    Console.WriteLine($"Processing batch {batch.RowGroupIndex} with {batch.RowCount} rows");

    // Process the batch...
    foreach (var column in batch.Columns)
    {
        // Access column data...
    }
}
```
```csharp
// Create or obtain a DataTable
var dataTable = new DataTable("MyData");
// ... populate the table ...

// Export to Parquet
await dataTable.ExportToParquetAsync("output.parquet");
```
The library includes a SQL parser that supports:

- SELECT statements with column selection
- WHERE clauses with complex conditions
- Comparison operators (`=`, `<>`, `<`, `>`, `<=`, `>=`)
- Logical operators (`AND`, `OR`, `NOT`)
- `LIKE` pattern matching
- `IN`, `BETWEEN`, and `IS NULL` predicates

The GitHub repository includes extensive tests that demonstrate the library's functionality. These tests can be used as examples and adapted to suit your specific requirements.
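The supported predicates can be combined in a single query. A hedged sketch, reusing the `ParquetConnection`/`CreateSqlCommand` API shown earlier and assuming the operators follow standard SQL syntax (the column names `name`, `amount`, and `comment` are illustrative):

```csharp
using var connection = new ParquetConnection("path/to/file.parquet");
connection.Open();

// One WHERE clause exercising several supported predicate forms:
// BETWEEN, IN, LIKE, IS NULL, and the AND/OR/NOT logical operators.
using var sqlCommand = connection.CreateSqlCommand(
    "SELECT name, amount FROM data " +
    "WHERE (amount BETWEEN 10 AND 100 OR amount IN (500, 1000)) " +
    "AND name LIKE 'A%' " +
    "AND NOT (comment IS NULL)");

using var reader = sqlCommand.ExecuteReader();
while (reader.Read())
{
    Console.WriteLine($"{reader["name"]}: {reader["amount"]}");
}
```

Since filtering and projection happen in the reader, only the matching rows and the two selected columns are materialized.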
This library is a component of SQLFlow, a comprehensive Data Automation framework that enables seamless data integration, transformation, and analysis across various data sources.
This project is licensed under the MIT License - see the LICENSE file for details.