High-performance tensor operations with SIMD acceleration for .NET. Provides Vector, Matrix, and Tensor types with hardware-accelerated operations (Sin, Cos, Exp, Log, etc.) that work across .NET Framework 4.7.1 and .NET 10.0. Supports any numeric type via generic INumericOperations interface. Optional GPU acceleration available via separate packages.
Internally, hot paths use `ArrayPool<T>` and `Span<T>` to minimize allocations.

```shell
# Core package (CPU SIMD acceleration)
dotnet add package AiDotNet.Tensors

# Optional: OpenBLAS for optimized CPU BLAS operations
dotnet add package AiDotNet.Native.OpenBLAS

# Optional: CLBlast for OpenCL GPU acceleration (AMD/Intel/NVIDIA)
dotnet add package AiDotNet.Native.CLBlast

# Optional: CUDA for NVIDIA GPU acceleration (requires an NVIDIA GPU)
dotnet add package AiDotNet.Native.CUDA
```
```csharp
using AiDotNet.Tensors.LinearAlgebra;

// Create vectors
var v1 = new Vector<double>(new[] { 1.0, 2.0, 3.0, 4.0 });
var v2 = new Vector<double>(new[] { 5.0, 6.0, 7.0, 8.0 });

// SIMD-accelerated operations
var sum = v1 + v2;
var dot = v1.Dot(v2);

// Create matrices
var m1 = new Matrix<double>(3, 3);
var m2 = Matrix<double>.Identity(3);

// Matrix operations
var product = m1 * m2;
var transpose = m1.Transpose();
```
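The element-wise math functions mentioned in the overview (Sin, Cos, Exp, Log) follow the same pattern. A sketch, assuming `Vector<T>` exposes them as instance methods with these names (the exact method names and an element indexer are assumptions, not confirmed API):

```csharp
using System;
using AiDotNet.Tensors.LinearAlgebra;

var angles = new Vector<double>(new[] { 0.0, Math.PI / 2, Math.PI });

// Hardware-accelerated element-wise transcendental functions.
// Method names here are assumed from the feature list above.
var sines = angles.Sin();   // element-wise sine
var exps  = angles.Exp();   // element-wise e^x
```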
The library automatically detects and uses the best available SIMD instructions:
| Instruction Set | Vector Width | Requires |
|---|---|---|
| AVX-512 | 512-bit (16 floats) | .NET 8+ |
| AVX2 | 256-bit (8 floats) | .NET 6+ |
| AVX | 256-bit (8 floats) | .NET 6+ |
| SSE4.2 | 128-bit (4 floats) | .NET 6+ |
| ARM NEON | 128-bit (4 floats) | .NET 6+ |
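Independently of the library, you can cross-check what the runtime itself reports using .NET's built-in intrinsics API (x86 flags shown here; `AdvSimd` covers ARM NEON; `Avx512F` requires .NET 8 or later):

```csharp
using System;
using System.Runtime.Intrinsics.Arm;
using System.Runtime.Intrinsics.X86;

// These flags are what the JIT sees; the library's SIMD paths
// light up on the same hardware support.
Console.WriteLine($"SSE4.2:  {Sse42.IsSupported}");
Console.WriteLine($"AVX:     {Avx.IsSupported}");
Console.WriteLine($"AVX2:    {Avx2.IsSupported}");
Console.WriteLine($"AVX-512: {Avx512F.IsSupported}"); // .NET 8+
Console.WriteLine($"NEON:    {AdvSimd.IsSupported}");
```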
```csharp
using AiDotNet.Tensors.Engines;

var caps = PlatformDetector.Capabilities;

// SIMD capabilities
Console.WriteLine($"AVX2: {caps.HasAVX2}");
Console.WriteLine($"AVX-512: {caps.HasAVX512F}");

// GPU support
Console.WriteLine($"CUDA: {caps.HasCudaSupport}");
Console.WriteLine($"OpenCL: {caps.HasOpenCLSupport}");

// Native library availability
Console.WriteLine($"OpenBLAS: {caps.HasOpenBlas}");
Console.WriteLine($"CLBlast: {caps.HasClBlast}");

// Or get a full status summary
Console.WriteLine(NativeLibraryDetector.GetStatusSummary());
```

Provides optimized CPU BLAS operations using OpenBLAS:
```shell
dotnet add package AiDotNet.Native.OpenBLAS
```

Performance: roughly 2x faster matrix operations compared to managed code.
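A quick way to see the effect on your own machine is to time a large multiply with `Stopwatch`. This sketch assumes `Matrix<T>` multiplication is routed to OpenBLAS automatically once the package is installed:

```csharp
using System;
using System.Diagnostics;
using AiDotNet.Tensors.LinearAlgebra;

const int n = 1024;
var a = new Matrix<double>(n, n);
var b = new Matrix<double>(n, n);

var sw = Stopwatch.StartNew();
var c = a * b;   // routed to OpenBLAS when the native package is present
sw.Stop();
Console.WriteLine($"{n}x{n} multiply: {sw.ElapsedMilliseconds} ms");
```

Run it once with only the core package and once with `AiDotNet.Native.OpenBLAS` installed to compare.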
Provides GPU acceleration via OpenCL (works on AMD, Intel, and NVIDIA GPUs):
```shell
dotnet add package AiDotNet.Native.CLBlast
```

Performance: 10x+ faster for large matrix operations on GPU.
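Since CLBlast works across GPU vendors, a reasonable pattern is to consult `PlatformDetector.Capabilities` at startup and only take the GPU path when both OpenCL and CLBlast are present. A sketch using the capability flags from the detection example (the selection logic is illustrative, not library API):

```csharp
using System;
using AiDotNet.Tensors.Engines;

var caps = PlatformDetector.Capabilities;

// Prefer GPU BLAS when an OpenCL device and CLBlast are both available;
// otherwise fall back to CPU SIMD, which the core package always provides.
string backend = caps.HasOpenCLSupport && caps.HasClBlast
    ? "CLBlast (OpenCL GPU)"
    : "CPU SIMD";
Console.WriteLine($"Selected backend: {backend}");
```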
Provides GPU acceleration via NVIDIA CUDA (NVIDIA GPUs only):
```shell
dotnet add package AiDotNet.Native.CUDA
```

Performance: 30,000+ GFLOPS for matrix operations on modern NVIDIA GPUs.

Requirements: an NVIDIA GPU with a compatible CUDA driver installed.
Usage with helpful error messages:
```csharp
using AiDotNet.Tensors.Engines.DirectGpu.CUDA;

// Recommended: throws a beginner-friendly exception if CUDA is unavailable
using var cuda = CudaBackend.CreateOrThrow();

// Or check availability first
if (CudaBackend.IsCudaAvailable)
{
    using var backend = new CudaBackend();
    // Use CUDA acceleration
}
```

If CUDA is not available, you'll get detailed troubleshooting steps explaining exactly what's missing and how to fix it.
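A common deployment pattern is to try CUDA and fall back gracefully, building on the `IsCudaAvailable` check shown above (the fallback branch is illustrative; the core CPU path needs no backend object):

```csharp
using System;
using AiDotNet.Tensors.Engines.DirectGpu.CUDA;

// Prefer CUDA when present; otherwise stay on the CPU SIMD path.
string mode;
if (CudaBackend.IsCudaAvailable)
{
    using var cuda = new CudaBackend();
    mode = "CUDA";
    // ... dispatch work to the GPU backend here ...
}
else
{
    mode = "CPU SIMD";
}
Console.WriteLine($"Running on: {mode}");
```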
Apache 2.0 - See LICENSE for details.
Contributions are welcome! Please feel free to submit a Pull Request.