Readme

Atomic.Ring.CPU.SCU

Logo

High‑performance, Lock‑free, Thread‑safe and timing-stable counter that absolutely eliminates CPU 'false sharing'
13.8x faster than 'Interlocked'

Solved Problem: CPU 'False Sharing'

Modern CPUs store data in small fixed‑size chunks called cache lines (typically 64 bytes). When one core modifies a cache line, that line must be invalidated on all other cores – an expensive operation.
False sharing happens when multiple threads modify different variables that happen to reside on the same cache line. The CPU is forced to constantly bounce that line between cores, even though the threads are using unrelated data. This can cripple performance in highly concurrent code.
This solution eliminate false sharing by placing each counter on its own dedicated cache line inside a ring buffer. Threads are automatically assigned to distinct slots, guaranteeing that no two threads ever compete for the same line. The result is a lock‑free counter that scales linearly with the number of cores.

Nuget

multi-target package:
✅ .net7.0
✅ .net8.0
✅ .net9.0

https://www.nuget.org/packages/Atomic.Ring.CPU.SCU

dotnet add package Atomic.Ring.CPU.SCU

NuGet\Install-Package Atomic.Ring.CPU.SCU

Features

Two implementations:
- UnsafeAtomicCounter – uses aligned native memory (NativeMemory.AlignedAlloc) to completely eliminate CPU 'false sharing'
  ✅ Timing‑stable – performance is consistently fast even under extreme contention
  ⚠️ Implements (native memory freed via ), but Dispose is not very required because of SafeHandle specific

Method	Expansion	Threads	Mean (μs)	vs Simple
`SimpleCounter` (baseline)	0(not used)	64	166 592	1.00x
`UnsafeAtomicCounter`	0	64	18 597	8.96x faster
`UnsafeAtomicCounter`	4	64	12 053	13.8x faster
`ManagedAtomicCounter64` (via factory, base class)	0	64	41 067	4.06x faster
`ManagedAtomicCounter64` (concrete type)	0	64	41 010	4.06x faster
`ManagedAtomicCounter64` (concrete type)	4	64	12 473	13.4x faster

Test	Time
`TestSimpleCounter`	904 ms
`TestAtomicCounter` (`UnsafeAtomicCounter`)	63 ms
`SimplifiedAtomicCounterWithSpecificType` (concrete)	65 ms
`SimplifiedAtomicCounterWithoutSpecificType` (base class)	67 ms

VSapozhnikov/Atomic.Ring.CPU.SCUv1.0.0.1

Get Started

Readme

Atomic.Ring.CPU.SCU

Solved Problem: CPU 'False Sharing'

Nuget

Features

Performance

Selected results (full data in Benchmarks.CounterBenchmark-report.html )

Unit test example (single run, Release mode)

Usage

Unsafe version (recommended for maximum performance)

Managed version (choose the right cache‑line size)

Adjust ring size

Manual indexing (only if you really need it)

Which one should I choose?

License

Maintainers