VectorTA

Technical analysis library with 340 indicators, built in Rust + CUDA, with Python (PyO3) and WebAssembly bindings (SIMD128; AVX2/AVX-512 where beneficial)

0

Implemented

ALMA benchmark snapshot

cargo bench results for VectorTA’s ALMA indicator, comparing scalar, AVX2, and AVX-512 performance on 10k candles, plus AVX-512 versus CUDA on a roughly 250 million-operation batch workload.

ALMA on CPU (10k candles)

ALMA batch: AVX-512 vs CUDA (~250M ops)

Benchmarks shown were run on an AMD 9950X CPU and NVIDIA RTX 4090 GPU.