Skip to content
View ashvardanian's full-sized avatar
πŸ’­
Less Slow
πŸ’­
Less Slow

Sponsoring

@martinus
@jart
@danmar
@jedisct1
@nlohmann
@networkx
@python
@skypjack
@sharkdp
@ETLCPP
@mrousavy
@maxbachmann

Highlights

  • Pro

Block or report ashvardanian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ashvardanian/README.md

Hey, I'm Ash!

I am best known for designing search technologies deployed on hundreds of millions of devices, which power some of the largest tech companies, unicorns and decacorns, AI research labs, and defense and intelligence organizations worldwide. In short:

  • CS/AI researcher.
  • Astrophysics dropout.
  • Ex Investor: cloud & semiconductors.
  • Fluent in English, Russian, & Armenian.
  • Building the Unum AI infra project since 2015.

I spent most of the last 20 years writing code. Mostly GPGPU and SIMD Assembly. Prefer spaces over tabs, east-const, and procedural programming over object-oriented or functional. Abstractions are evil. Assembly is nice. If you want to get in touch and contribute - my handle is @ashvardanian on most platforms, including Twitter and LinkedIn.

USearch Python installs UCall Python installs SimSIMD Python installs StringZilla Python installs GitHub Org's stars: unum-cloud GitHub User's stars: ashvardanian HackerNews User Karma

Repositories

I've designed and maintain the following libraries, datasets, and AI models:

  • StringZilla - faster strings with SIMD/SWAR for C, C++, Rust, Python, and Swift, replacing GlibC and STL
  • USearch - single-file search engine for C, C++, Rust, Swift, Go, Java, C#, Python, JS..., replacing FAISS
  • UForm - small multimodal pre-trained AI models with SDKs for Python, JS, and Swift, replacing CLIP
  • UCall - networking server library for C and Python designed for Linux kernel bypass, replacing FastAPI
  • SimSIMD - fast vector-vector math library for C, Python, Rust, and JavaScript, replacing BLAS level 1

Micro repositories:

  • USearch-Molecules - 28 billion embeddings - one of the largest datasets in Computational Chemistry
  • UStore - multimodal embedded database for C, C++, and Python designed around key-value stores
  • SwiftSemanticSearch - example of on-device real-time AI using UForm and USearch on iOS
  • cpp-cuda-python-starter-kit - C++ and CUDA starter kit for Python developers
  • TinySemVer - Semantic versioning GitHub CI tool that doesn't take 300K lines of JavaScript
  • affine-gaps - Less wrong local and global Gotoh sequence alignments in one NumBa Python file
  • ParallelReductionsBenchmark - GPGPU benchmarks for SyCL, CUDA, OpenCL, Vulkan, etc.
  • BenchmarkingTutorial - C/C++ tutorial for performance-oriented programming using Google Benchmark
  • memchr_vs_stringzilla - Rust micro-benchmark comparing StringZilla to the MemChr crate
  • usearch-benchmarks - Billion-scale Vector Search benchmarks against FAISS, Weaviate, Qdrant, etc.
  • ucsb - parallel benchmarks for ACID persistent key-value stores, like RocksDB
  • scaling-democracy - combinatorial voting method implemented in CUDA for H100 GPUs

Materials

Cherry picks:

Pinned Loading

  1. unum-cloud/usearch unum-cloud/usearch Public

    Fast Open-Source Search & Clustering engine Γ— for Vectors & πŸ”œ Strings Γ— in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram πŸ”

    C++ 2.1k 121

  2. StringZilla StringZilla Public

    Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores,…

    C++ 2k 67

  3. unum-cloud/ucall unum-cloud/ucall Public

    Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️

    C 1.1k 39

  4. unum-cloud/uform unum-cloud/uform Public

    Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and πŸ”œ video, up to 5x faster than OpenAI CLIP and LLaVA πŸ–ΌοΈ & πŸ–‹οΈ

    Python 1k 60

  5. SimSIMD SimSIMD Public

    Up to 200x Faster Dot Products & Similarity Metrics β€” for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, & SVE πŸ“

    C 854 43

  6. cpp-cuda-python-starter-kit cpp-cuda-python-starter-kit Public template

    Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11

    Cuda 15 1