- Shanghai
-
00:11
(UTC +08:00)
Lists (29)
Sort Name ascending (A-Z)
benchmark
chaos
ChatGPT
CICD
CLI
cloud
cloud native
database
debug
DL
embedding
FE
fun
golang
Infa
IR
information retrieveLLM
Math
ML
Ml4Comm.
operation system
Python
RAG
Rust
SD
๐พ storage
system desgin
Vector Search
web framework
- All languages
- Arduino
- C
- C#
- C++
- CMake
- CSS
- CoffeeScript
- Cuda
- Dockerfile
- Elixir
- Emacs Lisp
- Go
- HTML
- JSON
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Lua
- M
- MATLAB
- MDX
- Makefile
- Markdown
- Mustache
- OpenEdge ABL
- PHP
- PureBasic
- Python
- R
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Smarty
- Svelte
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
- Zig
Starred repositories
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Cโฆ
RAGChecker: A Fine-grained Framework For Diagnosing RAG
This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation
๐บ๏ธ Data Cleaning and Textual Data Visualization ๐บ๏ธ
Empowering RAG with a memory-based data interface for all-purpose applications!
Qdrant integration with BEIR, simplifying quality checks on standard datasets
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
A Comprehensive Benchmark for Code Information Retrieval.
AAAI-24 CFEVER: A Chinese Fact Extraction and VERification Dataset
StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation
This is the official code for the EMNLP 2023 paper "GLEN: Generative Retrieval via Lexical Index Learning".
Recipes for shrinking, optimizing, customizing cutting edge vision models. ๐
Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage.
Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
An open-source RAG-based tool for chatting with your documents.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, leโฆ
Awesome utilities for performance profiling
Data Preview ๐ธ extension for importing ๐ค viewing ๐ slicing ๐ช dicing ๐ฒ charting ๐ & exporting ๐ฅ large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
AdalFlow: The โPyTorchโ library to auto-optimize any LLM tasks.
vsag is a vector indexing library used for similarity search.
Up to 200x Faster Dot Products & Similarity Metrics โ for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, โฆ
pytest plugin that allows tagging tests using arbitrary strings
A cloud native embedded storage engine built on object storage.