- Shanghai
-
19:45
(UTC +08:00)
Lists (29)
Sort Name ascending (A-Z)
benchmark
chaos
ChatGPT
CICD
CLI
cloud
cloud native
database
debug
DL
embedding
FE
fun
golang
Infa
IR
information retrieveLLM
Math
ML
Ml4Comm.
operation system
Python
RAG
Rust
SD
💾 storage
system desgin
Vector Search
web framework
- All languages
- Arduino
- C
- C#
- C++
- CMake
- CSS
- CoffeeScript
- Cuda
- Dockerfile
- Elixir
- Emacs Lisp
- Go
- HTML
- JSON
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Lua
- M
- MATLAB
- MDX
- Makefile
- Markdown
- Mustache
- OpenEdge ABL
- PHP
- PureBasic
- Python
- R
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Smarty
- Svelte
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
- Zig
Starred repositories
Tansu is an Apache Kafka API compatible broker written in async 🚀 Rust 🦀
Benchmark of Nearest Neighbor Search on High Dimensional Data
🕳 bore is a simple CLI tool for making tunnels to localhost
Run pytest on markdown code fence blocks
The fastest way to create an HTML app
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…
RAGChecker: A Fine-grained Framework For Diagnosing RAG
This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation
🗺️ Data Cleaning and Textual Data Visualization 🗺️
Empowering RAG with a memory-based data interface for all-purpose applications!
Qdrant integration with BEIR, simplifying quality checks on standard datasets
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.
A Comprehensive Benchmark for Code Information Retrieval.
AAAI-24 CFEVER: A Chinese Fact Extraction and VERification Dataset
StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation
This is the official code for the EMNLP 2023 paper "GLEN: Generative Retrieval via Lexical Index Learning".
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Python pathlib-style classes for cloud storage services such as Amazon S3, Azure Blob Storage, and Google Cloud Storage.
Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
An open-source RAG-based tool for chatting with your documents.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…