Zhe Zhang’s Post

View profile for Zhe Zhang, graphic

Building the future of AI infra. Apache Software Foundation Member; Former Head of Open Source (Ray) + Head of Field Engineering @ Anyscale.

If you are building #RAG applications, the first step is to create embeddings from your data. Checkout how you can do this with #Ray and Anyscale at scale and with unprecedented efficiency (90% cost reduction 🤯 vs. #OpenAI embedding API). If you are interested in trying this, please fill out https://lnkd.in/gDnvqZvR This is achieved through Ray's *unified* support for all steps of the pipeline with full *flexibility* on hardware / ML model for each operation (e.g. a mix of CPU, A10, and A100). 🚀 https://lnkd.in/gFeZ5akX (bit.ly/rag-embedding) Great collaboration with Pinecone team (check out their major launch https://lnkd.in/guYY7KPF!). Nathan Cordeiro Ram Sriharsha Roy Miara ❤️

RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone

RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone

anyscale.com

Zhe Zhang

Building the future of AI infra. Apache Software Foundation Member; Former Head of Open Source (Ray) + Head of Field Engineering @ Anyscale.

7mo

If you have an embedding workload and are interested in trying this out, please fill https://forms.gle/gSQrj6XDAVQGkRvk8 Thanks!

Rohan Paul

Bridging the gap between AI research and practical applications. → Join my LLM Newsletter. AI Engineer and Entrepreneur (Ex Investment Banking)

7mo

Awesome - 10x cost reduction to generate the 1 billion embeddings using OpenAI is $60,000 vs $6000 with Anyscale

See more comments

To view or add a comment, sign in

Explore topics