Big news!!! 🎊 Redpanda has acquired the popular stream processing and connectivity framework Benthos. That means with our new Redpanda Connect we're now the most comprehensive end-to-end streaming data platform! 🔗 Gone are the days of suffering through multiple projects that manually stitch data from your streams to your data warehouse or operational database. Read what our founder Alexander Gallego has to say about all that's new at Redpanda. 👇 https://lnkd.in/gpUfGYgs
Redpanda Data
Software Development
San Francisco, CA 13,608 followers
Redpanda is a simple, powerful, cost-efficient, and Kafka® API compatible platform that eliminates Kafka complexity.
About us
Redpanda is a simple, powerful, and cost-efficient streaming data platform written in C++. Fully compatible with Kafka® APIs. None of the Kafka complexity.
- Website
-
https://redpanda.com
External link for Redpanda Data
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Francisco, CA
- Type
- Privately Held
- Founded
- 2019
Locations
-
Primary
San Francisco, CA 94118, US
Employees at Redpanda Data
Updates
-
Redpanda Data reposted this
Hybrid search is a powerful search method that combines the precision of symbolic (keyword) search with the contextual understanding of neural(vector) search. Keyword search relies on traditional keyword matching and Boolean logic to deliver exact matches, while vector search utilizes deep learning models to grasp the semantic meaning and context of queries. By merging these two approaches, hybrid search enhances the accuracy and relevance of search results, offering users a more comprehensive and effective way to find information. For example, consider a user typing this search query “How to train a puppy to sit?” The keyword search looks for documents containing the exact words “train,” “puppy,” and “sit.” It uses operators like AND, OR, and NOT to refine the search results. The results are based on the presence of these keywords in the documents. It could yield results like: - “Training a Puppy to Sit: Step-by-Step Guide” - “Best Methods to Train Your Puppy” - “How to Teach a Puppy to Sit and Stay” The vector search interprets the query’s intent, understanding that “train a puppy to sit” means teaching a dog the sit command. It looks for documents that discuss related concepts, even if they don’t contain the exact keywords. The search leverages models trained on vast amounts of text to understand context and meaning. It could yield results like: - “Step-by-Step Puppy Training Techniques” - “Effective Ways to Teach Commands to Your Dog” - “Best Practices for Puppy Obedience Training” Ultimately, hybrid search merges keyword precision with contextual understanding and uses both keyword matching and semantic analysis to fetch the most relevant documents. It also ranks results based on both exact keyword presence and contextual relevance. This fusion leverages the strengths of both methods, resulting in an improved search experience that addresses the limitations of each individual approach. Special thanks go to Mark Needham for the idea and the feedback. #search #hybridsearch #vectorsearch #keywordsearch #fulltextsearch #sketchnote
-
🎓 JULY MASTERCLASS: Day 2 Operations with Redpanda Customers like Zafin describe our Day 2 ops as "dead simple." This interactive two-hour masterclass will show you exactly why 👀 Join our Solution Architects on: 🗓 Tuesday, July 30th 🕘 9 am PT | 12 pm ET | 5 pm BST Can't make it? Sign up anyway and we'll send you the recording after the session👇 https://lnkd.in/dEm-pw4G
-
Redpanda Data reposted this
📢 Announcing Our Latest Blog Post 📢 'Real-time Data Platform with Redpanda Data, Bytewax, Arrow, and ClickHouse.' Discover how we utilize Arrow to simplify and improve streaming workloads through efficient micro-batch compression. Inspired by Chris Comeau's innovative work, we've outlined a complete streaming architecture leveraging the power of Kafka and Bytewax. 🔍 Key Highlights: - Understanding Arrow and IPC serialization format - Bridging batch and stream data seamlessly - Exploring event-driven vs. analytical streaming patterns - Building an efficient, real-time data pipeline Check out the full blog here 🌟 https://lnkd.in/eU99WxDt
-
We love seeing all the ways the #streamingdata community uses Redpanda, and this tutorial by Parseable is a winner 🤩 Change Data Capture (CDC) is a technique to track changes in a #database and capture them in destination systems. #Debezium uses database logs as the source of truth and streams the changes to systems like #Kafka — or a simpler alternative like yours truly 🤘 Check out their tutorial to learn how to use Redpanda, #PostgreSQL, and #Debezium to seamlessly ingest CDC events into Parseable. https://lnkd.in/gzT27Afx
How to set up a CDC pipeline to capture and analyze real-time database changes with Parseable | Parseable
parseable.com
-
🚀 The Live Data Stack was a blast! We tapped into some of the brightest minds in the industry for personal insights into global topics. The result? A session brimming with inspiring stories, practical tips, and a tour of the unmistakable power of real-time #data 🔥 It's impossible to pick just one nugget of wisdom from the event, but we think Shubham Dhal from ShareChat perfectly summarized the takeaway. If you missed the event, don't worry, we recorded it for you! Watch here👇 https://lnkd.in/ggcyhe4a
-
Redpanda is all about doing more with less, so it makes sense to integrate with easy-to-use tech like Quix, which lets #developers use the entire #Python ecosystem to build stream processing pipelines with fewer lines of code. In this tutorial, Merlin Carter walks through three simple steps to build a #streamprocessing application using the Quix Streams Python library with Redpanda 🐾 https://lnkd.in/gEVsEuaT
Stream processing in Python with Redpanda and Quix
redpanda-data.medium.com
-
🎥 TUNE IN LIVE: Emergence of the Live Data Stack Join us for a live virtual event hosted by Joe Reis, co-author of The Fundamentals of Data Engineering, and Alex Gallego, founder and CEO of streaming data pioneer Redpanda.
Emergence of the Live Data Stack
www.linkedin.com
-
Redpanda Data reposted this
Author | Data Engineer and Architect | Recovering Data Scientist ™ | Global Keynote Speaker | Professor | Podcaster & Writer | Advisor & Investor
One hour to go until 🚀 Alexander Gallego and I host Redpanda Data’s Emergence of the Live Data Stack! We will discuss the next generation of data intensive applications and how to move from batch to real-time. You also get to hear from special guests from ShareChat and The Hotels Network. Data doesn’t sleep. Join today. Register here: https://lnkd.in/gZJQVPgN
Redpanda Webinar - Emergence of the Live Data Stack
go.redpanda.com
-
Redpanda Data reposted this
We are excited to announce that Shubham Dhal, one of our tech engineers, will speak at Redpanda’s events on the Emergence of Live Data Stack on July 17th at 9:30 PM IST / 9 AM PST. It is a perfect opportunity to learn how leading tech companies like ShareChat leverage streaming systems at scale to power their ML Pipelines and Infra. Also, learn how real-time processing replaces batch with SLAs moving from days/hours to seconds! Be part of the event: https://bit.ly/3Y6XEnH. Redpanda Data #Redpanda #ML #AI #Data #streaming