Cartesia

Cartesia

Software Development

Real-time multimodal intelligence, on a device near you.

About us

Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Try Sonic at https://play.cartesia.ai and join our Discord at https://discord.com/invite/gAbbHgdyQM.

Website
https://cartesia.ai
Industry
Software Development
Company size
11-50 employees
Type
Privately Held
Founded
2023

Employees at Cartesia

Updates

  • View organization page for Cartesia, graphic

    2,236 followers

    Today, we’re unveiling a significant milestone in our journey toward ubiquitous artificial intelligence: On-Device. Our team pioneered a radically more efficient architecture for AI with state space models (SSMs). Now, we’ve optimized and deployed them at the edge. We believe the future of AI runs on your device, where it can process continuously and is reliable, private and secure. Read our full blog post here: cartesia.ai/blog/on-device As part of this milestone, we’re announcing Edge, Rene, and Sonic On-Device. 📏 Edge: an open-source library for on-device applications across all environments, starting with Apple hardware. 🐍 Rene: an open-source 1.3B parameter language model with a hybrid Mamba-2 SSM backbone, tailored to on-device inference. Now on Edge. 📲 Sonic On-Device: The Sonic model you know and love, running on-device in private beta. It’s the first ultra-realistic voice model of its kind to support real-time streaming on devices. We’re excited to keep building for the next generation of hardware-optimized use cases, allowing any device to benefit from real-time, multimodal AI. If you're interested in partnering with us on any of our On-Device initiatives, please reach out via our form at bit.ly/cartesiaondevice.

    The On‑Device Intelligence Update

    The On‑Device Intelligence Update

    cartesia.ai

  • View organization page for Cartesia, graphic

    2,236 followers

    🗝 We’re excited to announce that Cartesia is now SOC 2 Type 2 Certified with no exceptions! 🗝 We are committed to safeguarding our users’ data and privacy, and this certification is another step towards ensuring that we provide secure, reliable and trustworthy services to all our customers. 🔐

    View organization page for Johanson Group LLP, graphic

    2,065 followers

    Major kudos to Cartesia for hitting the mark on SOC 2 compliance! 🎉 It's a testament to their unwavering commitment to security and transparency. With the SOC 2 report, they're showcasing their strong controls and processes, earning trust and respect. Bravo to the team! 🌟 #SOC2 #Compliance #Security #Trust

    • No alternative text description for this image
  • View organization page for Cartesia, graphic

    2,236 followers

    We're thrilled to welcome Ivory Tang our team, who will be focused on growth strategies and developer engagement. 🚀 Ivory brings a unique blend of AI expertise and business acumen to our organization. As the founder of ScaleConvo (YCW24), a customer support AI agent, she has hands-on experience with leading voice providers in the industry. Her impressive background includes roles in consulting at BCG and technical stints at Jane Street and HubSpot. Ivory graduated from MIT where she studied Mathematics, Computer Science, and Business. During her time there, she also conducted research at the Computer Science and Artificial Intelligence Laboratory (CSAIL). On joining our team, Ivory shared: "Having spent over a year in the voice AI agent space, I've tried every existing provider. Cartesia stood out as the only one that successfully blended a research-focused approach with top-tier voice quality and low latency." We're excited about the growth initiatives Ivory will be spearheading. Stay tuned for updates 👀

  • View organization page for Cartesia, graphic

    2,236 followers

    🎉 Congrats to the team at toby for earning a top spot on Product Hunt with their recent launch! 🎉 We’re excited to share how we’re supporting them on their mission to empower global workforces with live speech translation on every video call. Toby chose to launch on Sonic because of our: 🚀 Industry-leading latency: Sub-100ms across ALL languages 💬 Multilingual Support: Consistent quality and latency across major languages 👄 Localization Control:  Match voices to local accents Read more of their story here 👇

    toby partners with Cartesia to eliminate language barriers in the workforce

    toby partners with Cartesia to eliminate language barriers in the workforce

    cartesia.ai

  • View organization page for Cartesia, graphic

    2,236 followers

    Aviv Bick, one of Professor Albert Gu's students and a summer intern here at Cartesia, co-authored “Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models,” outlining a new way to distill quadratic Transformers into more efficient, subquadratic architectures like Mamba. By leveraging pretrained Transformers, they achieved state-of-the-art Mamba models using 1% of the compute! The key insight here was to project the (quadratic) attention matrices onto (structured) SSM matrix mixers before end-to-end training. Congrats to Aviv and his co-authors on this new publication! Blog ✍: https://lnkd.in/gwmRpDdG Paper 📜 : https://lnkd.in/gESrWSei

    • No alternative text description for this image
  • View organization page for Cartesia, graphic

    2,236 followers

    Congrats to the team Daily on launching Daily Bots 🎉 We're thankful to get to work with them as their default voice provider. Developers love Pipecat for its: 🏃♂️ ultra-low latency—perfect for real-time applications 💡 rich example library that enables the creation of AI agents in minutes 🚿 clean abstractions Plus, with the recent release of the open standard for Real-time Voice and Video Inference (RTVI-AI), Pipecat continues to set the pace in the industry. We're lucky to get to work with Kwindla Hultman Kramer and his team in pushing the boundaries of multimodal AI.

    View organization page for Daily, graphic

    2,917 followers

    Today we’re launching Daily Bots, the ultra low latency Open Source cloud for voice, vision, and video AI. Build voice and video AI bots, with any LLM, at conversational latencies as low as 500ms. Developers can: ✳️ build with Open Source SDKs ✳️ mix and match the best Generative AI models for specific use cases ✳️ run at scale on Daily’s real-time global infrastructure We've partnered with Anthropic Cartesia Deepgram Together AI for this launch. Our goal is to combine the best tools, best developer ergonomics, and best infrastructure for real-time AI into a single platform. Daily Bots is the culmination of the last 18 months of work we've done with customers and partners. The two fastest growing Open Source real-time AI projects came out of this work: Pipecat and RTVI. *️⃣ Daily Bots apps are built using the RTVI Open Source SDKs for the Web, iOS, and Android. Your Daily Bots code will run anywhere that supports the RTVI standard. (Or you can run your own infrastructure.) *️⃣ Your Daily Bots can also answer the phone. (You can buy a phone number from us with a single curl command.) *️⃣ Bring your own API Keys and use any inference provider that supports OpenAI-compatible APIs. Or run your own models and connect your bots to your infrastructure. For a highly non-serious take on the kinds of things we’ve been creating for ourselves, as we’ve worked on Daily Bots, check out a fun demo video 👇 and more links in thread. We'd love to hear what multi-modal, real-time AI directions are most interesting to you, and excited to support developers building! #conversationalAI #voicetovoice #ai #claude #llama #generativeai

  • View organization page for Cartesia, graphic

    2,236 followers

    We’re excited to welcome Brandon Wang to our technical staff 🎉 . Brandon recently graduated from MIT where he studied computer science, molecular biology, and math. While at MIT, Brandon was an active researcher in the Computer Science and Artificial Intelligence Library (CSAIL), and published a paper on Constant-Time Routability that was featured in the 2024 Symposium on Theory of Computing. Brandon also previously interned at Jane Street and was a 2019 International Math Olympiad Gold Medalist. “I heard about Cartesia through a college friend who loved his intern experience here. I’m excited to work on challenges like making AI more accessible with a really great group of people,” he said. You can read Brandon's most recent paper below 👇

    Lenzen’s Distributed Routing Generalized: A Full Characterization of Constant-Time Routability | Proceedings of the 56th Annual ACM Symposium on Theory of Computing

    Lenzen’s Distributed Routing Generalized: A Full Characterization of Constant-Time Routability | Proceedings of the 56th Annual ACM Symposium on Theory of Computing

    dl.acm.org

  • View organization page for Cartesia, graphic

    2,236 followers

    Excited to discuss all things real time AI with our friends Pilot.com!

    View organization page for Pilot.com, graphic

    14,373 followers

    📢 Calling all SF founders! Our Founders & Funders AI series continues this September with the one and only Karan Goel, CEO and Co-Founder at Cartesia! Join us to discuss the latest happenings in AI and connect with fellow founders over drinks and pizza 🥤🍕 Link to attend in the comments. See you there!

    • No alternative text description for this image
  • View organization page for Cartesia, graphic

    2,236 followers

    Super excited to partner with Tavus and power the fastest conversational video experience with Sonic. We believe the next generation of real-time, multimodal AI experiences are going to change everything. Try talking to Carter, it's responsive and super fast!

    View organization page for Tavus, graphic

    7,854 followers

    Today, we’re thrilled to introduce the world’s fastest Conversational Video Interface for developers. Build rich, real-time video experiences with digital twins that can speak, see, and hear. ⏱️ Less than one second of latency 🤖 Realistic, intelligent digital twins  🔌 Plug and play end-to-end building blocks 🏷️ Fully white-labeled tech 🧩 Modular, customizable components like LLM and TTS See the magic for yourself! Try talking to Carter on our live demo: www.tavus.io And if you like what you see, support us on Product Hunt https://lnkd.in/gieE-kww We’re excited to see the amazing products y’all build with our APIs. 

Similar pages