Groq’s Post

View organization page for Groq, graphic

68,427 followers

Tune in remotely at 8:00 PT on July 10 for a TPC seminar by Valentin Reis. Learn how Groq co-designed a compilation-based software stack and a class of accelerators called LPUs, with high utilization and low end-to-end system latency. He’ll review the challenges of breaking models apart over networks of LPUs, and outline how this hardware/software system architecture keeps enabling breakthrough LLM inference latency at all model sizes. https://lnkd.in/gdQBuS7U

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics