Neocloud

Meet Together AI

Powering open-source AI at unmatched speed and scale

AREAS OF FOCUS
  • AI/ML
  • GPU Cloud
  • Neocloud
  • GPU Acceleration
Region
  • Global
Customer Link

Together AI is advancing open-source artificial intelligence by providing high-performance infrastructure and tools for developers at scale. To support the rapid growth and development of its industry-leading inference engine, Together AI uses the WEKA® Data Platform to reduce latency and deliver ultra-fast data access and scalable performance across its massive GPU-powered cloud environment that serves a community of over 500,000 AI developers today.

“At Together AI, we are obsessed with speed and efficiency. That’s why we built the Together Inference Engine that provides the fastest inference speeds in the industry. We are excited to leverage WEKA’s Augmented Memory Grid capability to reduce the time involved in prompt caching and improve the flexibility of leveraging this cache across multiple nodes— reducing latency and benefitting the more than 500,000 AI developers building on Together AI.”

Ce Zhang, Chief Technology Officer at Together AI.