WEKA for AWS

The fastest, most scalable storage solution on AWS.

Win the AI Race with Optimized AWS Infrastructure

Accelerate Distributed Model Training

WEKA provides a high-performance data platform for SageMaker HyperPod, optimized for every phase of foundation model (FM) training: data loading, pre-processing, model training, checkpointing, verification, tuning, and data set archiving.

Increase AI Developer Productivity

WEKA software speeds up model data loading by 50%, shortening model training times and improving data scientist and developer productivity.

Increase Cluster Resilience

WEKA reduces FM checkpoint times by 90%, enabling faster training and increasing the resilience of HyperPod deployments.

Increase GPU Cluster Utilization

High-performance storage from WEKA eliminates data bottlenecks, driving GPU infrastructure utilization above 90% (from an average of 30%) and ensuring your HyperPod infrastructure is never starved for training data.

Reduce epoch times from weeks to hours

Increase GPU utilization to 90%

Reduce manual DataOps by 64%

WEKA on AWS Solutions

Amazon SageMaker HyperPod

With WEKA support for Amazon SageMaker HyperPod, customers can build a high-performance data platform for distributed model training that scales massively, increases GPU infrastructure utilization, and reduces infrastructure costs.

Use AWS for more of your workloads

Accelerate AI and HPC applications on AWS

Deliver performance for your most demanding applications running in AWS with the world’s fastest cloud-native data platform, supporting high I/O, low latency, small files, and mixed workloads with zero tuning.

Seamlessly tier data in AWS

Intelligent tiering automatically moves data between high-performance flash-based storage on Amazon EC2 instances and low-cost, massively scalable object storage in Amazon S3, all in a single namespace for the best performance, scale, and economics.

Scale your data in AWS

Autoscaling enables you to add and remove high-performance storage capacity on the fly to meet the needs of your most demanding applications without paying for resources you don’t use.

Move data to AWS

Send snapshots of a filesystem to any Amazon S3 object store for backup and disaster recovery. Full and incremental snapshots include metadata to enable seamless data portability between on-prem and AWS.

Burst data analysis to AWS

Maintain a usable copy of your on-premises data in your AWS environment, where you can use elastic compute resources to run calculations and analyses and gain new insights.

Build agile data pipelines in AWS

Your researchers, data scientists, creative teams, and more can collaborate faster by using a single copy of data optimized to meet the performance needs of every step in your workflow.

How it works

WEKA is deployed on Amazon EC2 I3en instances with local NVMe storage to form a high-performance storage layer. The software extends the namespace to an Amazon S3 bucket for large-scale capacity at optimal cost. The entire data set is available to applications without the need to move or copy data. The namespace can extend from terabytes to petabytes. The same data can be accessed through multiple protocols: S3, NFS, SMB, and POSIX.

WEKA is in the AWS Builder Studio Melbourne

The AWS Builder Studio allows you to get hands-on with AWS technology while learning our unique methodologies to invent, honing your own culture of experimentation, and discovering the ‘Art of the Possible’ for your organization. Visit WEKA at the AWS Builder Studio Melbourne!

“We can now reach 93% GPU utilization when running our AI model training environment using WEKA on AWS.”
Richard Vencu, MLOps Lead, Stability AI

“With WEKA, we can migrate our entire data set virtually with the push of a button. We continued doing model training across two AWS Regions simultaneously over two weeks while we migrated the compute.”
Alex Balan, Technical Lead, Synthesia

“WEKA is purpose-built for the cloud. It provides an extendable file system that offers the scale and economics of cloud object stores and the speed of SSD.”
Gavin Burris, Senior IT Leader

“I looked at a lot of storage systems in AWS and chose WEKA because of the ability to tier to S3 storage for best cost given the volume of data we are creating.”
Samuel Reid, Head of Technology, Untold Studios

“We needed to replicate our on-premises high performance computing environment in the cloud to be more scalable and more agile. WEKA and AWS enabled us to deliver a highly performant and compatible cloud environment which researchers could tell the difference in operations. By using the WEKA and AWS, we’re able to deliver breakthrough results for our customers faster.”
Arnold De Leon, Operations Manager, 23andMe

“Getting WEKA up and running was so easy, so fast. It just felt so slick. But I wasn’t really expecting what I saw once we ran performance metrics. I’ve never seen anything like it. Without WEKA, my life would be much harder, and my job as a studio technologist would be incredibly difficult.”
Alan McSeveney, Head of Technology, Preymaker

“We are using the WEKA shared file system on AWS instead of Lustre for its stability and stellar support for our geospatial workflows. Using WEKA, demanding analyses that used to take 2-3 months, are completed in less than 2 weeks.”
Alessandro Menegaz, Cloud IT Manager, Tre Altamira