MosaicML Selects Oracle Cloud Infrastructure to Accelerate the Training of AI Models for Businesses

MosaicML, a software development provider that offers infrastructure and tools for building large-scale machine learning models, selected Oracle Cloud Infrastructure (OCI) as its preferred cloud infrastructure to help enterprises extract more value from their data.

With OCI’s high-performance AI infrastructure, MosaicML states that it has seen up to 50% faster performance and cost savings of up to 80% compared to other cloud providers.

“Hundreds of organizations rely on MosaicML’s platform to develop and train large, complex generative AI models. We provide the complex systems and hardware so our customers can focus on building and deploying their own high-performing custom models,” said Naveen Rao, CEO and co-founder, MosaicML. “We selected OCI as we believe it is the best foundation for MosaicML. When training models with massive troves of data in the cloud, every minute counts—and with OCI, we pay less than with other cloud providers and can scale almost linearly because of the way Oracle configured its interconnects.”

MosaicML's model training capabilities help organizations make AI model training and inference more efficient and accessible.

To scale its business and support the growing demand for AI services, MosaicML selected OCI. With OCI, MosaicML has gained access to the latest NVIDIA GPUs, a very high-bandwidth interconnect between nodes, and large compute block sizes for scaling to thousands of GPUs. This has enabled MosaicML to help enterprises and startups, including Twelve Labs, operationalize AI models.

OCI offers several AI capabilities, including dedicated AI infrastructure. OCI Compute virtual machines and bare metal GPU instances can power applications for computer vision, natural language processing, recommendation systems, and more.

“We are seeing an influx of AI companies come to OCI to run generative AI models, because we can run them faster and more economically than other cloud providers. It is not uncommon to train a 10 billion-parameter model within a few hours on OCI versus a few days on other platforms,” said Greg Pavlik, senior vice president, Oracle. “OCI’s architecture and non-blocking, low latency network design is fundamentally different than anything on the market.”
