Key takeaways
- AWS AI Factories deliver dedicated infrastructure combining the latest NVIDIA accelerated computing platform, Trainium chips, AWS AI services, and AWS high-speed, low-latency networking.
- Customers can leverage their existing data center space, network connectivity, and power while AWS handles the complexity of deployment and management of the integrated infrastructure.
- AWS AI Factories help enterprises and public sector organizations meet their data sovereignty and regulatory requirements, with accelerated deployment timelines.
As governments and large organizations seek to scale AI projects, some are turning to the concept of an “AI factory” to address their unique sovereignty and compliance needs. But building a high-performance AI factory requires a comprehensive set of management, database, storage, and security services, a level of complexity that few customers want to take on themselves. To address this need, today we announced AWS AI Factories, a new offering that provides enterprises and governments with dedicated AWS AI infrastructure deployed in their own data centers.
AWS AI Factories combine the latest AI accelerators, including cutting-edge NVIDIA AI computing and Trainium chips, AWS high-speed, low-latency networking, high-performance storage and databases, security, and energy-efficient infrastructure, together with comprehensive AI services like Amazon Bedrock and SageMaker AI so customers can rapidly develop and deploy AI applications at scale.
Organizations in regulated industries and the public sector face a critical AI infrastructure challenge in getting their large-scale AI projects deployed. Building their own AI capabilities requires massive capital investments in GPUs, data centers, and power, plus navigating complex procurement cycles, selecting the right AI model for their use case, and licensing models from different AI providers. This creates multi-year timelines and operational complexity that diverts focus from their core business goals.
AWS AI Factories address this challenge by deploying dedicated AWS AI infrastructure in customers’ own data centers, operated exclusively for them. AWS AI Factories operate like a private AWS Region that gives secure, low-latency access to compute, storage, database, and AI services. This approach lets you leverage existing data center space and power capacity you’ve already acquired and gives access to AWS AI infrastructure and services, from the latest AI chips for training and inference to tools for building, training, and deploying AI models. It also provides managed services that offer access to leading foundation models without having to negotiate separate contracts with model providers, all while helping you meet security, data sovereignty, and regulatory requirements for where data is processed and stored.
Leveraging nearly two decades of cloud leadership and unmatched experience in architecting large-scale AI systems, we are able to deploy secure, reliable AI infrastructure faster than most organizations can on their own, saving years of buildout effort and managing operational complexity.
AWS and NVIDIA expand collaboration to accelerate customer AI infrastructure deployments
The relationship between AWS and NVIDIA goes back 15 years, to when we launched the world’s first GPU cloud instance, and today we offer the widest range of GPU solutions for customers. Building on our longstanding collaboration to deliver advanced AI infrastructure, AWS and NVIDIA make it possible for customers to build and run large language models faster, at scale, and more securely than anywhere else, now in your own data centers. With the NVIDIA-AWS AI Factories integration, AWS customers have seamless access to the NVIDIA accelerated computing platform, full-stack NVIDIA AI software, and thousands of GPU-accelerated applications to deliver high performance, efficiency, and scalability for building next-generation AI solutions. We continue to bring the best of our technologies together. The AWS Nitro System, Elastic Fabric Adapter (EFA) petabit-scale networking, and Amazon EC2 UltraClusters support the latest NVIDIA Grace Blackwell and the next-generation NVIDIA Vera Rubin platforms. In the future, AWS will support NVIDIA NVLink Fusion high-speed chip interconnect technology in next-generation Trainium4 and Graviton chips, and in the Nitro System. This integration makes it possible for customers to accelerate time to market and achieve better performance.
“Large-scale AI requires a full-stack approach, from advanced GPUs and networking to software and services that optimize every layer of the data center. Together with AWS, we’re delivering all of this directly into customers’ environments,” said Ian Buck, vice president and general manager of Hyperscale and HPC at NVIDIA. “By combining NVIDIA’s latest Grace Blackwell and Vera Rubin architectures with AWS’s secure, high-performance infrastructure and AI software stack, AWS AI Factories allow organizations to stand up powerful AI capabilities in a fraction of the time and focus entirely on innovation instead of integration.”
Helping the public sector accelerate AI adoption
AWS AI Factories are built to meet AWS's rigorous security standards, providing governments with the confidence to run their most sensitive workloads across all classification levels: Unclassified, Sensitive, Secret, and Top Secret. AWS AI Factories will also provide governments around the world with the availability, reliability, security, and control they need to help their own economies advance and take advantage of the benefits of AI technologies.
AWS and NVIDIA are collaborating on a strategic partnership with HUMAIN, the global company based in Saudi Arabia building full-stack AI capabilities, with AWS building a first-of-its-kind "AI Zone" in Saudi Arabia featuring up to 150,000 AI chips, including GB300 GPUs, dedicated AWS AI infrastructure, and AWS AI services, all within a HUMAIN purpose-built data center. “The AI factory AWS is building in our new AI Zone represents the beginning of a multi-gigawatt journey for HUMAIN and AWS. From inception, this infrastructure has been engineered to serve both the accelerating local and global demand for AI compute,” said Tareq Amin, CEO of HUMAIN. “What truly sets this partnership apart is the scale of our ambition and the innovation in how we work together. We chose AWS because of their experience building infrastructure at scale, enterprise-grade reliability, breadth of AI capabilities, and depth of commitment to the region. Through a shared commitment to global market expansion, we are creating an ecosystem that will shape the future of how AI ideas can be built, deployed, and scaled for the whole world.”









