Page Inspect
Internal Links
123
External Links
51
Images
67
Headings
49
Page Content
Title:Designing & Building Data Center AI Infrastructure at Scale
Description:Penguin Solutions designs, builds, deploys, and manages large, complex Al and high-performance computing (HPC) infrastructures at scale.
HTML Size:775 KB
Markdown Size:23 KB
Fetched At:November 17, 2025
Page Structure
h1We Make AIPossible.Scalable.Powerful.Sustainable.Reliable.
h2Harness the Power ofAccelerated Computing
h325+
h389,000+
h33.3+ Billion
h2Customers TrustPenguin Solutions
h2Unmatched Expertise inIndustry-Specific Solutions
h2AI InfrastructureComprehensive Services
h2Precision Engineered forAccelerated Performance
h2OriginAI®
h2ICE ClusterWare™
h2Delivering NVIDIA DGX-Ready Managed Services
h2Stratus ztC Endurance™
h2Stratus ztC Edge™
h2Stratus everRun®
h2Introducing the New Family of CXL® Add-in-Cards (AICs)
h2Ultra-High Reliability Zefr ZDIMM Memory Modules
h2Next-Generation Data Center SSDs
h2Latest from Penguin Solutions
h3Penguin Solutions Releases ICE ClusterWare Management Software 13.0
h3Our CEO, Mark Adams, Recently Spoke with Scott McGrew on NBC News
h3CEO & President Mark Adams Joins the Micro Journeys Podcast
h3SK Telecom Launches Sovereign AI Infrastructure, Powered by NVIDIA
h3Five Critical Design Considerations for AI Infrastructure
h3Penguin Solutions Signs Agreement with CDW Expanding Customer Reach
h3Stratus ztC Endurance Named “HPC Solution of the Year”
h3Penguin Solutions' OriginAI Honored as a Winner in the 2025 AI Excellence Awards
h3Penguin Solutions Supports Pure Storage Introduction of FlashBlade//EXA™
h3Rebellions Partners on Strategic Collaboration Initiative
h3Penguin Solutions Expands Its AI Infrastructure Management Software
h3Mark Seamans Discusses Simplifying AI Complexity with Data Management
h3Penguin Solutions Signs AI Data Center Collaboration Agreement with SK Telecom and SK hynix
h3Penguin Solutions Named in Top Five Vendors to Watch in 2024 HPCwire Readers’ and Editors’ Choice Awards
h3OriginAI Infrastructure Now Available with Additional GPUs and Enhanced Cluster Management Capabilities
h3Penguin Solutions Accelerates Time to Value for AI Factories
h3Penguin Solutions Selected as the Managed Services Partner for Voltage Park’s NVIDIA Clusters
h3@HPCpodcast Industry View: Penguin Solutions on Getting AI Infrastructure Right
h3Sandia Partners With NextSilicon and Penguin Solutions to Deliver ‘First of its Kind’ Runtime Reconfigurable Accelerator Technology
h3AI Makes Mark on Engineering Education
h3Georgia Tech Unveils New AI Makerspace in Collaboration with NVIDIA
h3The Infrastructure Behind the Outputs: Cloud and HPC Unlock the Power of AI
h3Shell Deploys Cooling Immersion Pods in Texas Data Center
h3Air Force Research Lab Adds 12PFLOPS HPC System
h3Supercomputing Platform From Penguin Solutions Installed at DoD Site
h2Talk to Our Experts
h2Solving complexity. Accelerating results.
h3Get in touch
h3Partners
h3Company
Markdown Content
Designing & Building Data Center AI Infrastructure at Scale Skip to main content - 1-415-954-2800 - Support - English English 日本語 한국어 中文 Español 繁體中文 - What We Do Our Expertise AI & HPC Data Centers - AI Infrastructure - Cluster Management - Private Cloud - Power & Cooling Fault Tolerant Solutions - Edge - Data Centers Integrated Memory - Expansion & Pooling - Protective Coating - Specialty Memory Solutions Challenges Solved - Infrastructure Cost & ROI - Computational Power & Scalability - Energy Consumption & Sustainability - Unplanned Operational Downtime - Scaling the Memory Wall Industries Served - Oil & Gas - Financial Services - Life Sciences & Healthcare - Higher Education - Government - Weather - Manufacturing - Retail - Critical Infrastructure - Transportation Featured Post See All See All See All Tech Insights & Strategies Designing Resilient Edge Systems for the AI Era Read full article Read full article - Our Products Accelerated Computing Solutions - OriginAI® Infrastructure Software - ICE ClusterWare™ - ICE RemoteWare™ Hardware - Altus AMD EPYC™ Servers - Relion Intel® Xeon® Servers - GPU Accelerated Servers - CXL Memory Expansion Servers - Dell AI Optimized Hardware - NVIDIA DGX™ Systems Fault Tolerant Computing Computing Platforms - Stratus ztC Edge® - Stratus ztC Endurance® - Stratus ftServer™ - Stratus everRun® - Stratus V Series Integrated Memory Advanced Serial Memory - SMART Modular CXL®-Based Solutions DRAM Modules - SMART Modular Memory Modules - SMART Modular Zefr® ZDIMMs - SMART Modular Rugged / Industrial - SMART Modular Value Memory Flash Storage - SMART Modular Embedded SSDs - SMART Modular Data Center SSDs - SMART Modular RUGGED™ SSDs SMARTsemi® - SMARTsemi DRAMs - SMARTsemi Flash Storage Services Accelerated Computing Expertise - Design Services - Build Services - Deployment Services - Managed Services - Cluster Integrity Assessment - Support Services Fault Tolerant Computing Expertise - Managed Services - Professional Services - Educational Services Support Services Integrated Memory Expertise - SMART Memory Test Labs - Zefr Enterprise Memory - Our Company Who We Are With over two decades of experience as trusted advisors to our valued customers, Penguin Solutions is an end-to-end solutions provider helping solve complex challenges in computing, memory, and LED solutions. Our Culture - About Us - Leadership - Locations ESG Impact - Overview - Environment - People - Social - Governance - Supply Chain Investors - Overview - Financials - Stock Information - Events & Presentations - Governance - Resources - News Additional Brands - SMARTsemi® - Cree LED Featured See All See All See All June 17, 2025 Penguin Solutions Announces Second Generation Stratus ztC Endurance Fault Tolerant Computing Platforms Read full article Read full article April 7, 2025 Stratus ztC Endurance Named “HPC Solution of the Year” in the Data Breakthrough Awards Program Read full article Read full article - Our Partners - Solution Partners Meet our compute, storage, technology, and uptime automation partners for high-performance, high-availability enterprise solutions. - Channel Partners Our partner focused programs reward those partner organizations that see the mutual benefits of collaborating to drive success. - Alliance Partners Together, our solutions simplify, protect, and automate your digitally transforming business-critical operations. - System Integrators Gain the training, certification, accreditation, systems, and support to develop automation and control solutions. Quick Links Partner Program - Plan Overview - Become a Partner - Find a Partner Partner Portal - Partner Login Where to Buy - SMART Modular Distributors, OEM, & Channel Sales Partner News May 6, 2025 Penguin Solutions Signs Agreement with CDW Expanding Customer Reach for AI Infrastructure Offerings Read full article Read full article March 4, 2025 Rebellions Partners on Strategic Collaboration Initiative to Advance Global AI Data Center Ecosystem Read full article Read full article - Resource Hub - Blog Explore the extensive library of thought leadership articles about AI, HPC, Cloud, Edge, and IoT from Penguin Solutions. - Newsroom The latest press releases, featuring company news, innovations, and key announcements. - Events Engage with experts from Penguin Solutions and discuss your solution requirements and challenges. - AI Resource Hub A free Education Hub geared to inform and equip you with the latest AI factory and Generative AI trends. Resources - Analyst Reports - Brochures - Case Studies - Datasheets - eBooks - EDGEucation Hubs - Infographics - On-Demand Webinars - Solution Briefs - Videos - Whitepapers Featured Post See All See All See All Partnership Sovereign AI Clusters Powered by Penguin Solutions and SK Telecom Read full article Read full article - Careers - Careers Overview Whether you're a seasoned expert or just starting your career, this is where your journey toward meaningful innovation begins. - How We Hire Expect thoughtful conversations, timely feedback, and a candidate experience that puts people first, just like everything we do. - Life At Penguin From mentorship programs to continuous learning, we create an environment where everyone can thrive, grow, and make a difference. - Become A Penguin Explore open roles, connect with our talent community, and discover how your skills can help solve some of the world’s most complex challenges. Openings By Location - California - Massachussetts - North Carolina - Remote Stories That Shape Us Max Marinovich “Be ready to tackle challenges head-on and dive into the latest tech. You’ll grow fast and you’ll have a great time doing it.” Penguin Stories Penguin Stories - Contact - Search Search Search - What We Do Our Expertise AI & HPC Data Centers - AI Infrastructure - Cluster Management - Private Cloud - Power & Cooling Fault Tolerant Solutions - Edge - Data Centers Integrated Memory - Expansion & Pooling - Protective Coating - Specialty Memory Solutions Challenges Solved - Infrastructure Cost & ROI - Computational Power & Scalability - Energy Consumption & Sustainability - Unplanned Operational Downtime - Scaling the Memory Wall Industries Served - Oil & Gas - Financial Services - Life Sciences & Healthcare - Higher Education - Government - Weather - Manufacturing - Retail - Critical Infrastructure - Transportation Featured Post See All See All See All Tech Insights & Strategies Designing Resilient Edge Systems for the AI Era Read full article Read full article - Our Products Accelerated Computing Solutions - OriginAI® Infrastructure Software - ICE ClusterWare™ - ICE RemoteWare™ Hardware - Altus AMD EPYC™ Servers - Relion Intel® Xeon® Servers - GPU Accelerated Servers - CXL Memory Expansion Servers - Dell AI Optimized Hardware - NVIDIA DGX™ Systems Fault Tolerant Computing Computing Platforms - Stratus ztC Edge® - Stratus ztC Endurance® - Stratus ftServer™ - Stratus everRun® - Stratus V Series Integrated Memory Advanced Serial Memory - SMART Modular CXL®-Based Solutions DRAM Modules - SMART Modular Memory Modules - SMART Modular Zefr® ZDIMMs - SMART Modular Rugged / Industrial - SMART Modular Value Memory Flash Storage - SMART Modular Embedded SSDs - SMART Modular Data Center SSDs - SMART Modular RUGGED™ SSDs SMARTsemi® - SMARTsemi DRAMs - SMARTsemi Flash Storage Services Accelerated Computing Expertise - Design Services - Build Services - Deployment Services - Managed Services - Cluster Integrity Assessment - Support Services Fault Tolerant Computing Expertise - Managed Services - Professional Services - Educational Services Support Services Integrated Memory Expertise - SMART Memory Test Labs - Zefr Enterprise Memory - Our Company Who We Are With over two decades of experience as trusted advisors to our valued customers, Penguin Solutions is an end-to-end solutions provider helping solve complex challenges in computing, memory, and LED solutions. Our Culture - About Us - Leadership - Locations ESG Impact - Overview - Environment - People - Social - Governance - Supply Chain Investors - Overview - Financials - Stock Information - Events & Presentations - Governance - Resources - News Additional Brands - SMARTsemi® - Cree LED Featured See All See All See All June 17, 2025 Penguin Solutions Announces Second Generation Stratus ztC Endurance Fault Tolerant Computing Platforms Read full article Read full article April 7, 2025 Stratus ztC Endurance Named “HPC Solution of the Year” in the Data Breakthrough Awards Program Read full article Read full article - Our Partners - Solution Partners Meet our compute, storage, technology, and uptime automation partners for high-performance, high-availability enterprise solutions. - Channel Partners Our partner focused programs reward those partner organizations that see the mutual benefits of collaborating to drive success. - Alliance Partners Together, our solutions simplify, protect, and automate your digitally transforming business-critical operations. - System Integrators Gain the training, certification, accreditation, systems, and support to develop automation and control solutions. Quick Links Partner Program - Plan Overview - Become a Partner - Find a Partner Partner Portal - Partner Login Where to Buy - SMART Modular Distributors, OEM, & Channel Sales Partner News May 6, 2025 Penguin Solutions Signs Agreement with CDW Expanding Customer Reach for AI Infrastructure Offerings Read full article Read full article March 4, 2025 Rebellions Partners on Strategic Collaboration Initiative to Advance Global AI Data Center Ecosystem Read full article Read full article - Resource Hub - Blog Explore the extensive library of thought leadership articles about AI, HPC, Cloud, Edge, and IoT from Penguin Solutions. - Newsroom The latest press releases, featuring company news, innovations, and key announcements. - Events Engage with experts from Penguin Solutions and discuss your solution requirements and challenges. - AI Resource Hub A free Education Hub geared to inform and equip you with the latest AI factory and Generative AI trends. Resources - Analyst Reports - Brochures - Case Studies - Datasheets - eBooks - EDGEucation Hubs - Infographics - On-Demand Webinars - Solution Briefs - Videos - Whitepapers Featured Post See All See All See All Partnership Sovereign AI Clusters Powered by Penguin Solutions and SK Telecom Read full article Read full article - Careers - Careers Overview Whether you're a seasoned expert or just starting your career, this is where your journey toward meaningful innovation begins. - How We Hire Expect thoughtful conversations, timely feedback, and a candidate experience that puts people first, just like everything we do. - Life At Penguin From mentorship programs to continuous learning, we create an environment where everyone can thrive, grow, and make a difference. - Become A Penguin Explore open roles, connect with our talent community, and discover how your skills can help solve some of the world’s most complex challenges. Openings By Location - California - Massachussetts - North Carolina - Remote Stories That Shape Us Max Marinovich “Be ready to tackle challenges head-on and dive into the latest tech. You’ll grow fast and you’ll have a great time doing it.” Penguin Stories Penguin Stories - Contact - Search Search Search # We Make AI Possible. Scalable. Powerful. Sustainable. Reliable. Explore How We Solve These Challenges: Infrastructure Cost & ROI Infrastructure Cost & ROI Infrastructure Cost & ROI Computational Power & Scalability Computational Power & Scalability Computational Power & Scalability Energy Consumption & Sustainability Energy Consumption & Sustainability Energy Consumption & Sustainability Scaling the Memory Wall Scaling the Memory Wall Scaling the Memory Wall Unplanned Operational Downtime Unplanned Operational Downtime Unplanned Operational Downtime Want To Know How We’d Solve Your Challenge? Talk to Our Experts Talk to Our Experts Talk to Our Experts ## Harness the Power of Accelerated Computing At Penguin Solutions, we understand the boundless potential of technology. We help our customers turn cutting-edge ideas into outcomes—faster and at any scale. ### 25+ Years Experience ### 89,000+ GPUs Deployed & Managed ### 3.3+ Billion Hours of GPU Runtime Customer Stories ## Customers Trust Penguin Solutions - Voltage Park relies on Penguin Solutions to get maximum GPU performance and cluster availability from their large-scale AI infrastructure to meet their compute-hungry customers’ demands. Read full story Read full story Read full story - Shell powers its sustainable high-performance data centers with Penguin’s high-performance computing (HPC) solutions, including immersion cooling. Read full story Read full story Read full story - Penguin Solutions designed, built, and deployed the infrastructure to support the Georgia Tech AI Makerspace. Read full story Read full story Read full story - Penguin Solutions deploys NextSilicon accelerator technology as part of the Vanguard program at Sandia National Labs. Read full story Read full story Read full story Industry Expertise ## Unmatched Expertise in Industry-Specific Solutions Oil & Gas Oil & Gas Oil & Gas Financial Services Financial Services Financial Services Life Sciences & Healthcare Life Sciences & Healthcare Life Sciences & Healthcare Higher Education Higher Education Higher Education Government Government Government Weather Weather Weather Manufacturing Manufacturing Manufacturing Retail Retail Retail Critical Infrastructure Critical Infrastructure Critical Infrastructure Transportation Transportation Transportation Our Process ## AI Infrastructure Comprehensive Services Penguin Solutions is dedicated to our customers’ success. With 25 years of HPC experience designing, building, deploying, and managing AI and accelerated computing clusters, we have enabled some of the world’s most sophisticated workloads. - Design Accelerate time to value by basing system architectures on a proven set of designs that have been validated at scale in numerous production deployments. Design Services Design Services Design Services - Build Achieve high rates of system stability with our in-factory experts who integrate and validate all components of the compute cluster including rack integration, network configuration, and burn-in testing. Build Services Build Services Build Services - Deploy Drive on-site installations with coordination of data center staff, data storage partners, and infrastructure cooling providers—and utilize ICE ClusterWare software to validate production readiness. Deploy Services Deploy Services Deploy Services - Manage Assure production readiness and change management by working with a certified NVIDIA DGX Managed Services provider, the offers a full set of end-to-end services. Manage Services Manage Services Manage Services > “After a thorough RFP process, it was clear early on that Penguin was the right partner for us. Not only do they have the technical expertise and decades of experience, but they’re able to move very fast.” Ozan Kaya | CEO > “It takes a village to do AI well, it takes an infrastructure, it takes a data center, and it takes experts. And, I think in that regard, having Georgia Tech, NVIDIA, and Penguin—that’s what it takes.” Matthieu Bloch | Associate Dean of Academic Affairs Our Products ## Precision Engineered for Accelerated Performance AI & HPC Fault Tolerance Memory OriginAI ICE ClusterWare AI Managed Services ## OriginAI® OriginAI® is an AI factory infrastructure solution built on proven, pre-defined AI architectures that can scale from hundreds to over 16,000 GPU clusters. OriginAI integrates these validated technologies with Penguin’s intelligent, intuitive cluster management software and expert services for designing, building, deploying, and managing AI infrastructure at scale. Discover OriginAI Discover OriginAI Discover OriginAI ## ICE ClusterWare™ Simplify the deployment and management of AI clusters to realize greater productivity at speed. With ICE ClusterWare™, bare-metal hardware, network, and software resources are transformed into high-performance cluster environments, reducing administration complexity and optimizing resource availability. Discover ICE ClusterWare Discover ICE ClusterWare Discover ICE ClusterWare ## Delivering NVIDIA DGX-Ready Managed Services Penguin Solutions has designed and deployed large NVIDIA DGX clusters with high-speed NVIDIA InfiniBand networking and optimized storage. We have deep expertise and relationships with most storage vendors which allows us to provide bespoke solutions for every customer. Explore AI Managed Services Explore AI Managed Services Explore AI Managed Services Stratus ztC Endurance Stratus ztC Edge Stratus everRun ## Stratus ztC Endurance™ Stratus ztC Endurance™ is an innovative family of computing platforms that enables intelligent, predictive fault tolerance and 99.99999% compute platform availability. The platform combines built-in fault tolerance, proactive health monitoring, and serviceability by OT or IT, all while meeting your cybersecurity requirements. Discover Stratus ztC Endurance Discover Stratus ztC Endurance Discover Stratus ztC Endurance ## Stratus ztC Edge™ Stratus ztC Edge™ is a secure, rugged, highly automated computing platform that improves productivity, increases operational efficiency, and reduces downtime risk at the edge of corporate networks. Its self-protecting and self-monitoring features drastically reduce unplanned downtime and ensure continuous availability of business-critical applications. Discover Stratus ztC Edge Discover Stratus ztC Edge Discover Stratus ztC Edge ## Stratus everRun® Stratus everRun® is a software solution that pairs two servers via virtualization to create protected and replicated virtual machines (VMs) within a single operating environment, ensuring your applications run without interruption or data loss. Stratus everRun accelerates time to revenue by transforming your applications into continuously available solutions with customized availability. Discover Stratus everRun Discover Stratus everRun Discover Stratus everRun CXL® Memory Solutions Zefr ZDIMMs Data Center SSDs ## Introducing the New Family of CXL® Add-in-Cards (AICs) Compute Express Link (CXL) enables data centers, cloud services, and HPC providers to expand memory for intensive computing easily and cost-effectively. Discover CXL AIC Discover CXL AIC Discover CXL AIC ## Ultra-High Reliability Zefr ZDIMM Memory Modules Ideal for data centers, hyperscalers, and HPC platforms running large memory applications that require maximum compute availability. Discover Zefr ZDIMM Discover Zefr ZDIMM Discover Zefr ZDIMM ## Next-Generation Data Center SSDs Designed to meet the stringent demands placed on storage systems in hyperscaler, hyper-converged, enterprise, and edge data centers. Discover Data Center SSDs Discover Data Center SSDs Discover Data Center SSDs News Corner ## Latest from Penguin Solutions News November 17, 2025 ### Penguin Solutions Releases ICE ClusterWare Management Software 13.0 Read More Read More Media October 9, 2025 ### Our CEO, Mark Adams, Recently Spoke with Scott McGrew on NBC News Read More Read More Media September 3, 2025 ### CEO & President Mark Adams Joins the Micro Journeys Podcast Read More Read More News August 5, 2025 ### SK Telecom Launches Sovereign AI Infrastructure, Powered by NVIDIA Read More Read More Media June 13, 2025 ### Five Critical Design Considerations for AI Infrastructure Read More Read More News May 6, 2025 ### Penguin Solutions Signs Agreement with CDW Expanding Customer Reach Read More Read More News April 7, 2025 ### Stratus ztC Endurance Named “HPC Solution of the Year” Read More Read More News March 25, 2025 ### Penguin Solutions' OriginAI Honored as a Winner in the 2025 AI Excellence Awards Read More Read More News March 11, 2025 ### Penguin Solutions Supports Pure Storage Introduction of FlashBlade//EXA™ Read More Read More News March 4, 2025 ### Rebellions Partners on Strategic Collaboration Initiative Read More Read More News March 4, 2025 ### Penguin Solutions Expands Its AI Infrastructure Management Software Read More Read More Media January 17, 2025 ### Mark Seamans Discusses Simplifying AI Complexity with Data Management Read More Read More News January 9, 2025 ### Penguin Solutions Signs AI Data Center Collaboration Agreement with SK Telecom and SK hynix Read More Read More Blog November 20, 2024 ### Penguin Solutions Named in Top Five Vendors to Watch in 2024 HPCwire Readers’ and Editors’ Choice Awards Read More Read More News November 19, 2024 ### OriginAI Infrastructure Now Available with Additional GPUs and Enhanced Cluster Management Capabilities Read More Read More News November 18, 2024 ### Penguin Solutions Accelerates Time to Value for AI Factories Read More Read More News July 11, 2024 ### Penguin Solutions Selected as the Managed Services Partner for Voltage Park’s NVIDIA Clusters Read More Read More Media July 9, 2024 ### @HPCpodcast Industry View: Penguin Solutions on Getting AI Infrastructure Right Read More Read More Media May 8, 2024 ### Sandia Partners With NextSilicon and Penguin Solutions to Deliver ‘First of its Kind’ Runtime Reconfigurable Accelerator Technology Read More Read More Media April 15, 2024 ### AI Makes Mark on Engineering Education Read More Read More Media April 10, 2024 ### Georgia Tech Unveils New AI Makerspace in Collaboration with NVIDIA Read More Read More Blog February 19, 2024 ### The Infrastructure Behind the Outputs: Cloud and HPC Unlock the Power of AI Read More Read More Media January 22, 2024 ### Shell Deploys Cooling Immersion Pods in Texas Data Center Read More Read More Media September 20, 2023 ### Air Force Research Lab Adds 12PFLOPS HPC System Read More Read More Media April 27, 2023 ### Supercomputing Platform From Penguin Solutions Installed at DoD Site Read More Read More Request a Callback ## Talk to Our Experts Whether you’re struggling with AI solution design, build, deployment, or management—in your data center or in the cloud—Penguin Solutions can help. Partner with Penguin Solutions and get on track to your improve AI advantage. Let's Talk Let's Talk Let's Talk ## Solving complexity. Accelerating results. Penguin Solutions accelerates digital transformation with the power of emerging technologies in HPC, AI, and IoT with solutions and services that span the continuum of edge, core, and cloud. ### Get in touch 1-415-954-2800 45800 Northport Loop W. Fremont, CA 94538 Contact Us Investor Relations Public Relations Media Kit Ethics, Compliance and Helpline ### Partners Program Overview Alliance Partners Channel Partners System Integrators Partner Directory Partner Login ### Company About Us Investors Newsroom Careers Support Third Party Onboarding Terms and Conditions Registered Trademarks Privacy Policy © YYYY Penguin Solutions. All rights reserved. | Do Not Sell or Share Information | Cookie Preferences