This solution enables organizations to harness the full potential of generative AI while overcoming challenges related to management, processing power, and scalability.
Artificial intelligence (AI) is poised to revolutionize all aspects of life, driving businesses to make significant investments in initiatives that deliver value. According to IDC’s December 2023 Future Enterprise Resiliency and Spending Survey (Wave 11), 66% of organizations worldwide are exploring the potential of GenAI. Advances in traditional IT systems, including the incorporation of graphics processing units (GPUs) have made it possible to effectively run predictive AI workloads. However, this acceleration is also exposing unforeseen challenges for AI adopters. The advent of generative AI (gen AI) has put an even greater strain on IT systems and software requirements.
Together, Red Hat and Cisco provide a blueprint for simplified delivery of predicative and generative AI/ML models. With Cisco’s full-stack architecture working in tandem with Red Hat’s open source containerization and automation solutions, organizations can simplify, optimize, and scale their operations for AI/ML and gen AI adoption.
This solution, which is based on Cisco Unified Computing System (UCS®) with Intel® Xeon® Scalable Processors and Cisco Nexus®, provides a compelling and scalable foundation for deploying generative AI at scale, while Red Hat® OpenShift® AI offers an AI-focused suite of tools covering the entire AI/ML experimentation and model lifecycle. The integration of Cisco Intersight and Red Hat OpenShift allows for automated Cisco UCS bare-metal configuration, provisioning, and installation, helping organizations simplify management and maintain servers anywhere.
The architecture scales with your Generative AI inferencing needs. Add or remove servers, adjust memory capacities, and configure resources in an automated manner as your models evolve and workloads grow using Cisco Intersight®.
The solution provides a platform build on Red Hat OpenShift to deliver a consistent, streamlined, and automated experience when handling the workload and performance demands of AI/ML projects in the data center and across the hybrid cloud.
Build on a secure, stable, enterprise-grade foundation, this solution provide a robust and trusted environment for the development and deployment of machine learning models at scale. The architecture integrates security at multiple layers of infrastructure and operations, from silicon and motherboard to OS and applications development platform level, to reduce risk, avoid attack and protect data. To protect AI applications and workloads, Red Hat OpenShift AI provides consistent security controls across hybrid cloud environments, ensuring that your organization maintain security posture and compliance standards. The platform also offers monitoring and observability tools that enable agencies to track the performance of AI models, detect anomalies and respond to security incidents promptly. By leveraging the security features of Red Hat OpenShift AI, you can enhance the protection of their AI models, data and infrastructure, mitigating security risks and ensuring compliance with regulatory requirements.
Collections are a distribution format for Ansible content that can include playbooks, roles, modules, and plugins. You can install and access the certified collections through the Red Hat Ansible Automation Hub in the Hybrid Cloud Console.
The Red Hat Ecosystem Catalog is the official source for discovering and learning more about the Red Hat Ecosystem of both Red Hat and certified third-party products and services.
We’re the world’s leading provider of enterprise open source solutions—including Linux, cloud, container, and Kubernetes. We deliver hardened solutions that make it easier for enterprises to work across platforms and environments, from the core datacenter to the network edge.