ThinkSystem SR675 V3

The Lenovo ThinkSystem SR675 V3 is a versatile GPU-rich 3U rack server that supports eight double-wide GPUs including the new NVIDIA H100 and L40S Tensor Core GPUs, or the NVIDIA HGX H100 4-GPU offering with NVLink and Lenovo Neptune hybrid liquid-to-air cooling. The server is based on the new AMD EPYC 9004 Series processors (formerly codenamed "Genoa", “Genoa-X” and “Bergamo”).

Overview

Key features

The SR675 V3 features a modular design for ultimate flexibility. Multiple configurations are supported, including:

  • One or two 4th Generation AMD EPYC™ Processors
  • Up to eight double-wide GPUs with NVLink bridges
  • NVIDIA HGX H100 4-GPU with NVLink and Lenovo Neptune hybrid liquid cooling
  • AMD Instinct™ MI Series Accelerators
  • Choice of front or rear high-speed networking
  • Choice of local high speed NVMe storage

There are three different base configurations of the SR675 V3 as shown in the following figure. The configurations determine the type and quantity of GPUs supported as well as the supported drive bays.

The SR675 V3 is built on up to two AMD EPYC 9004 Series processors and is designed to support the vast NVIDIA Hooper and Ampere datacenter portfolio and AMD Instinct™ MI Series Accelerators. The SR675 V3 delivers performance optimized for your workload, be it visualization, rendering or computationally intensive HPC and AI.

Scalability and performance

The SR675 V3 offers numerous features to boost performance, improve scalability and reduce costs:

  • Supports up to eight high-performance PCIe double-wide GPUs including the new NVIDIA H100 and L40 Tensor Core GPUs.
  • Supports the NVIDIA HGX H100 4-GPU complex with NVLink and Lenovo Neptune hybrid liquid cooling.
  • Supports NVIDIA NVLink, which offers a GPU-to-GPU direct connection of up to 900 GB/s bandwidth and supported in both SXM5 and double-wide PCIe GPU configurations. NVLink also allows for a larger combined memory footprint for bigger batch sizes or the processing of larger images.
  • Supports up to two fourth-generation AMD EPYC 9004 processors
  • Up to 128 cores and 256 threads
  • Core speed of up to 3.5 GHz
  • Nominal TDP rating of up to 360 W, configurable TDP up to 400W
  • Supports up to 24 DDR5 memory DIMMs operating at up to 4800 MHz to maximize the performance of the memory subsystem.
  • Using 128GB 3DS RDIMMs, the server supports up to 3TB of system memory
  • Supports GPU Direct RDMA I/O where high-speed network adapters are directly connected to the GPUs, to maximize I/O performance.
  • Supports GPU Direct Storage where NVMe drives are directly connected to the GPUs, to maximize storage performance.
  • A variety of slot configurations available, depending on the GPU and NVMe storage configuration selected:
  • Two front PCIe 5.0 x16 slots
  • Four rear PCIe 5.0 x16 slots (configuration dependent)
  • One rear OCP 3.0 slot, PCIe 4.0 x8 or x16
  • Supports a variety of internal storage configurations:
  • 8x 2.5-inch hot-swap SSDs, with SAS, SATA or NVMe interfaces
  • 6x EDSFF E1.S hot-swap NVMe SSDs
  • 4x EDSFF E3.S hot-swap NVMe SSDs
  • Supports NVMe drives without oversubscription of PCIe lanes (1:1 connectivity). The use of NVMe drives maximizes drive I/O performance, in terms of throughput, bandwidth, and latency.
  • Supports high-speed RAID controllers from Lenovo and Broadcom providing 12 Gb SAS connectivity to the drive backplanes. A variety of PCIe 3.0 and PCIe 4.0 RAID adapters are available.
  • Supports M.2 drives for convenient operating system boot functions. Available M.2 adapters support either one M.2 drive or two M.2 drives in a RAID 1 configuration for performance and reliability.
  • The server has an industry-standard OCP 3.0 small form factor (SFF) slot, with a PCIe 4.0 interface, up to x16, supporting a variety of Ethernet network adapters. A simple-swap mechanism with a thumbscrew and pull-tab enables tool-less installation and removal of the adapter. The adapter supports shared BMC network sideband connectivity to enable out-of-band systems management.
  • The server offers PCI Express 5.0 I/O expansion capabilities that doubles the theoretical maximum bandwidth of PCIe 4.0 (32GT/s in each direction for PCIe 5.0, compared to 16 GT/s with PCIe 4.0). A PCIe 5.0 x16 slot provides 63 GB/s bandwidth, enough to support a 200GbE network connection.
  • The server offers a choice of PCIe 5.0 x16 full-height slots, depending on the GPU and NVMe connections selected. Available slots are two front slots and four rear slots, plus the slot dedicated to the OCP adapter. The flexibility of configuration ensures you can get the PCIe resources you need for a high-performance HPC/AI implementation.

Availability and serviceability

The SR675 V3 provides many features to simplify serviceability and increase system uptime:

  • Designed to run 24 hours a day, 7 days a week.
  • The server offers hot-swap drives, supporting RAID redundancy for data protection and greater system uptime.
  • Available M.2 RAID Boot Adapters support RAID-1 which can enable two NVMe M.2 drives to be configured as a redundant pair.
  • The server has four hot-swap power supplies and five simple-swap redundant fans to provide availability for business-critical applications. N+N, N+0 (non-redundant), N+1 configurations available.
  • The Liquid Assisted Cooling Module on the configuration with SXM5 GPUs employs four redundant low-pressure pumps to circulate the liquid to cool the GPUs.
  • The light path diagnostics feature uses LEDs to lead the technician to failed (or failing) components, which simplifies servicing, speeds up problem resolution, and helps improve system availability.
  • Solid-state drives (SSDs) offer more reliability and performance than traditional mechanical HDDs for greater uptime.
  • Proactive Platform Alerts (including PFA and SMART alerts): Processors, voltage regulators, memory, internal storage (SAS/SATA SSDs, NVMe SSDs, M.2 storage, flash storage adapters), fans, power supplies, RAID controllers, server ambient and subcomponent temperatures. Alerts can be surfaced through the XClarity Controller to managers such as Lenovo XClarity Administrator, VMware vCenter, and Microsoft System Center. These proactive alerts let you take appropriate actions in advance of possible failure, thereby increasing server uptime and application availability.
  • The built-in XClarity Controller continuously monitors system parameters, triggers alerts, and performs recovery actions in case of failures to minimize downtime.
  • Built-in diagnostics in UEFI, using Lenovo XClarity Provisioning Manager, speed up troubleshooting tasks to reduce service time.
  • Lenovo XClarity Provisioning Manager supports diagnostics and can save service data to a USB key drive or remote CIFS share folder for troubleshooting and reduce service time.
  • Auto restart in the event of a momentary loss of AC power (based on power policy setting in the XClarity Controller service processor)
  • Offers a diagnostics port on the front of the server to allow you to attach an external diagnostics handset for enhanced systems management capabilities.
  • Support for the XClarity Administrator Mobile app running on a supported smartphone or tablet and connected to the server through the service-enabled USB port, enables additional local systems management functions.
  • Three-year or one-year customer-replaceable unit and onsite limited warranty (varies by geography), 9 x 5 next business day. Optional service upgrades are available.

Manageability and security

Systems management features simplify local and remote management of the SR675 V3:

  • Lenovo XClarity Controller 2 (XCC2) monitors server availability and performs remote management. XCC2 Platinum is standard, which enables remote KVM, the mounting of remote media files (ISO and IMG image files), boot capture, and power capping.
  • Lenovo XClarity Administrator offers comprehensive hardware management tools that help to increase uptime, reduce costs and improve productivity through advanced server management capabilities.
  • UEFI-based Lenovo XClarity Provisioning Manager, accessible from F1 during boot, provides system inventory information, graphical UEFI Setup, platform update function, RAID Setup wizard, operating system installation function, and diagnostic functions.
  • Support for Lenovo XClarity Energy Manager, which captures real-time power and temperature data from the server and provides automated controls to lower energy costs.
  • Lenovo HPC & AI Software Stack provides our HPC customers you with a fully tested and supported open-source software stack to enable your administrators and users with for the most effective and environmentally sustainable consumption of Lenovo supercomputing capabilities.
  • Our Confluent management system and Lenovo Intelligent Computing Orchestration (LiCO) web portal provides an interface designed to abstract the users from the complexity of HPC cluster orchestration and AI workloads management, making open-source HPC software consumable for every customer.
  • An integrated industry-standard Unified Extensible Firmware Interface (UEFI) enables improved setup, configuration, and updates, and simplifies error handling.
  • Support for industry standard management protocols, IPMI 2.0, SNMP 3.0, Redfish REST API, serial console via IPMI.
  • An integrated hardware Trusted Platform Module (TPM) supporting TPM 2.0 enables advanced cryptographic functionality, such as digital signatures and remote attestation.
  • Administrator and power-on passwords help protect from unauthorized access to the server.
  • Supports Secure Boot to ensure only a digitally signed operating system can be used. Supported with SSDs, as well as M.2 drives.
  • Industry-standard Advanced Encryption Standard (AES) NI support for faster, stronger encryption.
  • An included chassis intrusion switch provides an additional physical security feature.

Energy efficiency

The SR675 V3 offers the following energy-efficiency features to save energy, reduce operational costs, and increase energy availability:

  • Energy-efficient system board components help lower operational costs.
  • High-efficiency power supplies with 80 PLUS Titanium or Platinum certification.
  • Solid-state drives (SSDs) consume as much as 80% less power than traditional spinning 2.5-inch HDDs.
  • Optional Lenovo XClarity Energy Manager provides advanced data center power notification, analysis, and policy-based management to help achieve lower heat output and reduced cooling needs.


Next steps

Review the product specifications

Take a deeper dive into the specifications for this product
View product specs
Certifications

Certifications

Learn about Red Hat Certification

Red Hat certified hardware is proven to incorporate Red Hat's best practices and provides customers tested interoperability, known life cycle management, and trusted support.

CompareProductLevel
Red Hat OpenStack Services on OpenShift 18.0 - 18.x

Base OS: Red Hat Enterprise Linux 9.0

Architecture: x86_64

CertifiedView features
Red Hat OpenStack Platform 17.0 - 17.x

Base OS: Red Hat Enterprise Linux 9.0

Architecture: x86_64

CertifiedView features
Red Hat Enterprise Linux

Architecture: x86_64

CertifiedView features
Red Hat Enterprise Linux

Architecture: x86_64

CertifiedView features
Red Hat Enterprise Linux 8.6 - 8.x

Architecture: x86_64

CertifiedView features
Red Hat OpenShift Container Platform 4.13 - 4.x

Base OS: Red Hat Enterprise Linux 9.0

Architecture: x86_64

CertifiedView features
Red Hat OpenShift Container Platform 4.11 - 4.12

Base OS: Red Hat Enterprise Linux 8.6

Architecture: x86_64

CertifiedView features
Red Hat Virtualization

Base OS: Red Hat Enterprise Linux 8.6

Architecture: x86_64

CertifiedView features
Red Hat Enterprise Linux AI

Architecture: x86_64

Partner ValidatedView features
Red Hat logoLinkedInYouTubeFacebookTwitter

Platforms

Products & services

Try, buy, sell

Help

About Red Hat Ecosystem Catalog

The Red Hat Ecosystem Catalog is the official source for discovering and learning more about the Red Hat Ecosystem of both Red Hat and certified third-party products and services.

We’re the world’s leading provider of enterprise open source solutions—including Linux, cloud, container, and Kubernetes. We deliver hardened solutions that make it easier for enterprises to work across platforms and environments, from the core datacenter to the network edge.