NVIDIA DGX Station User Manual

DU-08255-001 _v4.6 | July 2020
DGX Station
User Guide

Table of Contents
About this Guide
Chapter 1. Introduction to the NVIDIA® DGX Station™
1.1. What's in the Box
1.2. DGX OS Desktop Software Summary
1.3. DGX Station Hardware Summary
Chapter 2. Setting Up the NVIDIA DGX Station
2.1. Siting the DGX Station
2.2. Removing or Replacing the Packing Inside the DGX Station
2.3. Connecting and Powering on the DGX Station
2.4. Completing the Initial Ubuntu OS Configuration
2.5. Adding Support for Additional Languages to the DGX Station
2.6. Registering Your DGX Station
2.7. Configuring the DGX Station To Use Multiple Displays
2.8. Enabling Multiple Users to Access the DGX Station Remotely
2.9. Preparing the DGX Station for Use with Docker
2.9.1. Enabling Users To Run Docker Containers
2.9.2. Preventing IP Address Conflicts Between Docker and the DGX Station
2.10. Managing CPU Mitigations
2.10.1. Determining the CPU Mitigation State of the DGX System
2.10.2. Disabling CPU Mitigations
2.10.3. Re-enabling CPU Mitigations
Chapter 3. Upgrading DGX OS Desktop Software on DGX Station
3.1. Upgrading Within the Same DGX OS Desktop Major Release
3.1.1. Upgrading Within the Same DGX OS Desktop Major Release from the Software Updater Application
3.1.2. Upgrading Within the Same DGX OS Desktop Major Release from the Command Line
3.2. Upgrading to a New DGX OS Desktop Major Release
3.3. Opting in to DGX OS Desktop Patch Updates
3.4. Available DGX Station Software Updates
3.4.1. Updates to Docker and Software Exclusive to the DGX Station
3.4.2. Updates to the Ubuntu Software on the DGX Station
3.5. Checking for Updates to DGX Station Software
3.6. Getting Release Information for DGX Station
3.7. Updating Software on an Air-Gapped DGX Station System
3.7.1. Providing DGX Station Software Updates from a Private Repository
3.7.2. Loading a Container Image onto an Air-Gapped DGX Station System
Chapter 4. Maintaining and Servicing the NVIDIA DGX Station
4.1. Problem Resolution and Customer Care
4.2. Cleaning the Mesh Filter Under the DGX Station
4.3. Since DGX OS Desktop 4.4.0: Checking the Health of and Collecting Troubleshooting Information for the DGX Station
4.4. DGX OS Desktop 4.3.0 and Earlier: Collecting Information for Troubleshooting the DGX Station
4.5. DGX OS Desktop 4.3.0 and Earlier: Checking the Health of the DGX Station
4.6. Replacing the System and Components
4.6.1. Replacing the System
4.6.2. Repacking the DGX Station for Shipment
4.6.3. Replacing a DIMM
4.6.4. Replacing the CMOS Power Cell in the DGX Station
4.7. Maintaining the DGX Station Persistent Storage
4.7.1. Changing the RAID Level of the RAID Array
4.7.2. Checking the Status of the DGX Station RAID Array
4.7.3. Checking the Status of the DGX Station SSDs
4.7.4. Adding or Replacing an SSD
4.7.5. Rebuilding the DGX Station RAID Array
4.7.6. Configuring the SSDs for Data Storage as an NFS Cache
4.7.7. Sanitizing the DGX Station Persistent Storage
4.7.7.1. Running an Ubuntu Desktop LiveCD Session on the DGX Station
4.7.7.2. Sanitizing All DGX Station SSDs
4.8. Restoring the DGX Station Software Image
4.8.1. Obtaining the DGX Station Software ISO Image and Checksum File
4.8.2. Creating a Bootable Installation Medium
4.8.2.1. Creating a Bootable USB Flash Drive by Using Startup Disk Creator
4.8.2.2. Creating a Bootable USB Flash Drive by Using Akeo Rufus
4.8.3. Verifying the Bootable Installation Medium
4.8.3.1. Verifying a Bootable USB Flash Drive
4.8.3.2. Verifying a Bootable DVD-ROM
4.8.4. Installing the DGX Station Software Image from a USB Flash Drive or DVD-ROM
4.9. Updating the DGX Station System BIOS
4.10. Maintaining the GPU Liquid Cooling System
4.10.1. Monitoring GPU Temperatures
4.10.2. Checking the Level of the Liquid in the GPU Cooling System
4.10.3. Replenishing the Liquid in the GPU Cooling System
Appendix A. Safety
A.1. Intended Application Uses
A.2. General Precautions
A.3. Electrical Precautions
A.4. Communications Cable Precautions
A.5. Other Hazards
Appendix B. Connections, Controls, and Indicators
B.1. Front-Panel Connections and Controls
B.2. Rear-Panel Connections and Controls
B.3. LAN Port Indicators
B.4. Audio I/O Connections
Appendix C. Compliance
C.1. DGX Station Model Number
C.2. Argentina
C.3. Australia/New Zealand
C.4. Brazil
C.5. Canada
C.6. China
C.7. European Union
C.8. India
C.9. Israel
C.10. Japan
C.11. Russia
C.12. South Africa
C.13. South Korea
C.14. Taiwan
C.15. United States
C.16. United States/Canada
C.17. Vietnam
Appendix D. DGX Station Hardware Specifications
D.1. Environmental Conditions
D.2. Component Specifications
D.3. Mechanical Specifications
D.4. Power Specifications
Appendix E. Customer Support for the NVIDIA DGX Station

About this Guide
DGX Station User Guide explains how to install, set up, and maintain the NVIDIA® DGX
Station™.
This guide is aimed at users and administrators who are familiar with the Ubuntu Desktop
Linux OS, including use of the command line and the sudo command.
Note: The instructions in this guide for software administration apply only to the DGX OS
Desktop. They don't apply if the DGX OS Desktop software that is supplied with the DGX Station
has been replaced with the DGX software for Red Hat Enterprise Linux or CentOS.
For additional information to help you use the DGX Station, see the following table.
Task                                                  Additional Information
Use the Ubuntu Desktop Linux OS                       ‣Ubuntu 18.04 Desktop Guide (https://help.ubuntu.com/18.04/ubuntu-help/index.html)
                                                      ‣Ubuntu 16.04 Desktop Guide (https://help.ubuntu.com/16.04/ubuntu-help/index.html)
Find out about the DGX OS Desktop software for        DGX OS Desktop Release Notes
the DGX Station
Use the DGX Station to download and run               NGC Container Registry for DGX User Guide
containers for deep learning frameworks
Use deep learning frameworks optimized for            NVIDIA Deep Learning Frameworks Documentation
NVIDIA DGX systems                                    (https://docs.nvidia.com/deeplearning/dgx/)
Use the tools and libraries in the DGX OS Desktop     NVIDIA Deep Learning SDK Documentation
for development of deep learning frameworks           (https://docs.nvidia.com/deeplearning/sdk/)

Chapter 1. Introduction to the NVIDIA® DGX Station™
The NVIDIA DGX Station is a fast, multi-GPU workstation for deep learning and AI analytics.
You can use the DGX Station to run neural networks and deploy deep learning models.
Because the DGX Station is software compatible with the NVIDIA DGX-1 server, you can also
use the DGX Station to optimize applications to run on a production DGX-1 cluster.

1.1. What's in the Box
‣DGX Station
‣Accessory boxes containing:
  ‣Quick Start Guide
  ‣AC power cable
  ‣3 DisplayPort™ 1.2 to HDMI 2.0 adapters
  ‣USB recovery flash drive containing a backup copy of the operating system image and CUDA toolkit
  ‣DVD-ROM containing source code of open-source software installed on the DGX Station
  ‣Toxic Substance Notice and Safety Instructions
  ‣Declaration of Conformity
  ‣Repacking Instructions/Intra-Transit
Inspect each piece of equipment in the packing box. If anything is missing or damaged, contact
your supplier.
1.2. DGX OS Desktop Software Summary
The DGX OS Desktop software that is supplied with the DGX Station includes the software that
you need for downloading and running containers for deep learning frameworks. The software
is already installed on the DGX Station, except where licensing requirements mandate that the
software be supplied separately. Any software that must be supplied separately is installed
automatically when the DGX Station is first powered on.
For details about the DGX OS Desktop software, refer to DGX OS Desktop Release Notes.
Note:
You can replace the DGX OS Desktop software that is supplied with the DGX Station by
installing the DGX software for Red Hat Enterprise Linux or CentOS. For instructions, see:
‣DGX Software for Red Hat Enterprise Linux 7 - Installation Guide
‣DGX Software for CentOS - Installation Guide

1.3. DGX Station Hardware Summary
Processors
Component             Qty   Description
CPU                   1     Intel Xeon E5-2698 v4 2.2 GHz (20-Core)
GPU - current units   4     NVIDIA Tesla® V100-DGXS-32GB with 32 GB per GPU (128 GB total) of GPU memory
GPU - earlier units   4     NVIDIA Tesla V100-DGXS-16GB with 16 GB per GPU (64 GB total) of GPU memory

System Memory and Storage
Component       Qty   Unit Capacity   Total Capacity   Description
System memory   8     32 GB           256 GB           ECC Registered RDIMM DDR4 SDRAM
    Note: You can replace all eight factory-installed 32-GB DIMMs with 64-GB DIMMs to give a total capacity of 512 GB.
Data storage    3     1.92 TB         5.76 TB          2.5" 6 Gb/s SATA III SSD in RAID 0 configuration
    Note: Since DGX OS Desktop 4.4.0 or DGX software for Red Hat Enterprise Linux or CentOS EL7-20.02: You can add four 1.92-TB SSDs for data storage to give a total capacity of 13.44 TB in a RAID 0 configuration.
OS storage      1     1.92 TB         1.92 TB          2.5" 6 Gb/s SATA III SSD
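The totals in the hardware summary follow from multiplying each quantity by its unit capacity. This works for the data storage row too, because a RAID 0 array stripes data without redundancy, so its capacity is the sum of its member drives. The short Python sketch below reproduces that arithmetic; the function and variable names are hypothetical and chosen only for illustration.

```python
from decimal import Decimal


def total_capacity(quantity: int, unit_capacity: str) -> Decimal:
    """Return quantity x unit capacity (hypothetical helper, not an NVIDIA tool).

    Decimal is used so that values such as 1.92 TB multiply without
    binary floating-point rounding noise.
    """
    return quantity * Decimal(unit_capacity)


# Quantities and unit capacities copied from the hardware summary tables.
gpu_memory_gb = total_capacity(4, "32")              # current units, 32 GB per GPU
system_memory_gb = total_capacity(8, "32")           # factory-installed DIMMs
upgraded_memory_gb = total_capacity(8, "64")         # optional 64-GB DIMMs
data_storage_tb = total_capacity(3, "1.92")          # factory RAID 0 array
expanded_storage_tb = total_capacity(3 + 4, "1.92")  # with four added SSDs

for label, value in [
    ("GPU memory (GB)", gpu_memory_gb),
    ("System memory (GB)", system_memory_gb),
    ("System memory, 64-GB DIMMs (GB)", upgraded_memory_gb),
    ("Data storage (TB)", data_storage_tb),
    ("Data storage, expanded (TB)", expanded_storage_tb),
]:
    print(f"{label}: {value}")
```

The expanded figure counts seven data SSDs in total: the three factory-installed drives plus the four optional ones.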

Chapter 2. Setting Up the NVIDIA DGX Station
Before using the DGX Station, ensure that its initial set-up is complete.
2.1. Siting the DGX Station
CAUTION:
The DGX Station weighs 88 lbs (40 kg). Do not attempt to lift the DGX Station. Instead, remove
the DGX Station from its packaging and move it into position by rolling it on its fitted casters.
To prevent damage to components inside the DGX Station, do not subject the DGX Station to
excessive vibration or mechanical shock. After moving or transporting the DGX Station, visually
inspect the NVLINK bridge, which connects the GPUs, and the drive trays in the drive cage
to see if they have shifted out of position. If any of these components has shifted, reseat the
component before operating the DGX Station.
Site the DGX Station in a location that is clean, dust-free, well ventilated, and near an
appropriately rated, grounded AC power outlet.
Leave approximately 5" (12.5 cm) of clearance behind and at the sides of the DGX Station to
allow sufficient airflow for cooling the unit.
When operating the DGX Station, keep the ambient temperature and relative humidity within
the following ranges:
‣Ambient temperature: 10°C to 30°C (50°F to 86°F)
‣Relative humidity: 10% to 80% (non-condensing)
Always keep the DGX Station upright. Do not lay the unit on its side.
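If you log room conditions with your own sensor, the operating ranges above can be checked with a few lines of code. The following Python sketch is only an illustration under that assumption: the constant and function names are hypothetical, and the readings must come from your own equipment, because this guide does not describe a tool for measuring ambient conditions.

```python
# Operating limits copied from the siting requirements in this section.
TEMP_RANGE_C = (10.0, 30.0)        # ambient temperature, degrees Celsius
HUMIDITY_RANGE_PCT = (10.0, 80.0)  # relative humidity, percent (non-condensing)


def celsius_to_fahrenheit(celsius: float) -> float:
    """Convert a temperature from degrees Celsius to degrees Fahrenheit."""
    return celsius * 9 / 5 + 32


def within_operating_range(temp_c: float, humidity_pct: float) -> bool:
    """Return True if both readings fall inside the documented ranges.

    Hypothetical helper: the readings are supplied by the caller.
    """
    temp_ok = TEMP_RANGE_C[0] <= temp_c <= TEMP_RANGE_C[1]
    humidity_ok = HUMIDITY_RANGE_PCT[0] <= humidity_pct <= HUMIDITY_RANGE_PCT[1]
    return temp_ok and humidity_ok


# The Fahrenheit limits in the text follow from the Celsius limits.
low_f = celsius_to_fahrenheit(TEMP_RANGE_C[0])
high_f = celsius_to_fahrenheit(TEMP_RANGE_C[1])
print(f"Temperature range: {low_f:.0f}°F to {high_f:.0f}°F")

# Example readings (made-up values for illustration only).
print(within_operating_range(22.0, 45.0))
print(within_operating_range(35.0, 45.0))
```

The check treats both limits as inclusive, which is one reasonable reading of "10°C to 30°C"; tighten the comparison if your site policy requires a safety margin.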