Online Access Free NCP-AII Practice Test

Exam Code: NCP-AII
Exam Name: NVIDIA AI Infrastructure
Certification Provider: NVIDIA
Number of Free Questions: 125
Posted: Jun 03, 2026
Rating: 100%

Question 1

A DGX H100 system shows intermittent "Link Down" errors on a 200G DAC cable. CVT reports "No Signal" despite physical connection. What is the first hardware check?

Question 2

During a multi-day NeMo burn-in, intermittent "GPU fell off bus" errors occur. Which diagnostic approach isolates hardware faults?

Question 3

A company has a registered NGC account and their server has NGC CLI installed. What step should be taken first to gain access to NGC?

Question 4

An infrastructure engineer runs an NCCL burn-in on an eight-node GPU cluster. Over a 12-hour period, all GPUs are tested with repeated all-reduce collectives. Monitoring tools show the following observations:
Aggregate bandwidth remains within 5% of documented reference for the hardware on every run.
No errors or timeouts are reported in NCCL logs.
On three occasions, one GPU logged single-run bandwidth dips of 15-20% compared to its normal performance, but performance recovered on the next run and stayed stable afterward. System logs show no hardware or driver errors.
Two minor NCCL WARN-level messages about "unexpected latency spike" appear in system logs for separate nodes, but could not be reproduced.
Which course of action is the best strategy before releasing the cluster to production?
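The observations in this question come down to simple arithmetic on per-run bandwidth figures, so they can be checked mechanically. The following is a minimal sketch, not an NCCL tool: the function names, the sample bandwidth values, and the choice of the median as the baseline are assumptions made for illustration. Only the 5% tolerance and the 15-20% dip range come from the question text.

```python
from statistics import median


def within_reference(aggregate_gbps, reference_gbps, tolerance=0.05):
    """Return True if aggregate bandwidth is within `tolerance` of the reference."""
    return abs(aggregate_gbps - reference_gbps) <= tolerance * reference_gbps


def find_dips(per_run_gbps, dip_threshold=0.15):
    """Return (run_index, fractional_drop) for runs that fall at least
    `dip_threshold` below the median of all runs (assumed baseline)."""
    baseline = median(per_run_gbps)
    dips = []
    for i, bw in enumerate(per_run_gbps):
        drop = (baseline - bw) / baseline
        if drop >= dip_threshold:
            dips.append((i, round(drop, 3)))
    return dips


def dips_are_transient(per_run_gbps, dips):
    """A dip is treated as transient if a following run exists and is not a dip."""
    dip_indices = {i for i, _ in dips}
    return all(
        i + 1 < len(per_run_gbps) and (i + 1) not in dip_indices
        for i in dip_indices
    )


# Hypothetical per-run bandwidth (GB/s) for one GPU; values are illustrative only.
runs = [100.0, 100.0, 82.0, 100.0, 100.0, 84.0, 100.0, 100.0, 80.0, 100.0]
dips = find_dips(runs)
print(dips)                            # run indices and fractional drops
print(dips_are_transient(runs, dips))  # whether every dip recovered on the next run
```

A check like this only summarizes the logged numbers. Whether isolated, recovering dips justify further testing before release remains a judgment call for the engineer.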

Question 5

A DGX server reports degraded performance and storage alerts. How would you use NVSM and nvidia-smi to troubleshoot both system and GPU issues?
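As a rough illustration of the GPU side of this workflow, the sketch below collects per-GPU data through `nvidia-smi --query-gpu` and flags hot GPUs. It assumes `nvidia-smi` is on the PATH. The helper names, the 85 °C limit, and the sample output are hypothetical. The system side (storage and overall health) would typically be inspected separately with NVSM, for example `sudo nvsm show health`, which this sketch does not call.

```python
import csv
import io
import subprocess

# Fields passed to nvidia-smi --query-gpu; all are standard query properties.
QUERY_FIELDS = ["index", "name", "temperature.gpu", "utilization.gpu", "memory.used"]


def parse_gpu_csv(text):
    """Parse `--format=csv,noheader,nounits` output into a list of dicts."""
    rows = []
    for record in csv.reader(io.StringIO(text), skipinitialspace=True):
        if not record:
            continue  # skip blank lines
        rows.append(dict(zip(QUERY_FIELDS, (field.strip() for field in record))))
    return rows


def query_gpus():
    """Run nvidia-smi and return parsed per-GPU rows (requires an NVIDIA driver)."""
    cmd = [
        "nvidia-smi",
        "--query-gpu=" + ",".join(QUERY_FIELDS),
        "--format=csv,noheader,nounits",
    ]
    out = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
    return parse_gpu_csv(out)


def hot_gpus(rows, limit_c=85):
    """Return indices of GPUs at or above `limit_c` (an assumed threshold)."""
    return [row["index"] for row in rows if int(row["temperature.gpu"]) >= limit_c]


# Hypothetical sample output, used so the parser can be exercised without a GPU.
sample = (
    "0, NVIDIA H100 80GB HBM3, 41, 12, 1024\n"
    "1, NVIDIA H100 80GB HBM3, 88, 97, 70211\n"
)
print(hot_gpus(parse_gpu_csv(sample)))
```

On a real system, `query_gpus()` would replace the sample string. Its results could then be read alongside the NVSM health report to separate GPU-level symptoms from storage or chassis alerts.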
