The NCP-AI Operations certification is an intermediate-level credential that validates a candidate’s ability to monitor, troubleshoot, and optimize AI infrastructure by NVIDIA. The exam is online and proctored remotely, includes 50 questions, and has a 90-minute time limit.
Topics Covered in the Exam
Topics covered in the exam include:
- Base Command Manager for configuration, management, and troubleshooting
- Slurm cluster administration
- Kubernetes cluster administration
- System management tools for troubleshooting and performance optimization
Candidate Audiences
- MLOps engineers
- DevOps engineers
- Solution architects
- System architects
- AI Infrastructure engineers
Prerequisites
Two to three years of operational experience working in a data center with NVIDIA hardware solutions. The candidate should be able to monitor and manage all the parts of a data center infrastructure in support of AI workloads.
Recommended training for this certification
- AI Infrastructure & Operations Fundamentals
- AI Operations Professional Workshop
Recertification
This certification is valid for two years from issuance. Recertification may be achieved by retaking the exam.