Senior Site Reliability Engineer - Datacenter Automation
Nvidia · JR2011458
NVIDIA is hiring experienced SRE engineers to help scale up its AI Infrastructure. We expect you to have significant experience with site reliability principles and techniques including reliability assessments, incident management processes, production system observability, monitoring and alerting, automated deployments and toil elimination. We view SRE as a software engineering discipline and expect significant contributions to our codebase. We welcome out-of-the-box thinkers who can provide n…
Apply on original site