- NVIDIA (Santa Clara, CA)
- …at NVIDIA, you will own the development of DGX Cloud strategy for observability, monitoring, and remediation across all layers of infrastructure, IaaS, platforms ... define and drive the technical implementation for DGX Cloud offerings in the observability, monitoring, and remediation practice. + Collaborate on Cross Domain…
- LinkedIn (Mountain View, CA)
- …and driving systemic improvements in availability and performance. Previous experience in a Distinguished Engineer or equivalent role at a high-growth or web-scale ... for operational excellence and incident response. Define and build frameworks to improve monitoring, alerting, and observability across hundreds of services and…