- Walmart (Sunnyvale, CA)
- …pipelines, observability) for reliability, scale, developer velocity, and an AI/agent-ready future **What you'll do** As the Distinguished Architect for ... Integrations), including reference architectures, guardrails, and roadmap for an AI/agent-ready platform. + **Modernize core membership services** (GraphQL, gateway,…
- NVIDIA (Santa Clara, CA)
- …Design, and build resilient distributed systems that power NVIDIA's next-generation AI-driven enterprise products and services. + Drive automation and ... observability improvements, using metrics and analytics to enhance performance,...and efficiency. + Collaborate across Cloud, Platform, Security, and AI/ML teams to implement modern SRE components that ensure…
- Rubrik (Sacramento, CA)
- …something that truly matters, protecting the world's data. As a Strategic Sales Engineer, you will provide technical direction and business guidance to the regional ... will be responsible for evangelizing, positioning, and architecting Rubrik's Data Resilience, Observability and Remediation tools to a targeted list of new &…
- NVIDIA (Santa Clara, CA)
- We are now seeking a Senior Infrastructure and Build Systems Engineer for the NVIDIA AI TensorRT-LLM team. This is a unique opportunity to take full ownership of the ... achieve our goals. If you're passionate about infrastructure, automation, observability, and compliance, we want you with us at...modularity of our build systems using CMake + Use AI to help build automated triaging workflows + Extensive…
- Walmart (Sunnyvale, CA)
- **Position Summary** **What you'll do** As a Staff Machine Learning Engineer, you will play a key role in designing, developing, and deploying machine learning ... engineering best practices, including API design, automated testing, CI/CD, and observability for ML-driven systems. + Contribute to innovation in MLOps,…
- NVIDIA (Santa Clara, CA)
- …by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts ... as the brains of computers, generative AI, robots, and self-driving cars that can understand the...work. We are looking for an experienced full‑stack software engineer to design and optimize large‑scale cloud infrastructure-driving company‑wide…
- NVIDIA (Santa Clara, CA)
- We are developing advanced multi-rack, multi-tenant AI/ML datacenters with NVIDIA GB200, and upcoming GB300 GPUs. NVIDIA seeks a Senior Software Engineer for our ... and debug the toughest Kubernetes + Slurm issues in multi-rack, multi-tenant AI datacenters. You'll tackle complex scheduling challenges across racks, tenants, and…
- NVIDIA (Santa Clara, CA)
- …be the best we can be. We are looking for a Senior Network Validation Engineer to lead and contribute hands-on to Network validation activities in the Datacenter ... and test coverage are optimal for Data Center scale AI products. The ideal candidate is self-motivated, works well...IPv6 & Telemetry at a Data Center scale with Observability tools like Grafana & Prometheus preferred NVIDIA is…
- NVIDIA (Santa Clara, CA)
- …maintaining vital systems efficiently and reliably. As a Senior Storage Product Engineer, you will take ownership of NVIDIA's Product Team's internal and ... detection and remediation of performance and reliability issues. + Optimize AI/ML and HPC workloads by crafting intelligent caching, low-latency storage invention,…
- NVIDIA (Santa Clara, CA)
- NVIDIA has become the platform upon which every new AI-powered application is built. From healthcare research applications to autonomous vehicles, or ... the center of this revolution. We are seeking a motivated Senior Systems Software Engineer to join our AV Infrastructure organization and become a key driver in…