- Deloitte (Costa Mesa, CA)
- Reliability Engineer - Manager Join our AI & Engineering team in transforming technology platforms, driving innovation, and helping make a significant impact on our ... Recruiting for this role ends on 3/15/2026. Work You'll Do As a Reliability Engineer - Manager, you will lead teams in ensuring the stability, performance, and… more
- Broadcom (CA)
- …Sign-In before you apply. **Job Description:** **Senior Software Engineer, Tanzu Intelligent Assist** **About the Role** We are seeking a ... Staff Engineer to lead the design, development, and scaling of ... system reliability and performance at scale. + Implement comprehensive observability through logging, monitoring, and metrics, anticipating potential failure… more
- Charles Schwab (San Francisco, CA)
- …GenAI initiatives that redefine client experiences. We're seeking a Senior AI Engineer who will design and deliver cutting-edge GenAI applications, enhancing the ... of AI at Schwab, with a special emphasis on site reliability, monitoring, observability, and operations. You'll ensure that the systems you build are robust,… more
- DoorDash (San Francisco, CA)
- …Logistics, Fraud, and Search. About the Role We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation of ... back where it makes sense - to accelerate innovation. As Staff Software Engineer, you'll pair deep technical execution with influence on the roadmap, ensuring our… more
- NVIDIA (Santa Clara, CA)
- …resiliency, or observability. + Hands-on experience as a Machine Learning Engineer (MLE) or deep familiarity with DL frameworks (e.g., PyTorch, TensorFlow, JAX, ... this progress. We're looking for a Senior Full-Stack Software Engineer to join our DGX Cloud AI Infrastructure team ... Develop APIs, backend services, and UIs to improve visibility, observability, and control over large-scale GPU clusters + Develop… more
- Walmart (Sunnyvale, CA)
- …World's Largest Retail Network Walmart Global Tech is hiring a Principal Engineer to lead the design and development of our next-generation infrastructure… more
- MongoDB (Palo Alto, CA)
- We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search, retrieval, ... integrated into Atlas and optimized for developer experience. As a Lead Engineer, Inference Platform, you'll be hands-on with design and implementation, while… more
- Walmart (San Bruno, CA)
- **Position Summary** **Principal UX Designer / Engineer, AI Systems - Design Organization** **What you'll do** As a Principal UX Designer / ... Engineer, AI Systems in the Design Organization, you will: ... in Python and Node.js, focusing on performance, resilience, and observability. Harden prototypes into production-ready, secure, and scalable services… more
- Teradata (Sacramento, CA)
- …of reasoning, planning, and autonomous systems. + You are an excellent full-stack engineer who codes daily and owns systems end-to-end. + Build intuitive UI with ... (LangChain, AutoGPT, ReAct, etc.), and orchestration tools. + Experience with AI observability tools and practices (e.g., logging, monitoring, tracing, metrics for AI… more
- Oracle (Redwood City, CA)
- …streamlined process, increased productivity, and improved business decisions. Service Reliability Engineer will work on latest and greatest software stack to provide ... and reusable scripts/services to accelerate delivery and reduce errors. - Observability: define and implement monitoring, logging, alerting, and tracing strategies;… more