- NVIDIA (Santa Clara, CA)
- …resiliency, or observability. + Hands-on experience as a Machine Learning Engineer (MLE) or deep familiarity with DL frameworks (e.g., PyTorch, TensorFlow, JAX, ... this progress. We're looking for a Senior Full-Stack Software Engineer to join our DGX Cloud AI Infrastructure team ... Develop APIs, backend services, and UIs to improve visibility, observability, and control over large-scale GPU clusters + Develop…
- MongoDB (Palo Alto, CA)
- …next-generation, AI-powered applications. **About the Role** We're looking for a Staff Engineer to join our team building the inference platform for embedding models ... integrated into Atlas and optimized for developer experience. As a Staff Engineer, you'll be hands-on with design and implementation, while working with engineers…
- Cisco (San Jose, CA)
- Senior Distributed Golang Software Engineer, Isovalent Tetragon Team (US) Apply (https://jobs.cisco.com/jobs/Login?projectId=1444334) + Location: Offsite, San Jose, ... open-source software and enterprise solutions solving networking, security, and observability needs for modern cloud native infrastructure. The flagship technology,…
- Coinbase (Sacramento, CA)
- …wide system's reliability and less customer impact. As a *Senior Software Engineer* you will help to promote reliability culture across Coinbase. You would be ... a daily basis. *What you'll be doing (i.e., job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build…
- The Walt Disney Company (Santa Monica, CA)
- …with other parts of The Walt Disney Company. **Job Summary:** The Data Reliability Engineer II will help us in the ongoing mission of delivering outstanding services ... members of our team to monitor and drive improvements for reliability and observability of critical data pipelines and deliverables. This is a high-impact role where…
- The Walt Disney Company (Glendale, CA)
- …across all media platforms. **Job Summary:** We're looking for a Principal Platform Engineer, Infrastructure & Tooling for Services, Data, and GenAI to help shape ... (EKS) with integrated Istio service mesh for traffic management and observability. + Architect secure network configurations, including VPC design, IAM, peering,…
- JPMorgan Chase (Palo Alto, CA)
- …perfect environment to have a major impact. As a **Principal Site Reliability Engineer** at JPMorgan Chase within the **Enterprise Technology, AI/ML & Data Platforms ... ensure operational efficiency. **Job responsibilities** + Architect and implement observability platforms and tools for proactive detection and continuous…
- RELX INC (San Diego, CA)
- Are you a Site Reliability Engineer looking to support an AI-powered technology product that enhances digital identity onboarding and identity verification? About ... you will drive reliability, implement automation, and enhance telemetry and observability to ensure system performance. About the team: This team supports…
- Cisco (CA)
- …a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers ... technical and domain expertise to introduce and operationalize Security and Observability use-cases and solutions. The Solutions Engineers (SEs) are Splunk's…
- ADP (Pasadena, CA)
- **ADP is hiring a Lead Software Engineer.** + _Are you empathetic to client needs and inspired by transformation and impacting the lives of millions of ... for you. Ready to design what's next?_ We are seeking a Lead Software Engineer to lead the technology transformation of our Tax Compliance platform into a modern,…