- NVIDIA (WA)
- …building the next generation of GPU-accelerated Kubernetes runtime distributions. As a Software Engineer on the Runtime team, you will design and build automation ... systems that enable operators to seamlessly install, upgrade, and ... automation-first, self-service tools that minimize manual effort while enhancing reliability and reproducibility. What you will be doing: +…
- Microsoft Corporation (Redmond, WA)
- …driving operational excellence across the Microsoft Cloud to strengthen quality, reliability, security, and customer trust. As part of EngOps, you'll design ... of empowerment, inclusion, and growth mindset defines how we work. Azure Reliability is driving transformation to AI-powered operations by building scalable ML…
- Oracle (Olympia, WA)
- …all areas of cloud service software engineering: high-scale distributed systems, virtualized infrastructure, identity, security, observability, and user experience. ... fast, still at an early stage, and working on ambitious new initiatives. An engineer at any level can have a significant technical and business impact here. You…
- Microsoft Corporation (Redmond, WA)
- …optimize AI agent performance. We are seeking a passionate and skilled software engineer to join the Observability platform team. This team is responsible for ... analysis, model serving and model evaluation. + Design and develop scalable systems for benchmarking AI models, including pipelines for automated evaluation, metric…
- Microsoft Corporation (Redmond, WA)
- …On this team, you'll play a key role in improving the performance and reliability of customizing some of the world's most advanced AI models. You'll gain hands-on ... the pace of development on the team. + Considers diagnosability, reliability, testability, and maintainability when reviewing code, and understands when code…
- Oracle (Olympia, WA)
- …the Healthcare AI roadmap and deliver value to customers and partners. As ML Engineer, you will drive the scientific vision for Healthcare AI systems. ... production experience with LLM agents 3. Proven record improving agent reliability via reward modeling, policy learning, constitutional methods, tool-use strategies,…
- Palo Alto Networks (Seattle, WA)
- …through runtime. We are looking for a highly skilled and experienced Senior Staff Engineer to join the Prisma AIRS Model Security team. In this role, you will ... part in designing, developing, and optimizing robust, scalable, and high-performance backend systems that power our cutting-edge AI security platform. This role is…
- Oracle (Olympia, WA)
- …services for running applied science models, with an emphasis on scalability, reliability, and security in Oracle Cloud Infrastructure (OCI). 3. Collaborate closely ... **(user-agent, agent-agent and multimodal)** using message queues and data streaming systems. 5. Use and extend containerization practices with Docker; deploy and…
- Oracle (Olympia, WA)
- …storage access. If you thrive at the intersection of large-scale distributed systems, high-speed networking, and AI workloads, this role offers the opportunity to ... comms) + Develop production-grade, high-performance software features with rigorous reliability, observability, and security. + Define performance goals and success…
- Teradata (Olympia, WA)
- …AI/ML inference into customer-facing workloads, with strong emphasis on reliability, explainability, and governance. + Modernize the platform through microservices, ... leadership. + 15+ years of experience in enterprise architecture, distributed systems, or large-scale analytics platforms. + Demonstrated experience integrating AI…