- Qualcomm (Santa Clara, CA)
- …deployment and management. System Monitoring and Troubleshooting: Monitor system performance, troubleshoot issues, and implement logging and alerting ... systems, hardware, architecture, test engineers, and other teams to design system-level software solutions and obtain information on performance requirements…
- SproutsAI (Palo Alto, CA)
- …ensuring our platform can handle massive AI workloads with reliability and performance. As part of our cloud native engineering team, you'll work on ... management Implement and optimize cloud native solutions for scalability, reliability, and performance Contribute to code reviews, ... You'll Thrive Here If You 7+ years of software engineering experience with focus on distributed systems…
- Resolve AI (San Francisco, CA)
- …practices for the team. Incorporate excellence in reliability, scalability, and performance into every layer of system design, particularly in distributed ... design. Build and manage critical platform assets, ensuring seamless integration, reliability, and performance across environments. Write clean, maintainable,…
- Disneyland Hong Kong (Glendale, CA)
- …or ML platform DevOps roles. Knowledge of multi‑agent orchestration patterns and operational reliability for AI systems. Strong background in test automation and ... for Disney's industry‑leading ad technology and products - driving advertising performance, innovation, and value in Disney's sports, news, and entertainment…
- The Walt Disney Company (Glendale, CA)
- …or ML platform DevOps roles. Knowledge of multi‑agent orchestration patterns and operational reliability for AI systems. Strong background in test automation and ... for Disney's industry-leading ad technology and products - driving advertising performance, innovation, and value in Disney's sports, news, and entertainment…
- Klaviyo Inc. (San Francisco, CA)
- …Demonstrated success embedding AI into enterprise IT or SaaS environments, improving system reliability, scalability, and observability through AI and automation ... to identity and access frameworks and by implementing AIOps practices that increase reliability and performance. Build the foundation for an AI‑native workforce…
- Carollo Engineers, Inc. (San Francisco, CA)
- …systems, ensuring efficient and reliable performance across all operational systems. Collaborate with engineering teams to develop system ... systems that seamlessly integrate with plant operations, driving improved efficiency, reliability, and performance. At Carollo, you'll make a meaningful…
- KLA-Belgium (Milpitas, CA)
- …Engineer is a plus. * Experience with KLA products is a plus. * Experience with Reliability Engineering is a plus. * Software knowledge in Python, Visual Basic is ... of the PLC, and improve short- and long-term product service revenue performance * Drive continuous improvement of system hardware, diagnostics and software,…
- NerdWallet, Inc (San Francisco, CA)
- …existing code, conduct thorough testing, and troubleshoot complex issues to improve system performance and reliability. Senior software engineers also ... Design & Architecture - Experience in designing scalable, distributed, and high‑performance systems. Databases - Knowledge of SQL (PostgreSQL, MySQL) and…
- Harness Inc (Mountain View, CA)
- …includes modules for CI, CD, Cloud Cost Management, Feature Flags, Service Reliability Management, Security Testing Orchestration, Chaos Engineering, Software ... environment Authors software functional specifications and design documents Quickly understand complex systems/code and own key pieces of the system, including…