- The Walt Disney Company (Glendale, CA)
- …+ 10+ years of experience across Infrastructure, DevOps, Software Engineering, or Site Reliability Engineering in large-scale cloud environments. + Deep ... SRE, and application teams to deliver platform capabilities that improve developer experience, streamline operations, and support rapid, high-quality delivery of new…
- JPMorgan Chase (Palo Alto, CA)
- …AIOps platforms. Our mission is to enhance scalability, security, and reliability for CDAO-hosted managed services. As a Machine Learning Engineer within ... deployment into production environments. + Optimize software applications for performance, reliability, and scalability. + Conduct code reviews and provide technical…
- ServiceNow, Inc. (Pleasanton, CA)
- …for infrastructure changes with drift detection and remediation. **Observability & Site Reliability Engineering** + Architect comprehensive monitoring using ... Cloud Development Environment platforms including Coder for workspace provisioning. + Site Reliability Engineering: SLI/SLO design, error budgets, chaos…
- Cadence Design Systems, Inc. (San Jose, CA)
- …to make an impact on the world of technology. Seeking a technical developer for Cadence's Hardware Emulation Cloud to develop scalable and secure monitoring platform ... Key Responsibilities: + Implement monitoring framework to improve infrastructure reliability, observability, and alerts. + Identifying and implementing automation…
- Palo Alto Networks (Santa Clara, CA)
- …Impact** + Work with development teams to ensure that applications have scalability and reliability built-in from day one - agile is second nature to you and you're ... + Design, review and enhance software architecture to improve scalability, service reliability, cost, and performance - you've helped create services that are…
- Amazon (Santa Monica, CA)
- …* Design and architect distributed systems at scale while ensuring reliability, performance, and cost efficiency across the organization * Lead organization-wide ... standards across multiple teams, establishing best practices for system observability, reliability, and incident management * Drive technological innovation at scale…
- Palo Alto Networks (Santa Clara, CA)
- …Impact** + Work with development teams to ensure that applications have scalability and reliability built-in from day one - agile is second nature to you and you're ... + Design, review and enhance software architecture to improve scalability, service reliability, cost, and performance - you've helped create services that are…
- Coinbase (Sacramento, CA)
- …and level-up engineers across the team, creating a multiplier effect on developer velocity and platform reliability. * Collaborate cross-functionally with ... best practices adopted by every Coinbase product. * Own design and reliability of business-critical Tier-0/Tier-1 backend systems used by millions of customers. *…
- Amazon (San Bernardino, CA)
- …This position can be based anywhere in the US. Amazon's North America Reliability Maintenance & Engineering (RME) team needs a dynamic Regional Maintenance Manager ... and guide field teams in developing effective decision-making tools for site managers. You'll evaluate Fulfillment Center RME departments' performance and implement…
- Amazon (San Francisco, CA)
- …security, compliance, and observability in containerized workloads. * DevOps & Reliability: * Drive GitOps-first workflows, CI/CD at scale, IaC (Terraform/CDK), and ... on technology adoption, evolution, and competitive advantage. Scaling & Developer Enablement * Build repeatable solutions and reference architectures addressing…