  • Senior DevOps Service Reliability

    NVIDIA (Santa Clara, CA)
    …Support), you will partner with other key members of our organization including Site Reliability Engineering, Security Operations Center, DevOps teams, and ... Administration, SRE, or NOC). + BS in Computer Science, Engineering, Physics, Mathematics, or equivalent experience. + Expert-level knowledge of Linux system…
    NVIDIA (11/15/25)
  • Senior, Software Engineer

    Walmart (Sunnyvale, CA)
    …orders daily through our high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with L2, ... established criteria (for example, probability of failure, frequency of failure) to measure site reliability. Monitors site reliability conditions and…
    Walmart (11/14/25)
  • Staff Software Engineer - eCommerce

    General Motors (Mountain View, CA)
    …team on production support, perform root cause analysis, resolve incidents, solve problems. (Site Reliability Knowledge - SRE preferred) + Architecture ... in CI/CD pipelines, observability frameworks (e.g., Datadog), incident response and reliability engineering. + Leverage your technical leadership to ensure…
    General Motors (11/19/25)
  • Principal DevOps Engineer (Cortex- Prisma Cloud)

    Palo Alto Networks (Santa Clara, CA)
    …and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering + Troubleshooting - Ability to effectively ... monitoring tools and practices. **Your Impact** As a Principal SRE with the Prisma Cloud DevOps team, you will: ... our SaaS product + Collaborate - Work with our Engineering teams to influence the operability of the product…
    Palo Alto Networks (11/06/25)
  • Sr. Platform Engineer (Hadoop Admin)

    Hyundai Autoever America (Fountain Valley, CA)
    Engineering, or a related field + 10+ years of experience in Platform Engineering, Site Reliability Engineering, or similar roles, with proven ... Hyundai AutoEver America is seeking a highly experienced Senior or Lead Platform Engineer/Site Reliability Engineer (SRE)/Hadoop Admin to manage and enhance…
    Hyundai Autoever America (10/02/25)
  • Product Manager II - Core Infrastructure

    Coinbase (Sacramento, CA)
    …large, dynamic workloads with high reliability. * Skilled collaborator with engineering, SRE, and security partners. * Familiarity with cloud infrastructure ... resilience, and efficiency of Coinbase's core infrastructure. You will work closely with engineering and SRE teams to evaluate cloud platforms, improve compute…
    Coinbase (10/08/25)
  • AVP, Technology Operations

    PennyMac (Westlake Village, CA)
    …field + 8+ years of progressive experience in technology operations, infrastructure management, or site reliability engineering, with at least 3-5 years in a ... of Technology Operations is a key leadership role responsible for overseeing the Site Reliability Operations (SRO) team that provides 24/7 monitoring and support…
    PennyMac (10/23/25)
  • Senior Solutions Architect

    Amazon (San Francisco, CA)
    …including ML/AI systems, cloud-native architectures (Kubernetes, microservices, serverless), DevOps/SRE practices, and data engineering/strategy, with advanced ... security, compliance, and observability in containerized workloads. * DevOps & Reliability: * Drive GitOps-first workflows, CI/CD at scale, IaC (Terraform/CDK), and…
    Amazon (09/23/25)
  • Senior DevOps Program Manager

    Keeper Security, Inc. (El Dorado Hills, CA)
    …field (Master's preferred) + 8+ years of experience in DevOps, Cloud Infrastructure, or Site Reliability Engineering + 4+ years of technical program or ... ideal for a hands-on technical program leader who excels at bridging engineering execution with strategic planning. Keeper's cybersecurity software is trusted by…
    Keeper Security, Inc. (11/22/25)
  • Cisco Meraki - Product Manager - Cloud Platform…

    Cisco (San Francisco, CA)
    …principles such as SLIs/SLOs, incident management, toil reduction, and reliability engineering practices. + Familiarity with CI/CD pipelines, ... and execution behind scalable, reliable cloud infrastructure. You'll partner closely with engineering to build the platforms and tools that power our products…
    Cisco (11/14/25)