• Staff, Site Reliability Engineer

    Walmart (Bentonville, AR)
    …better. **What You will Do** + Drive the design and evolution of monitoring and observability frameworks that enable proactive detection, root cause analysis, ... of customers of Walmart and its subsidiaries? As a Principal Site Reliability Engineer in Customer Engagement Services (CES) Tech Org, you'll lead efforts to ensure…
    Walmart (08/26/25)
  • (USA) Principal, Software Engineer

    Walmart (Bentonville, AR)
    …**What you'll do** **About the Role:** We are looking for a **Principal Software Engineer** who is an **architect-level expert in Python**, with a deep mastery of ... extend across use cases and orgs. As a Principal Engineer, you set the vision, write the critical path...**from configuration schema (YAML)** to **execution trace logging**, **observability**, and **self-healing recovery patterns**. + Lead…
    Walmart (07/26/25)
  • Dynatrace Engineer

    Cognizant (Santa Fe, NM)
    We are seeking a **Dynatrace Engineer** for the end-to-end implementation, configuration, and optimization of Dynatrace monitoring solutions with a strong focus ... SaaS environments. This role demands deep technical expertise in observability platforms, cloud-native architectures, and automation tooling. **About Cognizant's CIS…
    Cognizant (10/08/25)
  • Devsecops Engineer

    DEFTEC (Norfolk, VA)
    …and OpenStack. Proficient in implementing Site Reliability Engineering (SRE) and observability practices, including monitoring, logging, metrics, and distributed ... Cloud, VMware, OpenStack). + Implement site reliability engineering (SRE) and observability practices to ensure resilience, monitoring, logging, metrics, and…
    DEFTEC (10/03/25)
  • Site Reliability Engineer (SWE-I)

    Travelers Insurance Company (Hartford, CT)
    …**Target Openings** 1 **What Is the Opportunity?** Travelers is seeking a Software Engineer I to join our organization as we grow and transform our Technology ... including developing, analyzing, configuring, testing, debugging, troubleshooting, documenting, health monitoring/alerting, and implementing based on user or system design…
    Travelers Insurance Company (09/02/25)
  • Specialist Site Reliability Engineer

    Waystar (Lehi, UT)
    …data licensing services and manage error budgets. + Build automation for deployment, monitoring, and incident response. + **Observability & Monitoring** + ... our licensed data products. This role is ideal for an experienced engineer who thrives in data-intensive environments and is passionate about building reliable,…
    Waystar (08/27/25)
  • Principal/ Senior Principal DevOps Platform…

    Northrop Grumman (Linthicum Heights, MD)
    …and testing workflows across various environments + Deploy and maintain robust monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, ELK) to ... radical new energy-efficient computing systems. MDA is seeking a **DevOps Platform Engineer** with demonstrated ability to support and enhance development of new…
    Northrop Grumman (08/08/25)
  • Associate Director Staff Engineer

    OneMain Financial (Irving, TX)
    …Kubernetes, Docker, Openshift + Data & Format Handling: XML, JSON + Monitoring & Testing: OpenTelemetry, Prometheus, Grafana, Jenkins, GitHub Actions + Automation ... decisioning components for rule authoring, versioning, simulation, testing, and observability, delivered through intuitive UIs, APIs, and automation pipelines. +…
    OneMain Financial (10/11/25)
  • Senior Site Reliability Engineer - GCP…

    Wells Fargo (Chandler, AZ)
    **Overview** We are seeking a Senior Site Reliability Engineer (SRE) to help develop our platform operations across Windows, Linux, and cloud-native environments. ... of production systems across Windows, Linux, and GCP environments. + Engineer and support containerized workloads using Kubernetes (GKE) and Docker, enabling…
    Wells Fargo (10/11/25)
  • SRE/MLOps Engineer

    SAIC (VA)
    **Description** We are seeking a versatile **SRE/MLOps Engineer with DevSecOps expertise** to design, automate, and operate secure, scalable, and repeatable **model ... mission teams to move from experimentation to production with confidence. The engineer will not only support **ML lifecycle operations** (Databricks, MLflow, AWS…
    SAIC (10/02/25)