- Walmart (Bentonville, AR)
- …better. **What You will Do** + Drive the design and evolution of monitoring and observability frameworks that enable proactive detection, root cause analysis, ... of customers of Walmart and its subsidiaries? As a Principal Site Reliability Engineer in Customer Engagement Services (CES) Tech Org, you'll lead efforts to ensure… more
- Walmart (Bentonville, AR)
- …**What you'll do ** **About the Role:** We are looking for a **Principal Software Engineer ** who is an **architect-level expert in Python** , with a deep mastery of ... extend across use cases and orgs. As a Principal Engineer , you set the vision, write the critical path...**from configuration schema (YAML)** to **execution trace logging** , ** observability ** , and **self-healing recovery patterns** . + Lead… more
- Cognizant (Santa Fe, NM)
- We are seeking a **Dynatrace Engineer ** for the end-to-end implementation, configuration, and optimization of Dynatrace monitoring solutions with a strong focus ... SaaS environments. This role demands deep technical expertise in observability platforms, cloud-native architectures, and automation tooling. **About Cognizant's CIS… more
- DEFTEC (Norfolk, VA)
- …and OpenStack. Proficient in implementing Site Reliability Engineering (SRE) and observability practices, including monitoring , logging, metrics, and distributed ... Cloud, VMware, OpenStack). + Implement site reliability engineering (SRE) and observability practices to ensure resilience, monitoring , logging, metrics, and… more
- Travelers Insurance Company (Hartford, CT)
- …**Target Openings** 1 **What Is the Opportunity?** Travelers is seeking a Software Engineer I to join our organization as we grow and transform our Technology ... including developing, analyzing, configuring, testing, debugging, troubleshooting, documenting, health monitoring /alerting, and implementing based on user or system design… more
- Waystar (Lehi, UT)
- …data licensing services and manage error budgets. + Build automation for deployment, monitoring , and incident response. + ** Observability & Monitoring ** + ... our licensed data products. This role is ideal for an experienced engineer who thrives in data-intensive environments and is passionate about building reliable,… more
- Northrop Grumman (Linthicum Heights, MD)
- …and testing workflows across various environments + Deploy and maintain robust monitoring , alerting, and observability tools (eg Prometheus, Grafana, ELK) to ... radical new energy-efficient computing systems. MDA is seeking a **DevOps Platform Engineer ** with demonstrated ability to support and enhance development of new… more
- OneMain Financial (Irving, TX)
- …Kubernetes, Docker, Openshift + Data & Format Handling: XML, JSON + Monitoring & Testing: OpenTelemetry, Prometheus, Grafana, Jenkins, GitHub Actions + Automation ... decisioning components for rule authoring, versioning, simulation, testing, and observability , delivered through intuitive UIs, APIs, and automation pipelines. +… more
- Wells Fargo (Chandler, AZ)
- **Overview** We are seeking a Senior Site Reliability Engineer (SRE) to help develop our platform operations across Windows, Linux, and cloud-native environments. ... of production systems across Windows, Linux, and GCP environments. + Engineer and support containerized workloads using Kubernetes (GKE) and Docker, enabling… more
- SAIC (VA)
- **Description** We are seeking a versatile **SRE/MLOps Engineer with DevSecOps expertise** to design, automate, and operate secure, scalable, and repeatable **model ... mission teams to move from experimentation to production with confidence. The engineer will not only support **ML lifecycle operations** (Databricks, MLflow, AWS… more