- Wells Fargo (Chandler, AZ)
- **Overview** We are seeking a Senior Site Reliability Engineer (SRE) to help develop our platform operations across Windows, Linux, and cloud-native environments. ... of production systems across Windows, Linux, and GCP environments. + Engineer and support containerized workloads using Kubernetes (GKE) and Docker, enabling… more
- Walmart (Bentonville, AR)
- …, ** observability ** , and **self-healing recovery patterns** . + Lead cross-org architecture reviews, influence roadmap prioritization, and set coding and design ... **What you'll do ** **About the Role:** We are looking for a **Principal Software Engineer ** who is an **architect-level expert in Python** , with a deep mastery of… more
- CoStar Realty Information, Inc. (Atlanta, GA)
- …reliability, and resource efficiency. Reliability Engineering + Build and maintain robust monitoring , alerting, and observability systems to ensure 99.9%+ SLA ... CoStar Real Estate Manager - Site Reliability Engineer Job Description CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real… more
- IBM (Austin, TX)
- … Monitoring and Troubleshooting: 1-3 years of experience in monitoring / observability , issue response, and troubleshooting for optimal system performance. ... and CI/CD tools such as Jenkins, IBM Continuous Delivery, ArgoCD. * Monitoring / Observability : knowledge or experience crafting alerts and dashboards using tools… more
- IBM (San Jose, CA)
- …for issue resolution. **Required technical and professional expertise** * System Monitoring and Troubleshooting: knowledge in monitoring / observability , issue ... and CI/CD tools such as Jenkins, IBM Continuous Delivery, ArgoCD. * Monitoring / Observability : knowledge or experience crafting alerts and dashboards using tools… more
- Palo Alto Networks (Santa Clara, CA)
- …drives great outcomes. **Your Career** We are looking for a Principal MLOps Engineer to lead the design, development, and operation of production-grade machine ... model versioning, reproducibility, auditing, and compliance best practices + Drive observability & monitoring : Develop real-time monitoring , alerting,… more
- Microsoft Corporation (Redmond, WA)
- …team is leading the next generation of observability by bringing AI observability to the forefront of cloud monitoring . We integrate Grafana deeply into ... applications without the operational overhead of managing their own observability stack. In this role, you will work as...In this role, you will work as a software engineer building and operating the Azure Managed Grafana, and… more
- IBM (Lowell, MA)
- …of DevOps principles in a cloud environment and familiarity with cloud monitoring tools to implement robust observability practices that prioritize metrics, ... critical part of HCP with a mission to provide observability data to the customers. Observability data...quality development with an emphasis on Golang development * Lead and execute large-scale projects, ensuring the reliable delivery… more
- ServiceNow, Inc. (Kirkland, WA)
- …that balance performance, maintainability, and extensibility + **Contribute to observability and monitoring ** of database systems, implementing instrumentation ... Azure, GCP) and containerization technologies (Docker, Kubernetes) + **Experience with monitoring and observability tools** for database systems +… more
- Hyundai Autoever America (Fountain Valley, CA)
- Purpose: Hyundai AutoEver America is seeking a highly experienced Senior or Lead Platform Engineer /Site Reliability Engineer (SRE)/Hadoop Admin to manage and ... CI/CD and scripting in Python and bash. + Practical knowledge of monitoring and observability tools (eg, Prometheus, Grafana, OpenTelemetry) and understanding… more