- Walmart (Sunnyvale, CA)
- …across e-commerce, supply chain, and in-store systems. + **Build intelligent observability and monitoring systems** using ML-driven anomaly detection, predictive ... automated deployment, validation, and rollback mechanisms for SRE tools and monitoring systems with built-in observability and performance monitoring…
- United Airlines (Chicago, IL)
- …alignment across the enterprise + Design and implement comprehensive end-to-end observability solutions that integrate with key monitoring tools across ... background in managing complex enterprise applications and infrastructure + Experience of observability and monitoring of enterprise-wide solutions. + Cloud and…
- Charles Schwab (San Francisco, CA)
- …of DevOps and SRE practices into the development lifecycle + Champion reliability, monitoring, observability, and operational best practices for AI systems and ... redefine how we serve our clients. As a Senior Engineer on AI.x, you will play a key role...Objectives, error budgets and incident response runbooks. + Implement observability frameworks for real-time monitoring of AI…
- J&J Family of Companies (Raritan, NJ)
- …and implement scalable solutions to streamline incident management, troubleshooting, and system monitoring. + Drive the adoption of observability platforms and ... technologies. This role involves designing and implementing strategies around observability, AI, and automation to optimize operational efficiency, predictability,…
- PennyMac (Westlake Village, CA)
- …maintaining service level agreements (SLAs) that meet or exceed business requirements. + Monitoring & Observability - Lead the development and implementation ... operational efficiency and system reliability. + Team Management - Lead, mentor, and develop a team of Site Reliability...of comprehensive monitoring and observability practices using New Relic and other tools to…
- Infinitive Inc (McLean, VA)
- …and cost-effective shared service across lines of business. Scalability, Cost, and Observability + Engineer platform capabilities that provide deep visibility ... into compute, storage, and catalog operations through integrated observability, monitoring, and FinOps practices. + Develop resource optimization strategies to…
- Lowe's (Charlotte, NC)
- …Git-based CI/CD for schema changes, refreshes, checks, and patching. + Monitoring & Observability: build actionable dashboards/alerts (e.g., Prometheus/Grafana); ... management). You'll be responsible for availability, performance, security, and observability across large-scale fleets, partnering closely with engineering, architecture, SRE,…
- Oracle (Nashville, TN)
- **Job Description** As a Principal Software Development Engineer (IC4), you will take ownership of designing, implementing, and operating core components of our ... resiliency, and low-latency performance across distributed environments. You will lead major features and services through the full development lifecycle,…
- Entergy (New Orleans, LA)
- …CI/CD optimization) to reduce manual effort and risk. + Advocate for observability and proactive monitoring practices across systems and applications. ... self-motivated, detail-oriented, and capable of working independently to develop observability solutions that enhance decision-making and system performance. **Job…
- TEKsystems (San Diego, CA)
- …Bash, or Ansible to streamline system operations. * Implement and maintain monitoring and observability tools such as SolarWinds, Dynatrace, Grafana, or ... Splunk. * Lead root cause analysis and implement preventive and corrective...iSCSI, RAID Configurations Automation & Scripting PowerShell, Bash, Ansible Monitoring & Observability SolarWinds, Dynatrace, Grafana, Splunk…