- Walmart (Bentonville, AR)
- …response, drive RCAs and performance/capacity tuning; build end‑to‑end observability (metrics/logs/traces) that keeps the platform fast and reliable. ... + Automation‑first operations (Everything as Code). Use Ansible and pipeline‑as‑code; turn runbooks into idempotent Python/Shell jobs; enforce policy‑as‑code and robust secrets management by default. + Deep toolchain + multi‑cloud integration. Wire GitHub with… more
- Coinbase (Phoenix, AZ)
- …proactively addressing technical debt and driving improvements in reliability and observability * Participate in code reviews and on-call rotation, lead incident ... response, and foster a team-wide environment that welcomes constructive feedback to maintain high code quality standards *What we look for in you (ie. job requirements): * 5+ years of experience in backend software development, with a strong focus on backend… more
- Coinbase (Jefferson City, MO)
- …proactively addressing technical debt and driving improvements in reliability and observability * Participate in code reviews and on-call rotation, lead incident ... response, and foster a team-wide environment that welcomes constructive feedback to maintain high code quality standards *What we look for in you (ie. job requirements): * 5+ years of experience in backend software development, with a strong focus on backend… more
- JPMorgan Chase (Seattle, WA)
- …systems (eg, ELK stack, Splunk). + Ability to implement and manage observability practices to ensure system reliability. + Proficiency in cloud platforms (eg, ... AWS, Azure, Google Cloud) and their services. + Experience in implementing SRE principles and practices to improve system reliability and availability. + Proficiency in SQL, NoSQL databases, and data warehousing solutions + Experience hiring, developing, and… more
- DoorDash (San Francisco, CA)
- …powers all of DoorDash's business. + Improve the reliability, scalability, and observability of our training and inference infrastructure. We're excited about you ... because + BS, MS, or PhD. in Computer Science or equivalent + Exceptionally strong knowledge of CS fundamental concepts and OOP languages + 6+ years of industry experience in software engineering + Prior experience building machine learning systems in… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …+ Implement monitoring framework to improve infrastructure reliability, observability , and alerts. + Identifying and implementing automation opportunities ... to reduce manual work and acceleration delivery. + Drive technical decisions on architecture, automation, and tooling. + Develop processes to track and scale key metrics for reliability, efficiency and scalability + Drive operational excellence by + Achieving… more
- ManpowerGroup (Columbus, OH)
- …engineers. + Champion best practices in test automation, CI/CD, and observability across cheminformatics platforms. **Qualifications** + PhD or Master's in ... Chemistry, Cheminformatics, Computational Chemistry, or related field (Bachelor's with exceptional experience considered). + 10+ years of experience in cheminformatics, computational chemistry, or scientific software development. + Strong programming skills in… more
- VetsEZ (CA)
- …while fostering a culture of experimentation and delivery excellence. + Observability and Reliability: Implement monitoring, logging, and automated alerting (eg, ... CloudWatch, Datadog, Prometheus) to ensure system reliability and traceability of AI workflows. + Governance and Compliance: Ensure all AI-enabled components meet HIPAA, VA, and NIST security requirements, aligning with enterprise healthcare standards. +… more
- MongoDB (New York, NY)
- …intelligent scaling algorithms, leveraging the latest hardware available, automating fleet-wide observability , and working closely with the MongoDB Server team to ... design new systems at a humongous scale. We are looking to speak to candidates who are based in New York City for our in office or hybrid working model. **What you'll do** + Lead a team of motivated individual contributors who are eager to learn and grow +… more
- Northrop Grumman (Manhattan Beach, CA)
- …Specialty, Azure Data Engineer Associate, or Google Professional Data Engineer . + MLOps Expertise, Observability Tools, Data Versioning, and Containerization ... for deploying data engineering workflows. + Expertise in cloud security best practices, including IAM, encryption, and compliance with frameworks like NIST or FedRAMP. + Knowledge of advanced networking concepts such as VPC peering, VPNs, and load balancing… more