- Microsoft Corporation (Redmond, WA)
- …storage, networking). + **Automation & Tooling** : Build automation for deployments, incident response , scaling, and failover in hybrid cloud/on-prem CPU+GPU ... its benefits. We're looking for an experienced **Site Reliability Engineer (SRE)** to join our infrastructure team. In this...environments. + ** Incident Management** : Lead on-call rotations, troubleshoot production issues,… more
- pony.ai (Fremont, CA)
- …Experience with observability and SRE practices (Prometheus, Grafana, ELK, Datadog; SLOs, incident response , postmortems). + Familiarity with workloads common to ... public at NASDAQ in November 2024. Responsibilities As a (Senior) Kubernetes Engineer , you will: + Design, operate, and optimize Kubernetes clusters across hybrid… more
- Insight Global (Richardson, TX)
- …observability efforts including metrics, logs, traces, and alerting systems. . Participate in incident response and post‑ incident reviews; help reduce MTTR ... process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce- privacy -policy/. Skills and Requirements .… more
- Microsoft Corporation (Reston, VA)
- …repeatable, scalable solutions to guarantee quality. + Participate in on-call rotations and incident response to ensure high availability of services. + Develops ... Microsoft has an exciting opportunity for a Senior Software Engineer in the Cloud+AI Azure Data Explorer team. As...are followed to achieve a high degree of security, privacy , safety, and accessibility. Checks for visible evidence to… more
- Google (Sunnyvale, CA)
- …Guide technical decisions, balancing the need for a reliable system and efficient incident response with highly dynamic, customer priorities. + Ensure the ... Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes and decision… more
- Cardinal Health (Annapolis, MD)
- …family develops system back-up and disaster recovery plans, conducts incident responses, threat management, vulnerability scanning, virus management and intrusion ... and requires having an in-depth understanding of local, national and international privacy and security regulations such as HIPAA (Health Insurance Portability and… more
- Coinbase (Charlotte, NC)
- …and engaging with dApps or blockchain-based services. * You have experience with incident response , disaster recovery, and are interested in making systems ... the existence of Coinbase. We are looking for a seasoned Senior Software Engineer to join the Datastores (Infrastructure) team at Coinbase. The team's charter… more
- Coinbase (Charlotte, NC)
- …operational procedures, and troubleshooting steps across system lifecycle * Facilitate incident response , conduct root cause analysis, and blameless ... and fully supported. Coinbase is hiring! We are looking for an experienced system engineer (SE) to join the IT Operations Corporate Engineering team to build and… more
- J&J Family of Companies (Danvers, MA)
- …Improvements, Risk Assessments, Security Architecture Design, Security Framework, Security Incident Response , Security Planning, Security Policies, Standard ... team is recruiting for an experienced Sr Product Security Engineer to be based in Danvers, MA or Raritan,...to provide secure coding recommendations and execute reviews *Data privacy experience, including HIPAA and GDPR *Understanding of industry… more
- University of Washington (Seattle, WA)
- …system uptime and performance through proactive monitoring and SLA adherence. * Lead incident response and recovery for outages and security events. * ... and staff-we're the team that makes it all possible. **The Role: Senior Email Engineer ** Are you passionate about scaling enterprise email systems that keep a global… more