- Oracle (Olympia, WA)
- …Prometheus, Grafana, ELK stack) to maintain system health, detect anomalies, and support incident response . + Provide technical leadership and guidance to teams ... on container technologies, microservices design, and cloud-native security best practices. + Establish configuration management for software revisions, security patches, and compliance (eg, FIPS-compliant images). + Develop and maintain documentation,… more
- Walmart (Bentonville, AR)
- …multiple teams-including coding, testing, CI/CD deployment, observability, monitoring, incident response , and maintenance. Implements distributed architectures ... optimized for real-time data processing, AI/ML integration, and cross-service reliability. Maintains architectural decision records (ADRs) to ensure traceability, alignment, and transparency in technical planning and tradeoffs. AI/ML Integration & Product… more
- Oracle (Des Moines, IA)
- …experience on a major public cloud, including observability, orchestration, and incident response . BS/MS in Computer Science, Electrical/Computer Engineering, or ... equivalent practical experience; proven technical leadership and mentoring. Preferred: Familiarity with high-performance IO paths; understanding of cross-region networking and latency trade-offs. Strong foundation in consensus and transactions. Expertise with… more
- Oracle (Phoenix, AZ)
- **Job Description** Overview Join OCI's Edge Security team as a Principal Engineer to architect and deliver cloud-scale DDoS protection. You'll lead design for ... DNS, and edge platform teams. - Set operational standards: SLOs/SLAs, on-call health, incident response (including incident commander duties), runbooks, and… more
- Herbalife (Torrance, CA)
- …reliability and customer experience. * Develop operational standards and runbooks for incident response , disaster recovery, and performance management. * Partner ... **Overview** **THE ROLE:** We are seeking a highly experienced Principal II, Site Reliability Engineer (SRE) to...embedding them into production systems. * Strong background in incident response , postmortems, and operational excellence. *… more
- LinkedIn (Mountain View, CA)
- … incident lifecycle across thousands of services and multiple regions, from incident response and mitigation, through problem management and post- incident ... and customer to experience LinkedIn as "always on", every engineer to benefit from a more insightful and proactive...are the backbone of how LinkedIn detects issues, coordinates incident response , captures context, and turns outages… more
- Mission Support and Test Services (Las Vegas, NV)
- …United States and its allies by providing high-hazard experimentation and incident response capabilities through operations, engineering, education, field, and ... **Responsiblities** MSTS is seeking a candidate for the role of Principal Mission Communications Specialist for the Global Mission Communications Programs (GMCP),… more
- New York State Civil Service (Syracuse, NY)
- …support construction operations. Responsibilities may include serving a role within the Incident Command System to support the department's response to regional ... NY HELP Yes Agency Transportation, Department of Title Principal Engineering Technician (NY HELPS)-Region 3 Occupational Category Other Professional Careers Salary… more
- Oracle (Pierre, SD)
- …mission-critical services + Champion operational excellence through proactive monitoring, testing, and incident response + Contribute to a culture of innovation, ... flexible, and inclusive environment where your contributions matter. As a Senior Principal Software Engineer IC5, you'll provide technical leadership to the… more
- Autodesk (San Francisco, CA)
- …fostering team growth and security excellence + Participate in architecture reviews, incident response , and risk assessments related to IAM **Minimum ... Overview** Autodesk's Cyber Defense team is looking for a Sr. Principal IAM Security Engineer to lead the strategy, design, and execution of secure, scalable… more