- Google (Sunnyvale, CA)
- …Guide technical decisions, balancing the need for a reliable system and efficient incident response with highly dynamic customer priorities. + Ensure the ... Senior Staff Software Engineer, Site Reliability Engineering (Google; Sunnyvale, CA, USA) **Advanced** Experience owning outcomes and decision…
- Insight Global (South San Francisco, CA)
- …platforms like Amazon CloudWatch, Sumo Logic, Prometheus, and Grafana. * Background in incident response and post-mortem strategy. * Prior involvement in cloud ... Job Description We are seeking a hands-on Senior CloudOps Engineer to lead the execution of our AWS cloud...and services. * Strong skills in observability, cloud networking, security, and FinOps (cloud cost optimization). * Experience mentoring…
- IBM (San Jose, CA)
- …and innovation thrive. **Your role and responsibilities** As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, deploy, ... at enterprise scale. * Maintenance and Support: Tasks related to applying security patches and upgrades and collaborating with Product support for issue resolution.…
- General Motors (Mountain View, CA)
- …you run it" culture from initial design through deployment, monitoring, and production incident response. **What Will Give You a Competitive Edge (Preferred ... intelligent provisioning, and remote development workflows. As a Staff Software Engineer, you will architect and build the core platform services including…
- Robert Half-Robert Half Corporate (San Ramon, CA)
- …and resolution of moderate to complex issues in production platforms, defining incident response approaches and resolution playbooks. + Provides Level III ... We Are** Robert Half is seeking a Senior Software Engineer III - ATI to join our team supporting...support critical production issues, collaborating across development, application, and security teams. + Performs additional duties as assigned in…
- IBM (San Jose, CA)
- …and innovation thrive. **Your role and responsibilities** As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, deploy, ... at enterprise scale. * Maintenance and Support: Tasks related to applying security patches and upgrades, and collaborating with Product support for issue resolution.…
- Astellas Pharma (South San Francisco, CA)
- …on account and role design, backup and recovery, business continuity, and incident response linkages for these platforms. **Procedures, Templates, and Training** ... Role** As part of the Research Compliance team, the Validation Engineer plans and executes risk-based qualification and computerized systems validation activities…
- LinkedIn (San Francisco, CA)
- …metrics (crash‑free sessions, app performance). + You will participate in incident response and post‑mortems, helping define preventative guardrails and ... A/B testing, Feature flags, Analytics, CI/CD, Agile planning + Security & Privacy: Secure coding, Data minimization, PII handling...following up on an application, will not receive a response. LinkedIn will not discharge or in any other…
- CVS Health (Sacramento, CA)
- …across multiple regions and environments (cloud, on-premises, colocation). + Develop incident response and recovery strategies. **Required Qualifications:** + 5+ ... every day. **Position Summary** **Who You Are:** + A security expert who can write code as needed and...scale. + Strong passion and technical expertise to automate security functions via code, including pipeline and workflow automation.…
- LinkedIn (Mountain View, CA)
- …reliable, and performant traffic systems. + Lead design reviews, incident post-mortems, and production readiness reviews, ensuring adherence to reliability ... requirements. + Design, deploy, and optimize traffic routing and security over multiple content delivery networks. + Implement and...following up on an application, will not receive a response. LinkedIn will not discharge or in any other…