- NVIDIA (Santa Clara, CA)
- …encryption, access controls, and auditing mechanisms for storage systems. + Practice sustainable incident response and blameless root cause analysis. Be part of ... an on-call rotation to support storage and production systems. What We Need To See: + BS degree or equivalent experience in Computer Science, Storage Systems, or a related technical field with 8+ years of practical experience. + Experience with distributed and…
- Palo Alto Networks (Santa Clara, CA)
- …software engineers and SREs on release planning, deployment strategies, monitoring, and incident response to ensure reliable and predictable production behavior. ... **Your Experience** + Strong problem solver and collaborative team player with clear communication skills, able to work effectively across engineering, product, and SRE teams. + Solid foundation in Machine Learning, Deep Learning, and NLP, with hands-on…
- Coinbase (Lansing, MI)
- …and observability * Participate in code reviews and on-call rotation, lead incident response, and foster a team-wide environment that welcomes constructive ... feedback to maintain high code quality standards *What we look for in you (i.e., job requirements): * 5+ years of experience in backend software development, with a strong focus on backend systems * Expertise in languages such as Golang (preferred), C, Rust or…
- Coinbase (Santa Fe, NM)
- …and observability * Participate in code reviews and on-call rotation, lead incident response, and foster a team-wide environment that welcomes constructive ... feedback to maintain high code quality standards *What we look for in you (i.e., job requirements): * 5+ years of experience in backend software development, with a strong focus on backend systems * Expertise in languages such as Golang (preferred), C, Rust or…
- Walmart (Bentonville, AR)
- …system performance and data processing workflows. Lead production support and incident response, demonstrating deep expertise in service debugging, root-cause ... analysis, and rapid recovery of mission-critical systems. Work with diverse data and cloud technologies, including non-relational databases (e.g., Cassandra), analytical platforms (e.g., Spark, BigQuery, Hive, SQL), observability stacks, and multi-cloud…
- NVIDIA (Santa Clara, CA)
- …systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and blameless postmortems + Be part of an on-call ... rotation to support production systems What we need to see: + BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience + 8+ years of experience with Infrastructure automation,…
- CVS Health (Salem, OR)
- …across multiple regions and environments (cloud, on-premises, colocation). + Develop incident response and recovery strategies. **Required Qualifications:** + 5+ ... years of experience in developing and deploying security technologies. + 5+ years of experience with modern Software Development Lifecycles and CI/CD practices, including pipeline automation and security integration. + 3+ years of experience with remediation…
- Microsoft Corporation (Redmond, WA)
- …**Act as a Designated Responsible Individual (DRI)** for live-site health and incident response, proactively monitoring and restoring service when issues arise. ... + **Mentor and guide engineers**, fostering technical growth and sharing best practices; lead by example in engineering excellence and inclusive culture. + **Collaborate with stakeholders** (PM, Design, partner teams) to align on priorities, simplify complex…
- Beth Israel Lahey Health (Charlestown, MA)
- …recommendations for strengthening AI defenses. 8. Provide technical support to incident response teams, analyze vulnerabilities during investigations, and assist ... with corrective measures. 9. Collaborate with blue teams, purple teams, and broader security groups to stress-test systems, validate detection mechanisms, and improve enterprise readiness. 10. Plan, coordinate, and execute full-lifecycle red team operations,…
- Insight Global (Silver Spring, MD)
- …identify potential issues * Configuring alerts on system issues and leading incident response in collaboration with development teams * Configuring security ... tools and Salesforce permission sets to enforce zero-trust security strategies and least-privilege access controls * Strong proven analytical ability and ability to solve problems independently. We are a company committed to creating diverse and inclusive…