  • Senior DGX Cloud Software Engineer

    NVIDIA (Santa Clara, CA)
    …ROI of building and maintaining automation is worth it. + Practice sustainable blameless incident prevention and incident response while being a member of ... an on-call rotation. + Consult with and provide consultation for peer teams on systems design best practices. + Participate in a supportive culture of values-driven introspection, communication, and self-organization. What we need to see: + Proficiency in one… more
    NVIDIA (07/26/25)
  • Lead Security Operations Center (SOC)…

    Sunrun (CA)
    …of SOC analysts, overseeing the daily operations of our security monitoring and incident response functions, and ensuring the continuous improvement of our ... highly motivated and experienced Lead Security Operations Center (SOC) Engineer to join our dynamic security team. This critical...in the hiring and onboarding of new SOC analysts. Incident Response & Management: + Act as… more
    Sunrun (06/27/25)
  • Senior Software Engineer

    Microsoft Corporation (Mountain View, CA)
    …of experience applying site-reliability engineering (SRE) practices, including monitoring, incident response, and improving system resilience. Software ... Engineering IC4 - The typical base pay range for this role across the US is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base… more
    Microsoft Corporation (08/21/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and blameless postmortems + Be part of an on call ... rotation to support production systems What we need to see: + BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience + 5+ years of experience with Infrastructure automation,… more
    NVIDIA (08/20/25)
  • Senior Storage Production Engineer

    NVIDIA (Santa Clara, CA)
    …encryption, access controls, and auditing mechanisms for storage systems. + Practice sustainable incident response and blameless root cause analysis. Be part of ... an on-call rotation to support storage and production systems. What We Need To See: + BS degree or equivalent experience in Computer Science, Storage Systems, or a related technical field with 8+ years of practical experience. + Experience with distributed and… more
    NVIDIA (08/13/25)
  • Senior Site Reliability Engineer

    Coinbase (Sacramento, CA)
    …* Collaborate with Coinbase product teams to reduce service disruptions and automate incident response * Proactively find and analyze reliability problems across ... our business units and stack, then design and implement software to create step-function improvements. * Educate, mentor and hold accountable the engineering team to improve the reliability of our systems and make reliability a core value of the Coinbase… more
    Coinbase (08/09/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and blameless postmortems + Be part of an on call ... rotation to support production systems What we need to see: + BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience. + 10+ years of experience. + Experience with Infrastructure… more
    NVIDIA (08/01/25)
  • Officer, Senior Information Security…

    Banc of California (Santa Ana, CA)
    …remediation of same. + Establishes and maintains Security Operations team triage and incident response playbooks to protect and recover information assets from ... unauthorized access, modification or destruction. + Assist in developing and implementing technical security standards to support the Bank's security needs and regulatory requirements including ISO2700x, CFPB, SOX, GLBA, NIST, FFIEC and PCI. + Provide subject… more
    Banc of California (07/16/25)
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …direction across orgs, and contributing deeply to culture, hiring, and technical excellence. Lead incident response and post-incident reviews to identify root ... focused engineering, or distributed systems. Preferred Qualifications: Hands-on experience with large-scale incident response, root cause analysis, and resiliency… more
    LinkedIn (06/04/25)
  • Cyber Security/Network Engineer II

    Hyundai Autoever America (San Diego, CA)
    …Follow change management processes for network and security updates. + Assist in incident response and disaster recovery operations. + Support backup and restore ... Alto), and endpoint protection solutions to maintain network performance and security. The engineer collaborates with senior team members to resolve issues and… more
    Hyundai Autoever America (06/11/25)