• Sr . SDE C/C++ Hardware/Software Co-Design,…

    Amazon (Cupertino, CA)
    …operations experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience ... solutions. If you're passionate about building the highest-performing, hardware-accelerated Machine Learning systems and want to be part of the entire journey from… more
    Amazon (07/24/25)
    - Related Jobs
  • Senior Site Reliability

    NVIDIA (CA)
    …using high-performance NVIDIA infrastructure. Work with NVIDIA's DGX Cloud team as a Senior Site Reliability Engineer to maintain high-performance DGX Cloud ... clouds + Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity + Lead… more
    NVIDIA (08/30/25)
    - Related Jobs
  • Senior Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …demands robust, automated, and secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
    NVIDIA (09/30/25)
    - Related Jobs
  • Senior Site Reliability

    The Walt Disney Company (Anaheim, CA)
    …with the Disneyland Resort, Disney Cruise Line and Walt Disney World partners. The Senior Site Reliability Engineer will report to the Manager, Technology. ... **About The Role & Team** This Engineer will be expected to play multiple critical roles...of engineering. **What You'll Do:** + Design complex sensor systems that fuse multiple protocols and technologies. + Consult… more
    The Walt Disney Company (09/18/25)
    - Related Jobs
  • Senior Reliability Methodology…

    NVIDIA (Santa Clara, CA)
    …that are groundbreaking in AI and computing. What you'll be doing: As a Reliability Methodology Engineer at NVIDIA, you will be responsible for ensuring our ... products and systems operate flawlessly. Your key duties will include: +...test engineering teams to apply DFT methodologies to improve reliability screening specific to HTOL (Component level Hight Temp… more
    NVIDIA (07/31/25)
    - Related Jobs
  • Senior Reliability Engineer

    Celonis (Redwood City, CA)
    … Engineering with modern Software Engineering practices to build resilient and scalable systems . + Lead reliability efforts for a fleet of 80+ FedRAMP-compliant ... join us. **The Team** As a member of our Reliability Engineering team, you will play a critical role...SLOs, while continuously improving detection and response mechanisms. + Engineer solutions to enhance the availability, latency, and performance… more
    Celonis (07/18/25)
    - Related Jobs
  • Sr . System Architect, Hardware…

    Amazon (Sunnyvale, CA)
    …Electrical Engineering, Software Engineering, ML Science, Product Design, Industrial Design, Reliability , and Operations. You are a hands-on engineer who ... to drive system architecture across Amazon devices. Key job responsibilities As a Sr System Architect, you will be responsible for defining the system architecture… more
    Amazon (10/01/25)
    - Related Jobs
  • Senior Software Engineer , Backend…

    Coinbase (Sacramento, CA)
    …impact . *Role* - We would like to add a Senior Software Engineer to help promote reliability culture across Coinbase. You would be helping company-wide ... fully supported. *What you'll be doing (ie. job duties):* *Team* - Core Reliability team is a vital part of Infrastructure(Platform) org responsible for paving the… more
    Coinbase (08/09/25)
    - Related Jobs
  • Senior Site Reliability

    Cisco (San Francisco, CA)
    Senior Site Reliability Engineer - Performance Apply (https://jobs.cisco.com/jobs/Login?projectId=1441767) + Location:Offsite, San Francisco, California, US + ... the cloud that supports these customers and their networks. As a Site Reliability Engineering Technical Leader on the Performance team you'll take the lead on… more
    Cisco (09/25/25)
    - Related Jobs
  • Senior System Reliability

    Ford Motor Company (Long Beach, CA)
    …role is expected to support an entire vehicle. What you'll do * Define reliability targets for different systems and subsystems by cascading top level ... you will work with multiple teams and help set reliability targets at the system and subsystem level. You...experts based on the physics of failure, complexity of systems , and technology readiness levels. You will be identifying… more
    Ford Motor Company (09/03/25)
    - Related Jobs