• Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site Reliability Engineer focused on HPC storage and play a ... such as high-performance NFS, S3-compatible object storage, and distributed storage systems + Develop tooling to automate deployment and management of large-scale…
    NVIDIA (08/21/25)
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and ... techniques and Infrastructure as Code (IaC). + Deep understanding of Linux operating systems and TCP/IP fundamentals. + Expertise with at least one major cloud…
    NVIDIA (09/17/25)
  • Senior Site Reliability

    Tarana Wireless (Milpitas, CA)
    …speeds worldwide, bridging the digital divide in ways previously thought impossible. As a Senior Site Reliability Engineer, you will help us manage software ... environment, to support millions of connected devices + Monitoring of all live systems + Troubleshoot and triage production active issues What You'll Need: + BS…
    Tarana Wireless (08/15/25)
  • Senior Reliability Methodology…

    NVIDIA (Santa Clara, CA)
    …that are groundbreaking in AI and computing. What you'll be doing: As a Reliability Methodology Engineer at NVIDIA, you will be responsible for ensuring our ... products and systems operate flawlessly. Your key duties will include: + ...test engineering teams to apply DFT methodologies to improve reliability screening specific to HTOL (Component level High Temp…
    NVIDIA (07/31/25)
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    Senior Staff Software Engineer, Site Reliability Engineering. Google. Sunnyvale, CA, USA. **Advanced** Experience owning outcomes and ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to build and…
    Google (09/27/25)
  • Senior Software Engineer, Site…

    Google (Sunnyvale, CA)
    Senior Software Engineer, Site Reliability Engineering. Google. Durham, NC, USA; Raleigh, NC, USA; +3 more; +2 more. **Mid** Experience ... SRE ensures that Google's services, both our internally critical and our externally visible systems, have reliability and uptime appropriate to users' needs and a…
    Google (10/01/25)
  • Sr. Site Reliability

    Amazon (Culver City, CA)
    …and studio executives at all levels. Our Infrastructure Engineering team is looking for Sr Site Reliability Engineers to build, deploy, operate, and sustain our ... systems in AWS. The team will operationalize the stability and reliability of these systems and discover innovative ways to scale and operate them reliably as…
    Amazon (09/09/25)
  • Senior Reliability Engineer

    Celonis (Redwood City, CA)
    …engineering and Site Reliability Engineering (SRE) principles to drive system reliability, scalability, and operational excellence across the organization. ... Engineering with modern Software Engineering practices to build resilient and scalable systems. + Lead reliability efforts for a fleet of 80+ FedRAMP-compliant…
    Celonis (07/18/25)
  • Senior System Reliability

    Ford Motor Company (Long Beach, CA)
    …In this highly interdisciplinary role, you will work with multiple teams and help set reliability targets at the system and subsystem level. You will oversee the ... support an entire vehicle. What you'll do * Define reliability targets for different systems and subsystems... Reliability requirement development, target allocation, cascading the reliability requirements from top level to system…
    Ford Motor Company (09/03/25)
  • Senior Site Reliability

    Palo Alto Networks (Santa Clara, CA)
    …insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: Utilize ... including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you…
    Palo Alto Networks (10/03/25)