• Senior Site Reliability Engineer - AI…

    NVIDIA (Santa Clara, CA)
    …and operating large scale compute infrastructure + Proven experience in site reliability engineering for high- performance computing environments with ... and reliability aspects of large scale distributed systems with focus on performance at scale,...+ Passion for solving complex technical challenges and optimizing system performance . + Experience with AI/HPC advanced… more
    NVIDIA (03/26/25)
    - Related Jobs
  • Manager, System Development, WWPS Solutions…

    Amazon (San Francisco, CA)
    …managing distributed teams. - Experience in Systems and Network Administration, System Reliability Engineering , or Application Security - Excellence in ... Model Certification (CMMC) approved infrastructure. We are seeking a Systems Development Manager to lead the team that builds...Understanding of design for scalability, performance and reliability - System engineering experience… more
    Amazon (04/04/25)
    - Related Jobs
  • Site Reliability Engineering Manager

    Two95 International Inc. (Sacramento, CA)
    …duties as directed. EDUCATION AND EXPERIENCE: Bachelor's or Master's degree in Reliability Engineering , Computer Science, Information systems , or related ... Position - Site Reliability Engineering Manager Location - Sacramento,...Serve as an active and consistent participant in the systems reliability governance process. + Work with… more
    Two95 International Inc. (03/11/25)
    - Related Jobs
  • Manager, Site Reliability

    General Motors (Mountain View, CA)
    …You'll Do:** + Develop tools and software to automate operational processes, improve system reliability , and reduce manual intervention. + Lead, Implement and ... and a partner, helping engineers grow, and ensuring the reliability and efficiency of the systems they...recurrence. Champion a culture of continuous improvement. + Evaluate system performance and advocate for optimizations that… more
    General Motors (04/24/25)
    - Related Jobs
  • Sr Hardware Reliability Engineer, Hardware…

    Amazon (Cupertino, CA)
    …develop into a better-rounded professional. Basic Qualifications - Bachelor's degree in Reliability Engineering , Physics, Material Science or related field, or ... will have a fundamental understanding of Reliability statistics/ Reliability tests and/or solid understanding of computer systems...equivalent experience - 5+ years of Reliability Engineering work experience with server platforms… more
    Amazon (04/05/25)
    - Related Jobs
  • Software Developer II, Site Reliability

    Google (Sunnyvale, CA)
    …qualifications: + Master's degree in Computer Science or Engineering . Site Reliability Development combines software and systems development to build and run ... large-scale, massively distributed, fault-tolerant systems . Site Reliability Development ensures that Google's... Developers will keep an ever-watchful eye on our systems capacity and performance . Much of our… more
    Google (05/02/25)
    - Related Jobs
  • Manager, Site Reliability

    Amazon (Culver City, CA)
    …Video Tech takes you! Basic Qualifications - Minimum of 10 years of hands-on systems reliability engineering and providing senior level technical direction ... Prime Video's Studios Technology Services team is searching for a Manager, Site Reliability Engineering . The Studios Technology Services team supports our Media… more
    Amazon (04/25/25)
    - Related Jobs
  • Automation Solutions Engineer, Reliability

    Amazon (Bakersfield, CA)
    …of those systems - Perform, utilize, and assess material handling system performance . - Partner with operations leadership, equipment vendors and parts ... fulfillment operations, focusing on maximizing equipment reliability and operational performance of equipment such as conveyors, sortation systems , scanners,… more
    Amazon (02/15/25)
    - Related Jobs
  • Senior System Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Reliability Engineer to join NVIDIA's existing Reliability Engineering team, involved in NVIDIA's diverse system product range specifically Graphics ... What you'll be doing: + Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems ...to End-of-Life phase. + Establish, deliver and maintain product reliability standards and metrics for NVIDIA's new system more
    NVIDIA (04/17/25)
    - Related Jobs
  • Software Engineering Manager II, Site…

    Google (San Francisco, CA)
    …+ Master's degree in Computer Science or Engineering . Site Reliability Engineering (SRE) combines software and systems engineering to build and run ... Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance . Much of our...To learn more: check out our books on Site Reliability Engineering (https://landing.google.com/sre/book.html) or read a career… more
    Google (04/24/25)
    - Related Jobs