  • Senior Site Reliability

    TP-Link North America, Inc. (Irvine, CA)
    …with simpler, smarter, and more reliable connectivity. We're looking for a passionate and experienced Senior Site Reliability Engineer to join our team ... and tools + Help to mentor and train less senior members of the team + Ability to be...related field. + 5+ years of experience as a Site Reliability Engineer. + Proficiency…
    TP-Link North America, Inc. (03/11/25)
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating…
    NVIDIA (04/02/25)
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …and drive foundational improvements and automation to improve researchers' productivity. As a Site Reliability Engineer, you are responsible for the big ... and operating large scale compute infrastructure + Proven experience in site reliability engineering for high-performance computing environments with operational…
    NVIDIA (03/26/25)
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …drive foundational improvements and automation to improve engineer's productivity. As a Site Reliability Engineer, you are responsible for the big ... comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Develop, define and document standard methodologies…
    NVIDIA (04/04/25)
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus…
    NVIDIA (04/30/25)
  • Senior Site Reliability

    Cisco (CA)
    …applications with low operational burden by handling and improving the reliability and resiliency of SRE-managed services and infrastructure. You thrive on ... automation, infrastructure-as-code, reliability engineering, and getting rid of tedious, manual tasks....are fulfilled through the success of others. Work on reliability projects, including: + HA, Business Continuity Planning, disaster…
    Cisco (03/14/25)
  • Intl - EU - Senior Site

    Insight Global (Novato, CA)
    …Enablement Work closely with backend and DevOps teams. Contribute to system reliability standards and documentation. Mentor engineers on Unix system performance and ... Hands-on experience with observability tools. Ability to troubleshoot complex reliability issues. Nice to Have Experience with live game infrastructure.…
    Insight Global (04/30/25)
  • Senior Site Reliability

    MongoDB (San Francisco, CA)
    …managing any infrastructure, and our newest offering, Atlas Data Lake. The Cloud Site Reliability Engineering Team designs and builds the global infrastructure ... on which we deploy our services. As our customers grow and globalize, our services must satisfy demands for low-latency requests around the globe, and comply with various data sovereignty requirements. The SRE Team's mission is to build this increasingly…
    MongoDB (03/18/25)
  • Senior Site Reliability

    General Motors (Mountain View, CA)
    …Work Arrangement: This role is categorized as hybrid. This means the successful candidate is expected to report to the office three times per week or other ... frequency dictated by the business. What You'll Do + Leads and generates technical solutions including specifying of requirements, functional decomposition, analysis, development and testing for current, new and major programs + Lead…
    General Motors (04/26/25)
  • Senior Site Reliability

    Cisco (CA)
    …Splunk's Cloud group is looking for skilled engineers to support and build our large scale Cloud offering. You will be working with a fun, diverse, geographically ... distributed team to deliver an excellent product and an extraordinary experience to our customers. + You are passionate about building and running distributed systems at scale in production. You understand the challenges and trade-offs to be made when building…
    Cisco (03/14/25)