  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …or mathematics), or equivalent experience + 5+ years of experience with Infrastructure automation, distributed systems design, experience with design, develop ... Much of our software development focuses on eliminating manual work through automation, performance tuning and growing efficiency of production systems. As SREs are…
    NVIDIA (08/02/25)
  • Senior Software Engineer - Digital…

    NVIDIA (Santa Clara, CA)
    …and biological sciences. If you love moving between science, code, and the infrastructure that makes it reliable, fast, and cost‑efficient, you'll feel at home here. ... of GPUs. + Collaborate across teams: Partner with applied research, AI infrastructure, and full‑stack teams; contribute to and upstream improvements across the…
    NVIDIA (09/25/25)
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    …and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with opportunities to ... the full-stack as we continue to push technology forward. The Architecture, Automation, Analysis, and Accelerator team in Google Cloud is responsible for developing…
    Google (09/24/25)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. Most DL/HPC applications ... applications on groundbreaking GPU clusters + Develop tools and automation to isolate issues on new systems and platforms,...teams in different time zones on networking, GPUs, storage, infrastructure and support. What we need to see: +…
    NVIDIA (07/07/25)
  • Senior Engineer - HashiCorp

    IBM (San Francisco, CA)
    …hybrid environments. You will join a team managing the lifecycle of infrastructure and security, enhancing IBM's cloud solutions to ensure enterprises achieve ... to safely and predictably create, change, and improve production infrastructure via the command line. It codifies APIs into...of Azure. In addition to the providers, we maintain automation tooling, SDKs, and generate our own Go Azure…
    IBM (09/16/25)
  • Senior DGX AI Cloud Performance Analysis…

    NVIDIA (Santa Clara, CA)
    Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing ... workloads, as well as developing scalable AI and Data infrastructure tools and services. Our objective is to deliver...and build consensus + Passion for "it just works" automation, eliminating repetitive tasks, and enabling team members Ways…
    NVIDIA (09/07/25)
  • Senior Network Engineer

    Cadence Design Systems, Inc. (San Jose, CA)
    …WAN/MPLS and cloud connectivity. + Improve operational efficiency - Enhance Network Automation, Empower NOC team through knowledge base, runbooks, review & reduce ... and enhancing network monitoring. + Proven ability to manage mission-critical infrastructure projects. Minimum Requirements + Degree in Computer Science. + Expertise…
    Cadence Design Systems, Inc. (08/22/25)
  • Senior Site Reliability Engineer

    Rubrik (Sacramento, CA)
    …**What you'll do:** * Deploy and operate security solutions and supporting infrastructure in cloud and datacenter environments in support of internal customer ... * Develop and automate Security tasks that span from Security Operations to Infrastructure as Code in support of InfoSec initiatives * Manage the availability,…
    Rubrik (08/20/25)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …hardware technologies. We're in search of a visionary technical leader to engineer and propel innovation in diagnostics for NVIDIA's partner ecosystem. This role ... programming languages like C, C++, and Python for tool development and automation. + Familiarity with high-speed interconnects such as PCIe, Infiniband, NVLink, and…
    NVIDIA (09/10/25)
  • Senior Reliability Engineer

    Celonis (Redwood City, CA)
    …running on Kubernetes, applying SRE principles to drive observability, automation, and incident prevention. + Own high-priority application incident escalations, ... defined SLOs, while continuously improving detection and response mechanisms. + Engineer solutions to enhance the availability, latency, and performance of…
    Celonis (07/18/25)