• Senior Site Reliability Engineer - DGX…

    NVIDIA (Santa Clara, CA)
    … design, experience with design, develop tools for running large scale private or public cloud system in Production + Experience in one or more of the following: ... live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through...Experience in using or running large private and public cloud systems based on Kubernetes, OpenStack and… more
    NVIDIA (08/01/25)
    - Related Jobs
  • Senior Storage Engineer - DGX Cloud

    NVIDIA (Santa Clara, CA)
    …looking for an engineer who has a deep understanding of distributed systems development, object storage, network file transfer protocols, and file systems . ... is looking for a talented, highly productive Senior Software Engineer to design and implement facilities for data ingress,...distributed systems such as distributed databases, storage systems , or cloud services NVIDIA is leading… more
    NVIDIA (08/08/25)
    - Related Jobs
  • Senior Software Engineer , Google…

    Google (Sunnyvale, CA)
    …from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, ... on and is growing every day. As a software engineer , you will work on a specific project critical...develop, test, deploy, maintain, and enhance software solutions. Google Cloud accelerates every organization's ability to digitally transform its… more
    Google (08/19/25)
    - Related Jobs
  • Senior Software Engineer , Google…

    Google (Sunnyvale, CA)
    …from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, ... on and is growing every day. As a software engineer , you will work on a specific project critical...users have the best and fastest experience possible. Google Cloud accelerates every organization's ability to digitally transform its… more
    Google (08/08/25)
    - Related Jobs
  • Senior Software Engineer , Distributed…

    NVIDIA (Santa Clara, CA)
    …deep learning. What you will be doing: + You will be part of an DGX Cloud team responsible for production systems that enable large scalable GPU clusters to be ... the crowd: + Technical competency in managing and automating large-scale distributed systems independent of cloud providers. Advanced hands-on experience and… more
    NVIDIA (07/02/25)
    - Related Jobs
  • Principal Software Engineer (CDSS…

    Palo Alto Networks (Santa Clara, CA)
    …Architect and own the design and implementation of core Threat Prevention and AppID cloud services for both public and private cloud environments + Establish and ... GoLang, Python, Linux, and networking + Rich Experience with Microservices and Cloud technologies (Kubernetes, GKE, EKS, Docker, Serverless, PubSub, IAM, etc) + Rich… more
    Palo Alto Networks (08/11/25)
    - Related Jobs
  • Staff Software Engineer , Google…

    Google (Sunnyvale, CA)
    …from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, ... on and is growing every day. As a software engineer , you will work on a specific project critical...develop, test, deploy, maintain, and enhance software solutions. Google Cloud accelerates every organization's ability to digitally transform its… more
    Google (07/09/25)
    - Related Jobs
  • Senior Optical Test and Automation Engineer

    Google (Sunnyvale, CA)
    …of hardware experiences, delivering unparalleled performance, efficiency, and integration. The ML, Systems , & Cloud AI (MSCA) organization at Google designs, ... lab test automation frameworks. + Understanding of computer and networking systems such as physical, functional, logical, mechanical, optics, electrical, software,… more
    Google (08/13/25)
    - Related Jobs
  • Senior GPU and HPC Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    …+ Understanding of performance, security and reliability in complex distributed systems . Familiarity with system level architecture, data synchronization, fault ... from the crowd: + Proficiency in architecting and managing large-scale distributed systems , independent of cloud providers. Deep knowledge of datacenter… more
    NVIDIA (07/10/25)
    - Related Jobs
  • Senior DGX AI Cloud Performance Analysis…

    NVIDIA (Santa Clara, CA)
    …work will enable AI researchers to work efficiently with a wide variety of DGXC cloud AI systems as they seek out opportunities for performance optimization and ... Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the...+ Develop AI performance tools for large scale AI systems providing real time insight into applications performance and… more
    NVIDIA (06/08/25)
    - Related Jobs