  • Senior ML Platform Engineer - Lepton

    NVIDIA (Santa Clara, CA)
    …using Ansible and Terraform, ensuring reproducibility and scalability across large-scale, distributed GPU clusters. + Apply SRE principles to diagnose, troubleshoot, ... SRE roles, including 3+ years focused on ML infrastructure or distributed compute systems. + Strong proficiency in Infrastructure-as-Code (IaC) tools, specifically… more
    NVIDIA (11/04/25)
  • Senior Software Architect, Observability Platform

    NVIDIA (Santa Clara, CA)
    …Senior Software Architect for our Observability Platform to architect and implement distributed observability systems for data centers enabling EDA workflows. We ... What We Need to See: + Experience developing large scale, distributed observability systems. + Ability to collaborate with data scientists, researchers,… more
    NVIDIA (11/04/25)
  • Sr. Software Development Engineer - Amazon…

    Amazon (East Palo Alto, CA)
    …mentor, tech lead or leading an engineering team - 4+ years of distributed systems experience, or Bachelor's degree in computer science, engineering, mathematics or ... equivalent - Experience with distributed computing and enterprise-wide systems Proficiency in at least...writing algorithms and creating data structures - Experience with distributed systems at scale - Experience performing live system… more
    Amazon (11/02/25)
  • Senior Solutions Architect (Pre-Sales)

    MongoDB (San Francisco, CA)
    …in a professional capacity + Experience designing with scalable and highly available distributed systems in the cloud and on-prem + Demonstrated ability to work with ... software. MongoDB's unified database platform-the most widely available, globally distributed database on the market-helps organizations modernize legacy workloads,… more
    MongoDB (11/01/25)
  • Sr. Connectivity Architect

    Panasonic Avionics Corporation (Irvine, CA)
    …network engineer, or similar role working with OSS/BSS of large-scale distributed systems. + 5+ years' architecture or technical leadership experience with ... Cloud, and Commerce Cloud, and their integration with OSS/BSS systems. + Distributed Systems: Knowledge of distributed systems concepts and architectures,… more
    Panasonic Avionics Corporation (10/29/25)
  • Principal Systems Software Engineer

    NVIDIA (Santa Clara, CA)
    …common file formats such as Parquet, ORC and JSON + Collaborate with distributed systems teams to craft solutions to distributed processing problems challenges ... development + Outstanding technical skills in designing and implementing high-quality distributed systems + Excellent programming skills in C++, Java, and/or Scala… more
    NVIDIA (10/29/25)
  • Director, Software Engineering

    Walmart (Sunnyvale, CA)
    …area. + Deep working knowledge of all aspects of Cloud native distributed system development: Azure/GCP/WCNP preferred + Experience working with micro-services ... architecture and distributed systems. + Experience managing software development engineers, leaders,...ensure performance and scalability. + Experience collaborating with geographically distributed teams to align on platform goals. + Strong… more
    Walmart (10/28/25)
  • Senior ASIC Front End Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    …requirements + Deploy new compute farm technologies such as containers, volume cloning, distributed storage and distributed compute at scale + Deploy tracking ... preferred. + Experience with Make based build systems in large, distributed computing environments + Continuous Integration pipeline and/or pre-submit verification… more
    NVIDIA (10/28/25)
  • Software Engineer - Core Infra (FedRamp)

    Rubrik (Palo Alto, CA)
    …build upon, including our microservices architecture, Kubernetes deployment system, distributed job workflow engine, database instances (MySQL) and platform cloud ... and innovating on each of our core layers - Kubernetes services, distributed job framework, MySQL and security. **Some challenges include:** **Scalability**: As… more
    Rubrik (10/28/25)
  • Director, Software Engineering - AI Infrastructure

    LinkedIn (Mountain View, CA)
    …solutions, our AI Infrastructure brings together information retrieval, machine learning, distributed systems, and other fundamental areas of computer science. The ... delightful for our ML Engineers to productively use. To do this, our distributed search platform must scale seamlessly across data and traffic, while enabling… more
    LinkedIn (10/28/25)