• Senior MLOps Engineer, GenAI Framework

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a dedicated and motivated senior build and continuous integration (CI/CD) engineer for its GenAI Frameworks (Megatron-LM ... performance in every domain. What you'll be doing: + Architect and manage the continuous integration pipelines and release...stand out from the crowd: + Proven-track record with GPU accelerated systems at scale. + Well-versed in DL… more
    NVIDIA (07/17/25)
    - Related Jobs
  • Senior Hardware Development Engineer, AWS…

    Amazon (Cupertino, CA)
    …leveraging your experience with server design and the knowledge of various teams to architect the solutions that we will deploy at scale. To deliver your products ... speed bus design and signal integrity, failure analysis, server components (eg CPU, GPU , SSDs, drives), BIOS, BMC, and networking - Excellent written and oral… more
    Amazon (08/08/25)
    - Related Jobs
  • Senior Software Engineer, AI Platform…

    NVIDIA (Santa Clara, CA)
    …orchestration problems in distributed AI/ML systems. What you'll be doing: + Architect , develop, and deploy backend services supporting NVIDIA GR00T using Kubernetes ... ROS2) or simulation tools (eg, Isaac Sim, Omniverse). + Background with GPU cluster management and scheduling across cloud providers. + Contributions to open-source… more
    NVIDIA (07/25/25)
    - Related Jobs
  • Senior Silicon Circuits System Design…

    NVIDIA (Santa Clara, CA)
    …through their inventions. As part of the Silicon Solutions Team, we architect and deliver groundbreaking solutions for productizing NVIDIA's chips into consumer, ... in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention, the GPU , serves as the visual cortex of modern computers and is… more
    NVIDIA (06/13/25)
    - Related Jobs
  • Staff Software Engineer (Compute)

    DataRobot (San Francisco, CA)
    …efficient cloud spending for ourselves and our customers. + Design and architect automated quality platforms to go from Enterprise-Grade releases from once-a-quarter ... OpenTelemetry or experience with other orchestrators like nomad/slurm + Experience with gpu clusters, either as a user or administrator or experience in multi-node… more
    DataRobot (07/11/25)
    - Related Jobs