• Senior GPU and HPC Infrastructure Engineer - DGX…

    NVIDIA (Santa Clara, CA)
    …multiple data streams, ranging from GPU hardware diagnostics to cluster and network telemetry. + Work on software that manages NVLINK topography across GPU clusters. ... in Machine Learning Operations. Hands-on experience with Bright Cluster Manager. + Hands-on experience developing and/or operating hardware fleet management…
    NVIDIA (10/09/25)
  • Network Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …with network management tools such as Prometheus, Grafana, Alert Manager, Nautobot/Netbox, BigPanda. + Network Automation: Expertise in automating networks ... taking operational signals through means such as SNMP, Syslog, Streaming Telemetry to solve operational challenges. + Platform Exposure: Experience with…
    NVIDIA (09/23/25)
  • Senior Instrumentation and Controls Engineer

    Carollo Engineers (Costa Mesa, CA)
    …best place for you to build your career. **Responsibilities** + Works with project manager and clients + Develops scopes of work for I&C systems, networks and ... preferences, HMI graphics design and standard development + Designs radio telemetry SCADA systems using microwave, spread spectrum, licensed MAS, cellular or…
    Carollo Engineers (09/12/25)
  • Mid Level Instrumentation and Controls Engineer

    Carollo Engineers (Costa Mesa, CA)
    …best place for you to build your career. **Responsibilities** + Works with project manager and clients + Develops scopes of work for I&C systems, networks and ... preferences, HMI graphics design and standard development + Designs radio telemetry SCADA systems using microwave, spread spectrum, licensed MAS, cellular or…
    Carollo Engineers (09/12/25)
  • Network Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …Familiarity with network management tools such as Prometheus, Grafana, Alert Manager, Nautobot/Netbox, BigPanda + Expertise in automating networks using frameworks ... of taking operational signals through means such as SNMP, Syslog, Streaming Telemetry to solve operational challenges + History of debugging and optimizing code;…
    NVIDIA (07/26/25)
  • CPU/Linux Performance - Software Engineer

    Nutanix (San Jose, CA)
    …automation frameworks for testing and deployment + Working with databases for telemetry, logging and performance analysis + Knowledge of performance profiling tools ... in-office presence. Additional team-specific guidance and norms will be provided by your manager. If hired, employee will be in an "at-will position" and the Company…
    Nutanix (07/25/25)