• Senior Solutions Architect - AI Infrastructure…

    NVIDIA (MA)
    …Grafana, and NVIDIA DCGM + Understanding of datacenter networking technologies ( InfiniBand , Ethernet, OFED) and experience with network configuration + Familiarity ... with power and cooling systems architecture for data center infrastructure Ways to stand out from the crowd: + Background in deploying LLM training and inference workflows in a research computing environment + Experience deploying and evaluating cluster… more
    NVIDIA (09/17/25)
    - Related Jobs
  • Senior HPC Cluster Engineer - EDA

    NVIDIA (Santa Clara, CA)
    …and tools. + Familiarity with High-Speed Networking pertaining to HPC including InfiniBand , RDMA and RoCE. + Understanding of fast, distributed storage systems such ... as Lustre and GPFS for AI/HPC workload. + Familiarity with metrics collection and visualization at scale with Prometheus, OpenSearch and Grafana. NVIDIA offers competitive salaries and benefits. Our experienced and talented employees contribute to our… more
    NVIDIA (09/17/25)
    - Related Jobs
  • Optical Communications Business Lead

    Global Foundries (UT)
    …expertise in optical communications protocols including high speed Ethernet (IEEE 802.3) and InfiniBand and multi-source agreements such as SFP, OSFP and OIF + Deep ... domain expertise in optical module development + Experience designing optical-to-electrical interface circuits + Language: Fluency in a second language to support high growth markets Expected Salary Range $131,900.00 - $263,000.00 The exact Salary will be… more
    Global Foundries (09/17/25)
    - Related Jobs
  • Senior Machine Learning Engineer

    Red Hat (Boston, MA)
    …Experience with high-performance networking protocols and technologies including UCX, RoCE, InfiniBand , and RDM + Strong communications skills with both technical ... and non-technical team members + BS, or MS in computer science or computer engineering or a related field. A PhD in a ML related domain is considered a plus \#AI-HIRING \#LI-MD2 The salary range for this position is $170,770.00 - $281,770.00. Actual offer will… more
    Red Hat (09/16/25)
    - Related Jobs
  • Principal Machine Learning Engineer, Distributed…

    Red Hat (Boston, MA)
    …knowledge of high-performance networking protocols and technologies including UCX, RoCE, InfiniBand , and RDMA is a plus. + Excellent communication skills, capable ... of interacting effectively with both technical and non-technical team members. + A Bachelor's or Master's degree in computer science, computer engineering, or a related field. **Following is considered a plus** + Experience with the Kubernetes ecosystem,… more
    Red Hat (09/13/25)
    - Related Jobs
  • HPC Systems Engineer

    US Tech Solutions (Houston, TX)
    …high-availability, LAN / WAN / WLAN topologies and system configuration for Ethernet, InfiniBand , and Fiber Channel SAN. + Experience with HPC Storage Solutions, for ... example configuration and operation of HPE ClusterStor systems, NetApp, Dell Isilon, and Pure Storage. + Ability to write and troubleshoot Bourne, Bash and C Shell, Perl, Python, Ruby and MRTG scripts. + Experience with PostgreSQL and database installation and… more
    US Tech Solutions (09/12/25)
    - Related Jobs
  • Senior Solutions Architect, Spectrum-X Low Level

    NVIDIA (Santa Clara, CA)
    …distributed collection of NVIDIA GPUs inter-connected by networking solutions such as InfiniBand , Ethernet, or RoCE (RDMA over Converged Ethernet) we make powerful ... ML/AI platforms possible. We believe in our people and our products. We are seeking motivated, personable, and independent individuals to join our team! We seek experienced software embedded engineers to help support our groundbreaking, innovative technologies… more
    NVIDIA (09/11/25)
    - Related Jobs
  • Senior Software Engineer, GPU Communications…

    NVIDIA (Santa Clara, CA)
    …CUDA programming and NVIDIA GPUs. + Knowledge of high-performance networks like InfiniBand , iWARP etc. + Experience with HPC applications. + Experience with Deep ... Learning Frameworks such PyTorch, TensorFlow, etc. + Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment. NVIDIA offers highly competitive salaries and a… more
    NVIDIA (09/11/25)
    - Related Jobs
  • Senior System Software Engineer, Enterprise MODS

    NVIDIA (Santa Clara, CA)
    …development and automation. + Familiarity with high-speed interconnects such as PCIe, Infiniband , NVLink, and Ethernet. + Strong communication skills to engage with ... technical and executive team. + BS/MS or equivalent experience in Computer Science, Electrical Engineering, or related field. + 12+ years of engineering experience in diagnostics, embedded systems, or cloud platforms. Ways to stand out from the crowd: +… more
    NVIDIA (09/10/25)
    - Related Jobs
  • HPC Linux Systems Administrator 3/4 - Secret

    Northrop Grumman (Melbourne, FL)
    …understanding of HPC systems design, HPC network architecture (Ethernet and Infiniband ), and parallel processing optimization. + Programming experience with at least ... one high-level programming language or scripting language such as C, Fortran, Python, or BASH + Experience building and maintaining HPC systems and running scalable distributed application software. + Experience with storage clusters and/or parallel file… more
    Northrop Grumman (09/10/25)
    - Related Jobs