- Lockheed Martin (Fort Worth, TX)
- …Intelligence Center \(LAIC\) team is seeking an experienced AI Platform Engineer to support enterprise Machine Learning and Artificial Intelligence systems ... of Machine Learning Architectures, including GPU Computing, High Performance Computing \( HPC \) * Knowledge of ML/AI orchestration, such as Kubeflow, Flyte, MLflow… more
- Broadcom (San Jose, CA)
- …and firmware engineers to join the NIC product development team. As a Software Engineer , you will be responsible for designing and development of the RDMA protocol ... 2. Significant experience in RDMA protocol, QoS, Packet Classifications, Linux Systems programming, Linux kernel, Linux Network Drivers, Linux Kernel Networking,… more
- NVIDIA (Santa Clara, CA)
- …impact on the world! NVIDIA is searching for a highly motivated, technical engineer to join the Tegra system-on-chip (SoC) software organization. You will work on ... and hardware designs. + Strong understanding of multicore hardware, operating systems design, concurrency, virtual memory, caching, interrupts, device drivers and… more
- NVIDIA (Santa Clara, CA)
- We are seeking Lead Post-Silicon Validation Engineer within the GPU Engineering Team to help drive development of future GPUs be used in 3D graphics, deep learning, ... HPC and automotive markets. Make the choice to join...proven experience with three years and working with memory systems in the lab. + Direct experience in taking… more
- NVIDIA (Santa Clara, CA)
- …a discipline that involves designing, building, and maintaining large-scale production systems with high efficiency and availability. It encompasses various areas, ... including software and systems engineering practices, storage, data management, and services. Production..., and ensuring low-latency data access for high-performance computing ( HPC ) and AI/ML workloads. Storage Production Engineers at NVIDIA… more
- NVIDIA (Santa Clara, CA)
- …Infrastructure Specialist to design, develop, and operationalize next-generation thermal systems . This role will be deeply involved in heat-rejection architecture, ... waste-heat recovery integration, and full-stack MEP systems , transforming how we think about thermal performance at...to stand out from the crowd: + Experience with AI/ HPC data centers and advanced cooling technologies, including two-phase… more
- Microsoft Corporation (Redmond, WA)
- **Overview** The HPC /AI (High-Performance Computing and Artificial Intelligence) organization is on a mission to build the next generation of distributed AI ... supercomputers- systems that deliver unprecedented computational power, scalability, and reliability...some of the largest and most complex distributed training systems in the world. This is a rare opportunity… more
- Amazon (Austin, TX)
- …operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you're passionate about pushing the limits of performance, ... the cloud, this is your opportunity to build the systems that define what's next for AWS - and...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
- Microsoft Corporation (Mountain View, CA)
- …to push the boundaries of AI toward **Humanist Superintelligence-ultra-capable systems that remain controllable, safety-aligned, and anchored to human values.** ... experience. + Apply strong software engineering fundamentals in distributed systems , networking, and storage while building large-scale distributed applications on… more
- NVIDIA (Santa Clara, CA)
- …ecosystem of data center platform & node designs. From single node HGX/DGX systems all the way up to large multi-node NVLink domain rack architectures. These ... InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. NVIDIA NVLink Fusion will enable industry-leading AI scale-up and… more