- Oracle (Des Moines, IA)
- …metrics, logs, eBPF/perf, chaos/failure testing, and SLO-driven operations. Knowledge of AI / HPC workload patterns and their implications for storage, query ... **Job Description** OCI (Oracle Cloud) AI Infrastructure Innovation team is inventing the next...If you thrive at the intersection of large-scale distributed systems , database internals, and cloud platforms, this role offers… more
- Google (Kirkland, WA)
- …degree or equivalent practical experience. + 2 years of experience in high performance computing ( HPC ) system architecture and applications. + 2 years of ... and engineering issues on our Google Cloud Platform (GCP). This High Performance Computing ( HPC ) role offers supercomputer-class infrastructure (CPUs, GPUs or… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are searching for a highly motivated ... benchmarking strategies for our data center platforms and products + Characterize real-world AI training, inference, and HPC workloads at scale + Define, track,… more
- Lockheed Martin (PR)
- …This Full Stack Engineer role is for the High Performance Computing \( HPC \) Delivery Team with a focus on AI Infrastructure\. Engineer responsibilities ... include: * Support the design and development of HPC and utility systems \(computation, network, and storage\) * Support AI Infrastructure and the equivalent… more
- Honeywell (Aguadilla, PR)
- As a **Lead IT Engineer for High Performance Computing ( HPC )** here at Honeywell, you will be at the forefront of our technology initiatives, driving the design ... will be crucial in optimizing our computing resources and ensuring that our systems operate at peak efficiency. Honeywell's HPC infrastructure spans hundreds of… more
- BAE Systems (Wright Patterson AFB, OH)
- …Team Leads, BAE leadership, and government stakeholders to integrate efforts across systems , networking, cybersecurity, HPC operations, and user support. The ... relevant experience). + 10 years of progressive IT experience across systems administration, networking, cybersecurity, HPC operations, or enterprise… more
- IBM (Cambridge, MA)
- …technical areas in the context of hybrid cloud, AI systems , networking, security, high-speed networked-storage, accelerators, and HPC principles. The ... focuses on the next generation Hybrid Cloud infrastructure for AI , Storages, HPC and Quantum applications. The...experience with Git * HPC : experience running HPC workloads on HPC systems … more
- General Dynamics Information Technology (Washington, DC)
- …Family:** IT Infrastructure and Operations **Skills:** High Performance Computing ( HPC ),High- Performance Computing ( HPC ) Systems ,Scientific Research ... inspiring collaboration and teamwork + Responsible for multiple High Performance Computing ( HPC ) clusters including all compute,...with researchers for support and troubleshooting + Experience with HPC systems and tooling like SLURM, GPFS,… more
- Deloitte (Costa Mesa, CA)
- …and building secure networks and modern data centers, to enabling the adoption of AI or high- performance computing ( HPC ), you'll gain firsthand experience ... organizations through Data Center and infrastructure transformation journeys, such as adopting AI , deploying high- performance computing ( HPC ) or edge… more
- Meta (Menlo Park, CA)
- …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more