- Google (Reston, VA)
- Cloud Infrastructure Engineer , HPC , TPU, Google Public Sector _corporate_fare_ Google _place_ Reston, VA, USA **Mid** Experience driving progress, ... on Google Cloud Platform (GCP). For these High Performance Computing ( HPC ) requirements, we offer supercomputer-class infrastructure (eg, CPUs, GPUs, or… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is hiring engineers to scale up its AI Infrastructure . We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly… more
- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation ... + Minimum 5+ years of experience designing and operating large scale compute infrastructure + Experience with AI/ HPC advanced job schedulers, such as Slurm,… more
- NVIDIA (Santa Clara, CA)
- …operators with actionable insights. + Collaborate with AI platform, GPU, and cloud infrastructure teams to optimize observability for model training, inference ... team, Managed AI Superclusters (MARS) builds and scales the infrastructure , platforms, and tools that enable researchers and engineers...transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build… more
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of...Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information + Collaborate… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving...software components that are critical building blocks for EC2 infrastructure . Every instance in EC2 is running some type… more
- Amazon (Santa Clara, CA)
- …understanding of the cloud computing delivery model as it relates to HPC . - Knowledge of the underlying infrastructure requirements such as Networking, ... as they refactor an application or designing entirely new cloud -based systems. Do you enjoy solving novel and unique...multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC … more
- NVIDIA (Seattle, WA)
- We are seeking a motivated Senior HPC Technical Support Engineer - AI Infrastructure focusing on InfiniBand, NVLink and AI GPU Cluster technology, passionate ... level of DCA and/or CKA, Virtualization and (KVM/ESXi) and Cloud Infrastructure (AWS/OCI) Technologies + Able to...RDMA, NVLink and NVIDIA GPU Technology + Clustering or HPC Data-Center technologies including Upper Layer Protocols (ie, MPI,… more
- University of Maine System (Orono, ME)
- …of infrastructure for migrating jobs between on-premise clusters and remote/ cloud computing platforms. + Ability and willingness to learn new technologies and ... position will have an active role in shaping what HPC resources are available, who can access those resources,...remain current in developing trends in the HPC community. + Highly developed organizational skills and attention… more
- Citigroup (Irving, TX)
- …community and make a real impact. **Job Description:** We are seeking a Lead Cloud Infrastructure Engineer , a strategic problem-solver, builder, and product ... enabling new infrastructure economics. As a public cloud infrastructure engineer , you will...team that continues to deliver big! From establishing a cloud -based High-Performance Compute ( HPC ) platform for complex… more