- Meta (Menlo Park, CA)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...responsible for design, model, develop, test, deploy and operate AI / HPC Networks at scale 2. Provide continual… more
- Lilly (Indianapolis, IN)
- …be driving the engineering and operations of advanced Linux platforms supporting AI and HPC workloads, managing Nvidia DGX systems using Mission Control, ... the world. Come help us unlock the power of HPC and AI based POGPU and Accelerated...in our Infrastructure Hosting Platform area leading the strategy, engineering and development of Advanced Linux computing capabilities for… more
- NVIDIA (Santa Clara, CA)
- …and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new ... runtime designs, and new network hardware features. + Research, design and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX,… more
- Meta (Menlo Park, CA)
- …5. Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques **Minimum ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead… more
- Amazon (Boston, MA)
- …following programming languages: C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI/ML frameworks. - Current, active US Government Security Clearance ... some of the biggest challenges in High Performance Computing (HPC)? Do you have a unique combination of deep...National Super Computing Centers, Government agencies, and/or AI/ML, CAE, Weather and accelerated computing with… more
- Federal Reserve Bank (Kansas City, MO)
- …education and experience. + Minimum of 6 years of relevant experience in HPC administration and systems engineering. + Extensive experience with Linux operating ... and accelerator technologies (CUDA, OpenACC). + Experience supporting machine learning and AI workloads on HPC systems. **Additional Information** How We Work… more
- NVIDIA (Santa Clara, CA)
- …to stand out from the crowd: + Experience in solving problems in large-scale HPC network environments with overlay technologies (BGP, OSPF, VXLAN, EVPN), RoCE ... part of the role is also to interact with Engineering, Marketing, and Support teams regularly. What you will...installing our products with a focus on Infiniband, next-generation AI, and HPC server technologies. + Own… more
- NVIDIA (Santa Clara, CA)
- …challenges and provide outstanding HPC solutions. + Collaborate closely with hardware engineering, CUDA engineering, and AI research groups to apply the ... healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as...integrating genomic solutions into mainstream healthcare. As a healthcare HPC engineer, you will join a dynamic development team… more
- GliaCell Technologies (Annapolis Junction, MD)
- Are you a Principal HPC Software Engineer who is ready for a new challenge that will launch your career to the next level? + Tired of being treated like a company ... you. We Make It Happen! GliaCell Technologies focuses on Software & System Engineering in Enterprise and Cyber Security solution spaces. We excel at delivering… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly… more