- Broadcom (San Jose, CA)
- **Job Description:** The Software Field Applications Engineer (FAE) is the software technical lead for Broadcom Ethernet controllers/Network ... congestion techniques, working experience debugging embedded software, knowledge of HPC and AI/ML data center operational models, deep knowledge of Linux…
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior High Performance Computing Engineer. Job ID: 6383. Location: SLAC, Menlo Park, CA. Full-Time Regular. **About SLAC:** The SLAC National ... is open to on-site and hybrid work options. **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services…
- NVIDIA (Santa Clara, CA)
- …a variety of programming models, frameworks, and tools. We are looking for a Senior Developer Advocate Engineer to own the technical engagements for a rapidly ... High Performance Computing (HPC) and Artificial Intelligence (AI) are key markets ... doing: + Stay abreast of the latest developments in HPC and AI technologies and develop proof-of-concept solutions for…
- NVIDIA (Santa Clara, CA)
- …to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads ... be doing: + Design and develop enhancements to the HPC batch scheduler(s). + Work extensively with the HPC scheduler vendor on bug fixes and feature releases +…
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking ... etc.) + Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on…
- Microsoft Corporation (Mountain View, CA)
- …Solutions (SDCS) team, embedded within the broader silicon engineering organization. As a Senior Systems Engineer, you will play a critical role in designing, ... This role is central to ensuring the availability, performance, and efficiency of HPC services that power Microsoft's silicon innovation. You will work closely with…
- Amazon (Cupertino, CA)
- Description: We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving ... countries. We take mentorship seriously: you can expect senior mentorship and will be expected to mentor new…
- NVIDIA (Santa Clara, CA)
- NVIDIA's AI Infrastructure organization is seeking a Senior AI Observability Engineer to help architect and implement distributed observability systems for AI ... and HPC clusters. We serve and collaborate directly with NVIDIA's ... spectacularly improve efficiency, performance, and productivity of AI and HPC workloads. You will develop, deploy, and operate observability…
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking ... a Senior Software Engineer to lead the development ... and performance tuning large-scale AI workloads in cloud and HPC environments, ensuring seamless operation of AI training and…
- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. We are looking for an outstanding engineer for a Senior Performance Engineer role for at-scale AI system ... develop new, leading differentiated solutions. You will interact with HPC, OS, CPU and GPU compute, and systems specialist ... and resource management systems. + Experience with large-scale HPC environments. Your base salary will be determined based…