- NVIDIA (Santa Clara, CA)
- …and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly ... understanding of Data Structure and Algorithms. + Expert level knowledge of Linux system administration and management. + Understanding of cluster management systems… more
- NVIDIA (Santa Clara, CA)
- …choice, join our diverse team today! We are looking for an outstanding hands-on architect/ engineer for a Senior HPC architect role to support deployment and ... workflows and develop new, leading differentiated solutions. You will interact with HPC , OS, GPU compute, and systems specialist to architect, develop and bring… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a senior HPC software engineer . As a member of our the High Performance Computing Software development team, you will be responsible for ... technical leaders solving some of the biggest challenges in HPC , machine learning, cloud computing, and system co-design. What...of Programming in C/C++ + 3 years' experience in Linux environment and tools + Knowledge of Networking Protocols… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more
- Actalent (Poway, CA)
- …infrastructure (VMware, Azure, and AWS). + Proactively maintain and develop all Linux infrastructure technology to maintain a 24x7x365 uptime service. + Install and ... set up Linux systems and servers in a hybrid cloud environment....+ Migrate server systems between hybrid cloud platforms. + Engineer systems administration-related solutions for various project and operational… more
- NVIDIA (Santa Clara, CA)
- …Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop… more
- LTD Global (Berkeley, CA)
- Position overview: We are seeking a Site Reliability Engineer to join our Operations Group. This role plays a key part in advancing scientific discovery by ... supporting high-performance computing ( HPC ) and data analysis for the organization. Our center...environmental science, and other missions. As a Site Reliability Engineer , you will be part of a 24/7 operations… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services Division of the Technology and… more
- NVIDIA (Santa Clara, CA)
- …to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads ... be doing: + Design and develop enhancements to the HPC batch scheduler(s). + Work extensively with HPC...as Python, Go, bash scripting + Established experience in Linux operating system, environment and tools + Accomplished in… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... to guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking (Infiniband, RoCE, Ethernet). This is an… more