- SpaceX (Hawthorne, CA)
- Sr. HPC Systems Engineer (Hawthorne, CA). SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally ... ENGINEER SpaceX is looking for an HPC Systems Engineer with strong...5+ years of professional experience building, deploying and troubleshooting Linux systems. + Experience with a scripting…
- Northrop Grumman (Redondo Beach, CA)
- …code deployment, maintenance, and optimization efforts. The lessons learned from existing HPC systems will inform the architecture, deployment, and utilization ... but are not limited to: + Develop and deploy architectures for future HPC systems based on engineering computing requirements, collaborating with engineering to…
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and...Experience analyzing and tuning performance for a variety of AI/HPC workloads. Excellent problem-solving to analyze complex systems…
- Cadence Design Systems, Inc. (San Jose, CA)
- …make an impact on the world of technology. Cadence is seeking a Server Farm Engineer to join our team to support, manage, and improve the compute farm environment. ... design, verification, and analysis. We are looking for a recent graduate software engineer to join our team of collaborative EDA professionals to deliver the…
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... ). + Experience analyzing and tuning performance for a variety of AI/HPC workloads. Excellent problem-solving to analyze complex systems, identify bottlenecks,…
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... or LSF + Proficient in administering CentOS/RHEL and/or Ubuntu Linux distributions + Solid understanding of cluster configuration managements...IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC…
- Amazon (Cupertino, CA)
- Description: We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important. Experience with embedded systems is…
- NVIDIA (Santa Clara, CA)
- …Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to join our Performance group. In this exciting role, you will profile ... and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems. You will develop performance analysis tools and methodologies...Languages: Python, Bash and C languages + Experience with Linux OS distros. + Great teammate with good communication…
- NVIDIA (Santa Clara, CA)
- …familiarity with software testing and deployment, familiarity with distributed systems, and excellent communication and planning abilities. Experience working with ... High Performance Computing (HPC), GPUs, and high-performance networking (RDMA, InfiniBand, RoCE) are strongly preferred. We also welcome out-of-the-box thinkers who…
- NVIDIA (Santa Clara, CA)
- …choice, join our diverse team today! We are looking for an outstanding hands-on architect/engineer for a Senior HPC architect role to support deployment and ... develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist...and visualization pipelines + Exposure to container technology and Linux performance tools. Widely considered to be one of…