- Northrop Grumman (Redondo Beach, CA)
- …OR a Master's degree AND 3 years of related professional/military engineering experience. + ** Senior Principal HPC Engineer :** Bachelor's degree in a STEM ... clearance to Special Access Programs (SAPs). **Educational/ Experience Requirements:** + **Principal HPC Engineer :** Bachelor's degree in a STEM discipline AND 5… more
- NVIDIA (Santa Clara, CA)
- …+ Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop ... and operating large scale compute infrastructure. + Experience with AI/ HPC job schedulers and orchestrators, such as Slurm, K8s...that use MPI and NCCL. + Proficient in using Linux including Centos/RHEL and/or Ubuntu Linux distributions.… more
- NVIDIA (Santa Clara, CA)
- …one else can solve. Make the choice to join us today. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC ... and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as...based on Slurm or Kubernetes + Strong understanding of Linux operation system and TCP/IP fundamentals Your base salary… more
- NVIDIA (Santa Clara, CA)
- …efficiency, and performance and drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are ... + Manage and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to automate deployment,… more
- NVIDIA (Santa Clara, CA)
- …like a fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis ... skills. + Programming Languages: Python, Bash and C languages + Experience with Linux OS distros. + Great teammate with good communication and interpersonal skills… more
- NVIDIA (Santa Clara, CA)
- …and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly ... understanding of Data Structure and Algorithms. + Expert level knowledge of Linux system administration and management. + Understanding of cluster management systems… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... C/C++ and relatively low level, so solid knowledge of Linux , kernels, and performant code is important. Experience with...systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving… more
- Cisco (San Jose, CA)
- …changes and/or completely innovative approaches for our artificial intelligence platform. + Senior Engineer who can lead and motivate teams, present, and ... AI Infrastructure Engineer - HPC Apply (https://jobs.cisco.com/jobs/Login?projectId=1443781) +...bottlenecks to drive system and workflow efficiency. + Administer Linux systems, ranging from powerful GPU-enabled servers to general-purpose… more
- NVIDIA (Santa Clara, CA)
- …the choice, join our diverse team today! We are looking for an outstanding hands-on architect/ engineer for a Senior HPC architect role to support deployment ... workflows and develop new, leading differentiated solutions. You will interact with HPC , OS, GPU compute, and systems specialist to architect, develop and bring… more
- Amazon (Sunnyvale, CA)
- …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...with systems knowledge and experience in area such as Linux OS boot sequencing, Kernel, Hypervisor (Xen or KVM),… more