- NVIDIA (Santa Clara, CA)
- …development of our software, contributing directly to the world's leading AI and HPC systems. What you'll be doing: + Design, develop and optimize server application ... responsible for managing NVLink networks of thousands of network devices! + Collaborate with multiple teams in our multi-functional environment on developing new features and improvements. + Participate in the design and architecture of new next generation… more
- SpaceX (Hawthorne, CA)
- …power) best practices including green power, cooling requirements and HPC cluster environments. + Experience in server/infrastructure virtualization technologies ... such as VMware or Hyper-V. + Experience writing instructional documentation and conveying highly technical ideas in terms non-technical staff can easily understand. + A motivated self-starting personality that is able to work independently while maintaining… more
- Northrop Grumman (Palm Beach Gardens, FL)
- …for efficient processing of analytical models with high performance workstations and HPC clusters. + General user knowledge of Siemens NX and Teamcenter Unified ... Architecture + Good knowledge of metals and additive materials for high thermal loading environments and general knowledge in composites in same environment. Primary Level Salary Range: $142,200.00 - $213,400.00 The above salary range represents a general… more
- NVIDIA (Santa Clara, CA)
- …to understand, define and implement processes to support as well as NVIDIA GPUs HPC and AI platforms to cloud service provider customers and OEMs. This will also ... include responsibilities related to general compute and firmware releases. + Lead software and firmware execution for Datacenter class of Servers, Rack Solution and PCIe products, drive release schedules and plans, executive status updates. Schedule and lead… more
- Mount Sinai Health System (New York, NY)
- …Proficient with python/shell/other scripting languages + Use of high-performance computing ( HPC ) cluster environment + Strong research experience in analysis of ... large-scale genomic datasets (single-cell RNA-seq, ATAC-seq, etc) + Self-motivated and highly dedicated + Effective written and oral communication skills **Qualifications** . **Responsibilities** . **About Us** **Strength through Unity and Inclusion** The… more
- Northrop Grumman (Roy, UT)
- …ensure project success + Demonstrated experience working with Linux and Windows based HPC 's for aero/fluids work + Involvement with the planning and execution of ... physical aerodynamics related testing applications such as wind tunnels or flight test validation including scheduling, test matrix and development of test items + Demonstrated experience leading a technical effort/team through an EMD program in compliance of… more
- Micron Technology, Inc. (Richardson, TX)
- …and influencing the future direction of memory for high-performance compute ( HPC ) and artificial intelligence (AI) systems. This memory-oriented AI/ML role requires ... a strong good understanding of compute system architecture features like parallelism, caching, memory scaling, bandwidth and latency. Outstanding candidates who have a foundation in statistics, algorithms for signal denoising and error detection and correction… more
- NVIDIA (Santa Clara, CA)
- …6+ years of proven experience using in accelerated computing for datacenter/ HPC solutions. + C/C++/Python/Bash programming/scripting experience. + Deep experience in ... backend or infrastructure engineering, especially with complex, multi-team systems + Strong debugging instincts and a methodical approach to untangling ambiguous problems + A background in DevOps or deployment tooling-ideally with an eye for reproducibility… more
- NVIDIA (Santa Clara, CA)
- …Slurm, Terraform and Kubernetes + CUDA programming and NCCL experience. HPC programming experience including MPI, OpenACC, or other parallel programming tools. ... Hands-on experience with DGX Cloud, NVIDIA AI Enterprise AI Software, Base Command Manager, NEMO and NVIDIA Inference Microservices. + Interest in crafting, analyzing and fixing large-scale distributed systems. + Systematic problem-solving approach, coupled… more
- NVIDIA (Santa Clara, CA)
- …for the growing field of artificial intelligence (AI) and high-performance computing ( HPC ). What you'll be doing: + Architect hardware and software Resiliency ... features to improve system Reliability, Availability, Serviceability (RAS), and performance in the Datacenter. + Model and analyze RAS metrics like Failures in Time for permanent and transient errors, and Availability from GPU to Rack to Datacenter. Use models… more