- NVIDIA (Santa Clara, CA)
- …for the growing field of artificial intelligence (AI) and high-performance computing ( HPC ). What you'll be doing: + Architect hardware and software Resiliency ... features to improve system Reliability, Availability, Serviceability (RAS), and performance in the Datacenter. + Model and analyze RAS metrics like Failures in Time for permanent and transient errors, and Availability from GPU to Rack to Datacenter. Use models… more
- NVIDIA (CA)
- …from agencies and researchers around mission, strategy, requirements, and actions for HPC /AI, Data Analytics and Edge computing and synthesize these into a coherent ... response strategy for NVIDIA. What we need to see: + A Master's degree in the Science or Engineering field (or equivalent experience). + 12+ years of relevant work experience, with 5+ years of experience successfully leading projects in a fast-paced and… more
- Super Micro Computer (San Jose, CA)
- …for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company ... among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers,… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA GH200 superchip provides performance and productivity required for strong scaling for HPC and generative AI workload. Scale out is inherent to the design of ... this massive superchip. We are looking for expert engineers to come and help design rack level solutions for next generation scaling AI supercomputing platforms. Join us at the forefront of technological advancement. What you will be doing: + Drive next… more
- Super Micro Computer (San Jose, CA)
- …for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company ... among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers,… more
- NVIDIA (Santa Clara, CA)
- …storage systems, and ensuring low-latency data access for high-performance computing ( HPC ) and AI/ML workloads. Storage Production Engineers at NVIDIA ensure that ... our internal and external-facing GPU cloud services meet reliability and uptime goals as promised to the users while enabling developers to make changes to the existing system through careful preparation and planning while keeping an eye on capacity, latency,… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- …team also works on advanced computational workflows linking high performance computing ( HPC ) with experiments in real-time, in collaboration with other groups across ... SLAC, NERSC, and other national laboratories more broadly; this work helps enable AI/ML-based experiment guiding and digital twins for accelerators. Our team is heavily oriented toward collaboration with the broader accelerator community and works… more
- NVIDIA (Santa Clara, CA)
- …InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technology leader for running NVIDIA's ... server TPM team. You will be the cross-section between execution and strategy, leading a team of Senior TPMs driving impactful programs and delivering measurable results across many functions of firmware, software for the deep learning server platforms. What… more
- Northrop Grumman (Palmdale, CA)
- …have experience with either Windows (desktop and server), Linux (Red Hat or HPC ), VMWare, Storage Systems (SAN, NAS, DAS) or CISCO networks. Salary Range: $81,300.00 ... - $150,000.00Salary Range 2: $100,300.00 - $187,300.00 The above salary range represents a general guideline; however, Northrop Grumman considers a number of factors when determining base salary offers such as the scope and responsibilities of the position and… more
- Amazon (Cupertino, CA)
- …C++ (11, 14, etc.) - Experience in multi-threaded programming, vector extensions, HPC , and QEMU - Experience with machine learning accelerator hardware and/or ... software Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Los Angeles County applicants: Job duties for this position include: work safely and… more