- Google (Mountain View, CA)
- …DAS + Large-scale optimization/inversion experience + High-performance computing (HPC) experience. + GCP experience. + Experience in infrastructure-as-code, e.g. ... Terraform + Exposure to production systems that rely heavily on ML models, and/or experience with model deployment + Experience working in start-up-like environments where things can change on a dime + Consistent track record of delivering high quality…
- NVIDIA (Santa Clara, CA)
- …+ Deep understanding of Software Development Process (SDLC), High-Performance Computing (HPC), and Software Testing Methodologies. + Expertise in compilers / ... Low-level software tools, understanding how they work and are implemented, with a proven track record of solving problems and implementing solutions. + Ability to simplify sophisticated code to reproducible errors with minimal external dependencies. + Experience in…
- NVIDIA (Santa Clara, CA)
- …lead, and scale globally distributed production systems supporting AI/ML, HPC, and critical engineering platforms across hybrid and multi-cloud environments. ... + Design and lead implementation of automation frameworks that reduce manual tasks, promote resilience, and uphold standard methodologies for system health, change safety, and release velocity. + Define and evolve platform-wide reliability metrics, capacity…
- NVIDIA (Santa Clara, CA)
- …InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're searching for a highly motivated, technical architect to ... drive the roadmap and innovation in our rack system software architecture. From firmware, kernel drivers, operating systems, fabrics and associated user mode drivers + manageability software. You will work with component leads internally and engage with…
- NVIDIA (Santa Clara, CA)
- …InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. NVIDIA NVLink Fusion will enable industry-leading AI scale-up and ... scale-out performance with NVIDIA technology plus semi-custom ASICs or CPUs. NVIDIA's robust partner ecosystem enables hyperscalers to build an ASIC hybrid AI infrastructure with NVIDIA NVLink, rack-scale architecture. We're searching for a highly motivated,…
- Amazon (Cupertino, CA)
- …of applications and workloads: databases, web services, games, video encoding, ML and HPC, and a variety of internal and customer services and applications to ensure ... they are taking advantage of Graviton's capabilities. We maintain profiling tools to help AWS customers and internal teams debug performance-related problems, often working with them directly to find root causes and resolve their issues. We work with the hardware…
- NVIDIA (Santa Clara, CA)
- …NVIDIA GH200 superchip provides the performance and productivity required for strong scaling for HPC and generative AI workloads. Scale-out is inherent to the design of this ... massive superchip. We are looking for expert engineers to come and help design rack-level solutions for next-generation scaling AI supercomputing platforms. We are looking for a strong technical architect to own end-to-end manageability architecture for these…
- Amazon (Cupertino, CA)
- …cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns the design, planning, delivery, and ... operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access…
- NVIDIA (CA)
- …students to enter the workforce experienced in Quantum Computing, AI, and HPC! Students will learn how Accelerated Quantum Supercomputers will change the computing ... landscape through hands-on learning with GPUs and QPUs. In this role, you will help to build the future of Quantum Computing curriculum by engineering a platform that enables professors in multiple fields to integrate CUDA-Q into their existing courses. What…
- Amazon (Cupertino, CA)
- …range of applications including databases, web services, games, video encoding, ML, and HPC workloads. This doesn't mean you have or will have all those skills, ... but you'll have a chance to learn from those who do. This is a unique opportunity to impact how software runs in AWS, while growing your technical breadth and depth. Key job responsibilities As a Graviton Software Developer, you will: Performance Optimization…