- NVIDIA (Santa Clara, CA)
- …innovative techniques for emerging AI workloads. From debugging performance bottlenecks in thousand-GPU distributed systems to influencing next-generation ... looking for engineers who excel at parallel programming and systems-level performance work and want to directly impact the future of AI compilation. The Deep Learning Frameworks Team @ NVIDIA…
- NVIDIA (Santa Clara, CA)
- …in CS, CE, EE (related technical field) or equivalent experience. + Prior systems software or communication runtime or high performance networking software ... libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. DL and HPC applications have a...computer system architecture, operating systems principles (aka systems software fundamentals), HW-SW interactions and performance…
- NVIDIA (Santa Clara, CA)
- …technical leader to define a vision and roadmap for distributed data platform and observability systems for large-scale AI and HPC clusters and workloads and ... and visualization to spectacularly improve efficiency, performance, and productivity of AI and HPC workloads. You will lead technical teams to develop,…
- NVIDIA (Santa Clara, CA)
- …workshops, etc. + Analyze and develop solutions for customer performance issues for both AI workload and systems performance. What we need to see: + ... networking and help develop accelerated computing networking solutions for AI/ML and HPC on hyperscalers. As part...systems in general including but not limited to performance testing/tuning, benchmarking, etc. + Strong systems…
- Roche (South San Francisco, CA)
- …intensive models, understand AI/ML workflows, and can balance cost, performance, and reliability in high-demand systems. + You have exceptional ... operate cutting-edge scientific platforms and infrastructure, you will enable high-performance computing, AI/ML, and large-scale data processing. Collaborating…
- NVIDIA (Santa Clara, CA)
- …ensure the performance, reliability, and integrity of pioneering GPU server systems used in the world's most demanding computing environments. If you thrive on ... to make a direct impact on the future of AI and high-performance computing. What You'll Be...interconnects such as NVLink or InfiniBand. + Familiarity with AI/ML or HPC benchmarking and stress-testing tools. …
- NVIDIA (Santa Clara, CA)
- …Prepare and deliver technical presentations and workshops to customers + Address and optimize customer AI systems performance issues What we need to see: + ... and interpersonal skills to analyze, define, implement and optimize AI/ML and HPC software and system solutions...performance + Experience in designing, running and troubleshooting performance benchmarks for AI systems…
- NVIDIA (Santa Clara, CA)
- …and Data Structures, Computer Architecture, Compiler Development, Open Source Programming, High-Performance Computing (HPC), Automation Tools (XLA, TVM, ... you're expressing interest in one of our 2026 Systems Software Engineering Internships. We'll review resumes on an...challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest…
- Google (Sunnyvale, CA)
- …and compiler. + Knowledge of performance analysis and experience in performance modeling of High-Performance Computing (HPC) interconnect topologies. + ... analysis and HW/SW Co-Design that can enable cost effective performance and power of future ML systems...using full stack HW-SW design space exploration. The ML, Systems, & Cloud AI (MSCA) organization at…
- NVIDIA (Santa Clara, CA)
- …NVIDIA leads the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing (HPC) and Visualization. DGX Cloud provides a ... serverless generative AI infrastructure to the world enabling NVIDIA's ...in software engineering with a strong track record in performance or scalability of high-scale distributed systems…