- Cisco (Milpitas, CA)
- …team engaged in the design, development and execution of tests to qualify network performance for AI .ML capability. In this role you'll have opportunity to: + ... the next generation infrastructure to meet the needs of AI /ML workloads and continuously increasing internet users and application....Quality of Service (QoS) policies to ensure optimal network performance + Exposure to RDMA, HPC networks… more
- NVIDIA (Santa Clara, CA)
- …strategy by collaborating with teams with varied strengths including GPU Compute, Distributed Systems , Networking, ML Infra, AI Platform, and Cloud Services to ... and cost efficiency of telemetry pipelines while supporting high-volume workloads ( AI /ML, HPC clusters, GPU infrastructure) + Embedding security guidelines… more
- General Motors (Austin, TX)
- …data structures, and algorithm design. + Experience with Docker, Kubernetes, and high- performance compute ( HPC ) environments. + Working knowledge of REST APIs ... includes many different technologies around computer vision, robotics, augmented reality, and AI . Our work supports GM manufacturing goals to build quality products… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... ever. Micron's Heterogeneous Integration Group (HIG) is shaping the future of AI and accelerated computing by developing sophisticated memory solutions! The team… more
- NVIDIA (Santa Clara, CA)
- …with performance modeling, profiling, debug, and code optimization of a DL/ HPC /high- performance application + Architectural knowledge of CPU and GPU + GPU ... for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the… more
- Memorial Sloan-Kettering Cancer Center (New York, NY)
- …AI development with healthcare delivery. + Drive the adoption of scalable AI solutions that meet clinical performance and reliability standards. **Key ... at MSK combine advanced statistical methods, deep learning, and high- performance computing to extract insights from complex datasets-particularly in medical… more
- NVIDIA (Santa Clara, CA)
- …training deep learning models at scale, and a good mathematical foundation to analyze new AI algorithms. We focus on AI models for autonomous driving such as ... agent behavior models, end-to-end AV architectures, AI safety, closed-loop training approaches, and AV foundation models...monitoring and debugging tools to ensure the reliability and performance of training workflows on large GPU clusters. What… more
- Cisco (Milpitas, CA)
- …agile team engaged in the design, development and execution of tests to qualify network performance for AI /ML capability. You will be a part of our solutions ... a customer-facing environment + Previous experience leading teams + Exposure network operating systems , preferably SONiC + Exposure to RDMA, HPC networks +… more
- Oracle (Santa Clara, CA)
- …a cutting-edge, ultra-high- performance GPU cluster based Data Centers designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...scale from tens to thousands of GPUs without compromising performance . We are the AI Infrastructure Delivery… more
- Meta (Menlo Park, CA)
- …levels 9. Experience in leading teams working on high performance computing ( HPC ) and AI /ML systems , including: 10. GPU/ASIC-based kernel development and ... systems for our fleet 4. Technical management 5. Experience in systems architecture, performance , workload-analysis and large scale distributed systems … more