- Oracle (Santa Clara, CA)
- …Cluster Networking team is building an ultra-high-performance network to support AI/ML/HPC workloads. Join us to design systems that scale from tens to ... hundreds of thousands of GPUs without sacrificing performance. Our team develops and tunes the software and hardware stack for distributed workloads using libraries…
- NVIDIA (Santa Clara, CA)
- NVIDIA's AI Factories are designed to speed up AI and HPC workloads! At their core is the Digital Twin, a physics-based model used to design, validate, and ... tokens per watt across GPUs, cooling, power, and control systems. We are looking for an intern to support...with cross-functional teams to ensure alignment with sustainability and performance goals. What We Need to See: + Currently…
- Stanford University (Stanford, CA)
- …groups' meetings and presentations to assist with identifying promising tools and systems and to discuss their computational challenges and requirements. * Engage ... on the use of a broad set of cyberinfrastructure systems, tools, and software. * Provide support for Stanford...debugging techniques. Working knowledge of at least one mainstream ML/AI framework and how to execute efficiently in an…
- Oracle (Sacramento, CA)
- …GPU slicing (e.g., NVIDIA MIG), and GPU virtualization technologies to support high-performance media workloads. + Lead the architecture, design, and implementation of ... GPU-accelerated cloud services (virtual workstations, rendering clusters, AI/ML-enabled media tools). + Ensure services are built for scale, availability,…
- NVIDIA (Santa Clara, CA)
- …NVIDIA NVLink, NVIDIA Networking, NVIDIA Data Center CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are seeking an excellent Senior Engineering ... NVIDIA data center systems have become core to NVIDIA's rapidly growing...teams, ensuring robust bring up, productization, and delivery. + Performance Management: Conduct performance evaluations, develop a…
- Cisco (San Francisco, CA)
- …data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will impact billions globally. Supply ... teams while collaborating on ASIC Design and Verification for reliable, high-performance products. + Drive innovation in System/Board Design, leveraging excellent…
- Amazon (Cupertino, CA)
- Annapurna Labs builds high-performance hardware and software solutions used in AWS data centers globally. We're looking for an experienced software ... Collective Communication group. The group delivers the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack…
- NVIDIA (Santa Clara, CA)
- …help ensure the smooth delivery of complex, high-profile programs that drive global AI advancements. What you will be doing: + Drive strategic CSP partnerships, ... Management: Proven ability to lead software development for rack-scale systems and data center servers, including complex hardware/software integration projects.…
- Cisco (San Jose, CA)
- …data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will affect billions globally. Supply ... (e.g., Altium, Cadence, Mentor Graphics). + Foundational knowledge of embedded systems and microcontrollers. + Familiarity with scripting or programming languages (e.g.,…
- NVIDIA (Santa Clara, CA)
- …the last decade, Python has become the de-facto programming language for practitioners in AI, data science and HPC, through popular frameworks such as NumPy, ... for accelerated numerical/scientific computing libraries + Analyze and improve the performance of developed APIs on various CPU and GPU architectures, especially…