- NVIDIA (Santa Clara, CA)
- …Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users ... planning while keeping an eye on capacity, latency and performance . SRE is also a mindset and a set...software development focuses on eliminating manual work through automation, performance tuning and growing efficiency of production systems. As… more
- Amazon (Cupertino, CA)
- …we are able to develop creative and new designs that set the standards on performance , quality, cost, and operational excellence. What you will do: As a member of ... be constantly looking for ways to improve your products' performance , quality, and cost. We're changing an industry, and...and signal integrity, failure analysis, server components (eg CPU, GPU , SSDs, drives), BIOS, BMC, and networking - Excellent… more
- NVIDIA (CA)
- …NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High- Performance Computing and Visualization. The GPU , our invention, serves ... A strong AI and/or machine learning background with the ability to improve workload performance is a plus. What will you be doing: + Partner with multiple internal… more
- NVIDIA (Santa Clara, CA)
- What You'll be Doing: + Build system hardware products around GPU & Tegra SoC. + Collaborate with cross-function team to pursue the balance of product cost, ... performance , and schedule under the guidance of system architects...manufactures and partners. + Optimize/invent circuits, functions for better performance , and lower cost. + Improve the design flow… more
- Oracle (Sacramento, CA)
- …of new optics and transceivers solutions that help connect and drive the GPU , compute, metro and backbone. They write the test and qualification requirements and ... covering numerous areas of network hardware engineering including cabling, low level device performance , thermals, etc. As OCI is a cloud-based network with a global… more
- Oracle (Sacramento, CA)
- …remediation (CVE), and hardening. + Collaborate with platform, BIOS/EDKII, and GPU teams to deliver cohesive platform management; provide rapid triage for ... contributions where appropriate. + Support sustaining activities: defect resolution, performance /footprint optimizations, diagnostics, telemetry, and field reliability improvements. What… more
- NVIDIA (Santa Clara, CA)
- …potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the ... in C/C++/Python/CUDA in a multi-core environment. + Support system integration, performance testing, system demonstration and lab trials for end-to-end system. +… more
- NVIDIA (Santa Clara, CA)
- …potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the ... stack. NVIDIA NVLink Fusion will enable industry-leading AI scale-up and scale-out performance with NVIDIA technology plus semi-custom ASICs or CPUs. NVIDIA's robust… more
- NVIDIA (Santa Clara, CA)
- …to do more world-changing work than ever before. We are now looking for a Performance Engineer Intern focused on Deep Learning (DL) & High- Performance ... internship: 8 months minimum What you'll be doing: + Plan and execute GPU performance benchmarking across a wide range of HPC and DL frameworks and applications… more
- Meta (Menlo Park, CA)
- … GPU training and inference fleet through an observable, reliable and high- performance distributed AI/ GPU communication stack. Currently, one of the team's ... learning/deep learning domains: High speed networking (RDMA), Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance … more