- Meta (Sunnyvale, CA)
- **Summary:** Meta is seeking an ASIC Engineer , Architecture to join our Infrastructure organization. Our servers and data centers are the foundation upon which our ... and map data center workloads to ASIC architecture, as well as develop performance and functional models to validate the architecture 3. Implement various reference… more
- Amazon (Sunnyvale, CA)
- …in deep-dive analysis and profiling of production code * Optimize inference performance across various platforms (on-device, cloud-based CPU, GPU , proprietary ... with the world. We push the limits of inference performance to provide the best possible experience for our...existing systems experience - 1+ years of software development engineer or related occupational experience - 1+ years of… more
- Deloitte (San Francisco, CA)
- …spans all relevant technologies from on-prem and cloud deployment, high performance computing, automation, DevOps, LLM/MLOps, data engineering while streamlining IT ... computing or on-prem technologies + Design and lead development on scalable, high- performance data architecture solutions that supports both the client business as… more
- Amazon (Cupertino, CA)
- …forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the best-in-class ... ML training performance with the most teraflops (TFLOPS) of compute power...code generation, optimization, and instruction architectures including CPU, NPU, GPU and novel forms of compute. Explore the Product:… more
- NVIDIA (Santa Clara, CA)
- …model training/inference performance on GPUs. + Experience developing and optimizing GPU kernels for deep learning, with a focus on GEMM and attention kernels. ... doing: + Collaborating closely with customers to improve their workload performance and reduce infrastructure costs. + Leading and developing proof-of-concepts for… more
- Oracle (Sacramento, CA)
- …The Compute Scaled Manufacturing organization's mission is to meet surging GPU demand for Oracle's AI infrastructure by scaling the server qualification ... should be both a rock-solid coder and a lead-level engineer , able to dive deep into any part of...systems, and distributed systems fundamentals. + Strong troubleshooting and performance tuning skills. + Experience with REST API and… more
- NVIDIA (Santa Clara, CA)
- …for building tooling and services. + Experience architecting solutions for GPU -accelerated or other high- performance computing workloads. + Excellent ... and the follow-through to harden them into enterprise-grade software, ensuring reliability, performance , and security across thousands of GPUs. You will shape our… more
- Oracle (Sacramento, CA)
- …for identifying, solutioning, and implementing AI solutions to the corresponding GPU IaaS or PaaS. **Qualifications and experience** + Doctoral or master's ... and solutions for production, relevant professional experience as end-to-end solutions engineer or architect (data engineering, data science and ML engineering is… more
- Meta (Sunnyvale, CA)
- …AI systems, and hardware/software co-design, AI-driven compiler, system design and performance optimization, etc. 3. Analyze and improve efficiency, scalability, and ... C++, C, or other related languages 9. Experience in real-system implementations (eg, GPU , NPU) 10. Experience building systems based on machine learning and/or deep… more