- NVIDIA (Santa Clara, CA)
- …agentic tools and applications and ensuring their seamless and efficient performance. If you're passionate about the latest research and cutting-edge technologies ... engineers. + Work with HW chip designers and LLM research teams to grasp GPU design needs and align LLM infrastructure accordingly. + Optimize the infrastructure for…
- Deloitte (San Francisco, CA)
- …spans all relevant technologies from on-prem and cloud deployment, high performance computing, automation, DevOps, LLM/MLOps, data engineering while streamlining IT ... computing or on-prem technologies + Design and lead development on scalable, high-performance data architecture solutions that support both the client business as…
- Amazon (Cupertino, CA)
- …forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the best-in-class ... ML training performance with the most teraflops (TFLOPS) of compute power...code generation, optimization, and instruction architectures including CPU, NPU, GPU and novel forms of compute. Explore the Product:…
- NVIDIA (Santa Clara, CA)
- …model training/inference performance on GPUs. + Experience developing and optimizing GPU kernels for deep learning, with a focus on GEMM and attention kernels. ... doing: + Collaborating closely with customers to improve their workload performance and reduce infrastructure costs. + Leading and developing proof-of-concepts for…
- Oracle (Sacramento, CA)
- …The Compute Scaled Manufacturing organization's mission is to meet surging GPU demand for Oracle's AI infrastructure by scaling the server qualification ... should be both a rock-solid coder and a lead-level engineer, able to dive deep into any part of...systems, and distributed systems fundamentals. + Strong troubleshooting and performance tuning skills. + Experience with REST API and…
- NVIDIA (Santa Clara, CA)
- …for building tooling and services. + Experience architecting solutions for GPU-accelerated or other high-performance computing workloads. + Excellent ... and the follow-through to harden them into enterprise-grade software, ensuring reliability, performance, and security across thousands of GPUs. You will shape our…
- Oracle (Sacramento, CA)
- …for identifying, solutioning, and implementing AI solutions to the corresponding GPU IaaS or PaaS. **Qualifications and experience** + Doctoral or master's ... and solutions for production, relevant professional experience as end-to-end solutions engineer or architect (data engineering, data science and ML engineering is…
- Meta (Sunnyvale, CA)
- …AI systems, and hardware/software co-design, AI-driven compiler, system design and performance optimization, etc. 3. Analyze and improve efficiency, scalability, and ... C++, C, or other related languages 9. Experience in real-system implementations (e.g., GPU, NPU) 10. Experience building systems based on machine learning and/or deep…
- NVIDIA (Santa Clara, CA)
- …potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the ... inference infrastructure. In this groundbreaking role, you will drive performance and scalability in distributed AI systems. You will...are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want…
- NVIDIA (Santa Clara, CA)
- …intelligence (AI), signal processing algorithms and applications for our NVIDIA GPU-accelerated wireless platforms. Artificial Intelligence is transforming how we ... is a significant plus. + Prior experience developing 5G RAN algorithms, performance optimizations and benchmarking, acceleration architecture, etc. is a definite plus…