- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead NKI requirements working ... to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom ...Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more
- Amazon (Cupertino, CA)
- …are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops...AWS Neuron Software Development Kit (SDK), which includes an ML compiler, Neuron Kernel Interface (NKI) compiler,… more
- Deloitte (San Jose, CA)
- …familiarity with Go or Rust a plus. + Strong understanding of AI/ ML frameworks (PyTorch, TensorFlow, ONNX) and performance /model optimization. + Familiarity ... Sr . Java Full Stack Developer - Project Delivery...do/Responsibilities Join our AI and Systems Co-Design team, pioneering high- performance software and hardware technologies for AI and next-generation… more
- Google (Sunnyvale, CA)
- Senior Software Engineer, Core ML Frameworks _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... internal and external users. + Build infrastructure and tooling for kernel development, including benchmarking suites, auto-tuning frameworks, performance … more
- Amazon (Cupertino, CA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance . The Inference Enablement and Acceleration team ... a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from… more
- quadric.io, Inc (Burlingame, CA)
- …and endpoint devices, ranging from battery operated smart-sensor systems to high- performance automotive or autonomous vehicle systems. Unlike other NPUs or neural ... Domain Knowledge: Demonstrated ability to drive complex technical projects in the AI/ ML and embedded processing domain. Highly Desired Skills and Experience (Pluses)… more
- NVIDIA (Santa Clara, CA)
- …non- ML computer vision + Strong fundamentals with system-level performance : multi-threaded, multi-process and distributed software development. + Grounding in ... pre- and post-processing. + Improve the efficiency of VLM models themselves: kernel optimization in CUDA + Upstream improvements to SDKs and libraries across… more