Sr ML Kernel Performance Jobs in California

29 jobs (page 1)

Categories

All Categories

Engineering (13)

Software/IT (6)

Sr . ML Kernel…

Amazon (Cupertino, CA)

…Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more

Amazon (11/14/25)
- Related Jobs
Software Engineering Manager, ML…

Amazon (Cupertino, CA)

…Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more

Amazon (12/04/25)
- Related Jobs
Sr . Product Manager - Kernels, AI/…

Amazon (Cupertino, CA)

…stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead NKI requirements working ... to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom ...Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost… more

Amazon (12/02/25)
- Related Jobs
Senior Linux Kernel Systems Software…

NVIDIA (Santa Clara, CA)

NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more

NVIDIA (10/01/25)
- Related Jobs
Software Dev Engineer II - Neuron Kernel…

Amazon (Cupertino, CA)

…are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops...AWS Neuron Software Development Kit (SDK), which includes an ML compiler, Neuron Kernel Interface (NKI) compiler,… more

Amazon (10/25/25)
- Related Jobs
Sr . Java Full Stack Developer

Deloitte (San Jose, CA)

…familiarity with Go or Rust a plus. + Strong understanding of AI/ ML frameworks (PyTorch, TensorFlow, ONNX) and performance /model optimization. + Familiarity ... Sr . Java Full Stack Developer - Project Delivery...do/Responsibilities Join our AI and Systems Co-Design team, pioneering high- performance software and hardware technologies for AI and next-generation… more

Deloitte (11/02/25)
- Related Jobs
Senior Software Engineer, Core ML…

Google (Sunnyvale, CA)

Senior Software Engineer, Core ML Frameworks _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... internal and external users. + Build infrastructure and tooling for kernel development, including benchmarking suites, auto-tuning frameworks, performance … more

Google (12/04/25)
- Related Jobs
Senior Software Development Engineer - AI/…

Amazon (Cupertino, CA)

…with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance . The Inference Enablement and Acceleration team ... a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from… more

Amazon (12/10/25)
- Related Jobs
Director / Sr Program Manager, AI…

quadric.io, Inc (Burlingame, CA)

…and endpoint devices, ranging from battery operated smart-sensor systems to high- performance automotive or autonomous vehicle systems. Unlike other NPUs or neural ... Domain Knowledge: Demonstrated ability to drive complex technical projects in the AI/ ML and embedded processing domain. Highly Desired Skills and Experience (Pluses)… more

quadric.io, Inc (10/18/25)
- Related Jobs
Senior Computer Vision, VLM…

NVIDIA (Santa Clara, CA)

…non- ML computer vision + Strong fundamentals with system-level performance : multi-threaded, multi-process and distributed software development. + Grounding in ... pre- and post-processing. + Improve the efficiency of VLM models themselves: kernel optimization in CUDA + Upstream improvements to SDKs and libraries across… more

NVIDIA (12/03/25)
- Related Jobs

"Alerted.org

Advanced Search

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?