• Aldea Inc (San Francisco, CA)
    …contextual, and intelligent human-machine interface. The Role We are hiring a Research Engineer (Machine Learning) to build the infrastructure that powers ... Aldea's multi-modal AI research . You will design, optimize, and scale the training...training or inference systems. Preferred Qualifications Experience with custom kernel development ( CUDA , Triton) or GPU optimization.… more
    job goal (01/14/26)
    - Related Jobs
  • Anthropic (San Francisco, CA)
    …define the path forward Strong candidates may also have experience with GPU Kernel Development: CUDA , Triton, CUTLASS, Flash Attention, tensor core optimization ... innovations in GPU performance and systems engineering. As a GPU Performance Engineer , you'll architect and implement the foundational systems that power Claude and… more
    job goal (01/14/26)
    - Related Jobs
  • Apple Inc. (San Francisco, CA)
    Senior Software Engineer , Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best map in the ... powering experiences across Maps. You will partner closely with research and product teams, take end-to-end ownership, and deliver...measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead… more
    job goal (01/14/26)
    - Related Jobs
  • Red Hat, Inc. (Boston, MA)
    …Must have two (2) years of experience with: Python and Modern C++; CUDA , Triton, or CUTLASS kernel optimization; Deep learning frameworks, including PyTorch; ... Machine Learning Engineer page is loaded## Machine Learning Engineerremote type:...on NVIDIA GPUs using tools such as Nsight, tune CUDA , Triton, or CUTLASS kernels for deep neural networks.*… more
    job goal (01/14/26)
    - Related Jobs
  • OpenAI (San Francisco, CA)
    …the core distributed machine-learning training runtime that powers everything from early research experiments to frontier-scale model runs. With a dual mandate to ... iterate quickly and run reliably at any scale, partnering closely with model-stack, research , and platform teams. Success for us is measured by raising both training… more
    job goal (01/14/26)
    - Related Jobs
  • Relace (San Francisco, CA)
    …environments. Requirements Strong background in systems-level ML engineering. Experience with CUDA , GPU kernel optimization, and performance tuning. Fluency in ... you. The Role We're looking for a Machine Learning Engineer who loves getting close to the metal. This...smart systems design. The ideal candidate is excited by CUDA kernels, memory layouts, GPU scheduling, and squeezing performance… more
    job goal (01/14/26)
    - Related Jobs
  • kadence (San Francisco, CA)
    …Nice to Have Deep understanding of training architectures (PyTorch/JAX internals, CUDA kernel optimization, TPU environments). Experience building or managing ... datasets. About the Role We're looking for a Machine Learning Engineer with hands‑on experience in model development (training, fine‑tuning, feature engineering)… more
    job goal (01/14/26)
    - Related Jobs
  • Senior Research Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA's AI Developer Tools organization is seeking a Senior Research Engineer to join our Quality team, where we're building the definitive benchmarks and ... important parallel computing platform. Our growing team operates at the intersection of CUDA domain expertise and cutting-edge AI research . While evaluation is… more
    NVIDIA (01/10/26)
    - Related Jobs
  • Sr. ML Kernel Performance Engineer

    Amazon (Cupertino, CA)
    …or HPC such as GPUs, CPUs, FPGAs, or custom architectures - Experience with GPU kernel optimization and GPGPU computing such as CUDA , NKI, Triton, OpenCL, SYCL, ... on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing performance for… more
    Amazon (11/14/25)
    - Related Jobs
  • Senior GenAI Algorithms Engineer - Model…

    NVIDIA (Santa Clara, CA)
    …Face, vLLM, SGLang). You may also dive deeper into GPU-level optimization, including custom kernel development with CUDA and Triton. This role offers a unique ... with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal… more
    NVIDIA (01/10/26)
    - Related Jobs