• Software engineer - AI/ML, AWS Neuron Inference,…

    Amazon (Seattle, WA)
    …the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role is for a senior software engineer in the ... The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and ... required) - Experience with PyTorch - Working knowledge of Machine Learning and LLM fundamentals including transformer…
    Amazon (09/09/25)
  • Software Development Manager, LLM Inference Model…

    Amazon (Cupertino, CA)
    …develop AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and ... system stack consisting of the PyTorch inference library, Neuron compiler, runtime, and collectives. A day in the life: You will work with your senior management and technical leaders to define the model…
    Amazon (09/06/25)
  • Software Development Manager, AI Inference…

    Amazon (Seattle, WA)
    …develop AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and GPT ... system stack consisting of the PyTorch inference library, Neuron compiler, runtime, and collectives. A day in the life: You will work with your senior management and technical leaders to define the building…
    Amazon (08/15/25)