• Software Development Manager, LLM Inference Model…

    Amazon (Cupertino, CA)
    …develop AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and ... system stack consisting of the PyTorch inference library, Neuron compiler, runtime, and collectives. A day in the life ... You will work with your senior management and technical leaders to define the model…
    Amazon (09/06/25)