• Lead AI Platform Engineer

    Elevance Health (San Francisco, CA)
    …backward compatibility and deprecation processes. + Optimize cost and performance (autoscaling, concurrency, GPU/ CPU scheduling for inference, storage/egress ... **Lead AI Platform Engineer ** **Location:** This role requires associates to be...the AI platform (APIs, data pipelines, developer hub/marketplace). Set architecture , elevate engineering standards, and ensure systems are secure,… more
    Elevance Health (09/19/25)
    - Related Jobs
  • Software Dev Engineer II, Echo Platform

    Amazon (Sunnyvale, CA)
    …Collaborate with cross-functional teams to prototype new technologies * Optimize system performance across memory, storage, and CPU utilization * Participate in ... will have an enormous opportunity to make a significant impact on the design, architecture , and implementation of the latest Echo products used by people every day.… more
    Amazon (08/27/25)
    - Related Jobs
  • Staff, Software Engineer - Conversational…

    Walmart (Sunnyvale, CA)
    …capabilities in at least some of the following areas: + Service oriented architecture in charge of exposing our NLU capabilities at scale, and enabling increasingly ... to always find the best tradeoffs in terms of architecture , tooling (Tensorflow serving? / VLLM? / Triton?) and...tooling (Tensorflow serving? / VLLM? / Triton?) and infrastructure ( CPU ? / GPU?, GCP? / Azure?) for model serving… more
    Walmart (09/23/25)
    - Related Jobs
  • Principal, Software Engineer

    Walmart (Sunnyvale, CA)
    …our capabilities in at least some of the following areas: Service oriented architecture in charge of exposing our NLU capabilities at scale, and enabling ... to always find the best tradeoffs in terms of architecture , tooling (Tensorflow serving? / ONNYX? / Triton?) and...tooling (Tensorflow serving? / ONNYX? / Triton?) and infrastructure ( CPU ? / GPU?, GCP? / Azure?) for model serving… more
    Walmart (08/15/25)
    - Related Jobs
  • Software Development Engineer AI/ML,…

    Amazon (Cupertino, CA)
    …AWS Inferentia and Trainium machine learning accelerators, designed to deliver high- performance , low-cost inference at scale. The Neuron Serving team develops ... and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a… more
    Amazon (09/21/25)
    - Related Jobs
  • Senior System Software Engineer , AI…

    NVIDIA (Santa Clara, CA)
    …of DL architectures, PyTorch, and distributed training methods. + Understanding of CPU /GPU architecture plus CUDA, cuDNN, TensorRT‑LLM, Triton, NCCL + Excellent ... market, we need a dedicated and motivated System Software Engineer who is passionate about AI Infrastructure. You will...and application development knowledge to evaluate user experience and performance of our AI platforms, SDKs, libraries and AI… more
    NVIDIA (08/08/25)
    - Related Jobs
  • Senior, Software Engineer - Machine…

    Walmart (Sunnyvale, CA)
    …capabilities in at least some of the following areas: - Service oriented architecture in charge of exposing our NLU capabilities at scale, and enabling increasingly ... to always find the best tradeoffs in terms of architecture , tooling (Tensorflow serving / ONNYX / Triton) and...tooling (Tensorflow serving / ONNYX / Triton) and infrastructure ( CPU / GPU, GCP/Azure) for model serving -- based… more
    Walmart (08/20/25)
    - Related Jobs
  • Sr. ASIC Design Engineer , Cloud-Scale…

    Amazon (Cupertino, CA)
    …and making the right trade-offs. Key job responsibilities As an ASIC Design Engineer , you will: * Develop and implement high- performance , area and ... and architectures to optimize trade-offs between features, power consumption, performance , and area requirements * Implement SystemVerilog RTL, and deliver… more
    Amazon (09/13/25)
    - Related Jobs
  • Senior GenAI Algorithms Engineer

    NVIDIA (Santa Clara, CA)
    …deployment environments would be an asset (eg TRT, ONNX, Triton) + Knowledge of GPU/ CPU architecture and related numerical software Your base salary will be ... We are now looking for a Senior Gen AI Algorithms Engineer ! NVIDIA is seeking engineers to design, develop and optimize Artificial Intelligence solutions to diverse… more
    NVIDIA (09/09/25)
    - Related Jobs
  • Emulation Engineer II

    Microsoft Corporation (Santa Clara, CA)
    …increased agility and deliver significantly superior performance compared to CPU -based alternatives **Responsibilities** As an Emulation Engineer II in the ... software and hardware expertise to create a highly programmable and high- performance ASIC with the capability to efficiently handle large data streams.… more
    Microsoft Corporation (09/23/25)
    - Related Jobs