• Staff, Software Engineer - Conversational AI

    Walmart (Sunnyvale, CA)
    …(more computations) and model serving latency. So, we are always on a quest to crunch more numbers while preserving our SLAs and controlling the operational ... tooling (TensorFlow Serving? / vLLM? / Triton?) and infrastructure (CPU? / GPU?, GCP? / Azure?) for model serving...(cloud infrastructure). + You will provide robust, built-in diagnostics for quality control throughout. + You will integrate…
    Walmart (06/24/25)
  • Staff, Software Engineer - Conversational AI

    Walmart (Sunnyvale, CA)
    …(more computations) and model serving latency. So, we are always on a quest to crunch more numbers while preserving our SLAs and controlling the operational ... in terms of architecture, tooling (TensorFlow Serving? / ONNX? / Triton?) and infrastructure (CPU? / GPU?, GCP? / Azure?) for model serving based on the latest…
    Walmart (05/16/25)