• Principal ML Platform Engineer

    NVIDIA (Santa Clara, CA)
    …learning innovation. In this role, you will architect, scale, and optimize high-performance ML infrastructure used across NVIDIA's AI research and product teams. ... GPU clusters. + Develop internal tools and automation for ML workflow orchestration, resource scheduling, data access,...with a focus on high availability and performance for AI workloads. + Define and monitor ML -specific… more
    NVIDIA (09/19/25)
    - Related Jobs
  • Senior Manager, Machine Learning Engineer…

    Cisco (San Jose, CA)
    …observability, incident response, prompt versioning, and feedback loops. Ensure responsible AI practices and data governance are followed. Qualifications ... ML /LLM systems. Strong understanding of LLMs, fine-tuning, prompt engineering , vector databases (eg, Pinecone, Weaviate, FAISS), and RAG patterns. Experience… more
    Cisco (09/18/25)
    - Related Jobs
  • SDE I - Systems, Runtime, and ML

    Amazon (Cupertino, CA)
    …for custom ML accelerators (Inferentia and Trainium), democratizing access to AI infrastructure. This team bridges the gap between popular ML frameworks ... - Learn and apply new technologies to solve complex engineering challenges About the team Candidates will be routed...AI workloads. While we don't work directly on ML algorithms, we build the critical infrastructure that makes… more
    Amazon (07/15/25)
    - Related Jobs
  • ML Acceleration / Framework Engineer…

    Amazon (Cupertino, CA)
    …scalable deployments with vLLM, Triton, and TensorRT-turning breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team ... large models using Python is a must. FSDP (Fully-Sharded Data Parallel), Deepspeed, Nemo and other distributed training libraries...performance. You'll also develop and integrate new features in ML frameworks to support AWS AI services.… more
    Amazon (07/15/25)
    - Related Jobs
  • Staff, Technical Program Manager

    Walmart (Sunnyvale, CA)
    …delivery roadmaps and measured business value. + Serve as the connective tissue across Data Engineering , AI / ML , Security, Product, and Infrastructure ... to drive cross-functional execution and strategic alignment for our Data Engineering organization. This role is focused...cloud data ecosystems (eg, Databricks, Snowflake, AWS/GCP/Azure data services). + Exposure to AI / ML more
    Walmart (09/25/25)
    - Related Jobs
  • Software Engineer, Systems ML - SW/HW…

    Meta (Menlo Park, CA)
    …Systems ML - SW/HW Co-design Responsibilities: 1. Apply relevant AI infrastructure and hardware acceleration techniques to build & optimize our intelligent ... Hardware accelerators architecture, GPU architecture, machine learning compilers, or ML systems, AI infrastructure, high performance computing, performance… more
    Meta (08/01/25)
    - Related Jobs
  • Matterport - Senior ML Ops Engineer

    CoStar Realty Information, Inc. (Sunnyvale, CA)
    …Science, or a related quantitative field. + 5+ years of industry experience in ML Model Optimization, ML Engineering , or MLOps, particularly with large-scale ... efficient models into production. You will work closely with ML R&D Engineers and other engineering teams...up-to-date with the latest research and industry trends in ML model optimization, hardware acceleration, and efficient AI more
    CoStar Realty Information, Inc. (08/28/25)
    - Related Jobs
  • Sr. Mgr., ML Infrastructure, PV…

    Amazon (Sunnyvale, CA)
    …advanced relevance models, and deep insights into viewer behavior. As a Senior Manager, ML Infrastructure, you will lead multiple engineering teams to define the ... combines the art of curation and personalization with sophisticated data science and machine learning algorithms. We shape the...solutions. Your technical leadership will shape the future of AI / ML across Prime Video, unlocking generative and… more
    Amazon (08/27/25)
    - Related Jobs
  • Staff Software Engineer, ML Performance

    Google (Mountain View, CA)
    …to solve ML tasks more efficiently, or new techniques to reduce the label/unlabeled ML data needed to train a model to target accuracy. + Engage with Google ... performance and extracting maximum efficiency for machine learning and Artificial Intelligence ( AI ) workloads. We drive Google ML performance to use deep… more
    Google (09/27/25)
    - Related Jobs
  • Software Engineer, Systems ML - SW/HW…

    Meta (Burlingame, CA)
    …Systems ML - SW/HW Co-design Responsibilities: 1. Apply relevant AI infrastructure and hardware acceleration techniques to build & optimize our intelligent ... Hardware accelerators architecture, GPU architecture, machine learning compilers, or ML systems, AI infrastructure, high performance computing, performance… more
    Meta (08/27/25)
    - Related Jobs