• Sr. Machine Learning Engineer

    Amazon (San Diego, CA)
    …PhD in Computer Sciences, Electrical Engineering, or Mathematics with specialization in machine learning, deep learning, or natural language processing - ... lead or leading an engineering team - Experience in developing machine learning pipelines, developing large language model or natural language understanding models.… more
    Amazon (08/15/25)
  • Principal Staff Software Engineer, AI…

    LinkedIn (Mountain View, CA)
    …years of experience in the industry with leading / building deep learning systems + Hands-on experience developing distributed systems or other large-scale ... responsible for developing and maintaining highly available and scalable deep learning training solutions to power our...work for Training infrastructure. As a Principal Staff Software Engineer on the AI Training Infra team, you will… more
    LinkedIn (09/25/25)
  • R&D Engineer, VCF Cluster Management Team

    Broadcom (Palo Alto, CA)
    …least 3+ years in a role focusing on distributed systems. + Distributed Systems Expertise: Deep understanding and hands-on experience with various ... solutions. We are dedicated to building robust, scalable, and high-performance distributed systems that empower enterprises to achieve their digital transformation… more
    Broadcom (09/30/25)
  • Principal Engineer

    NVIDIA (CA)
    …software. Ways to stand out from a crowd: + Experience architecting or developing large-scale distributed systems for deep learning + Knowledge of CPU and/or ... We are now looking for a Principal Research Engineer focused on Generative AI inference. Are you...in Speech Recognition, Speech Synthesis, Natural Language Processing and Deep Learning + Architecting and implementing features… more
    NVIDIA (08/22/25)
  • Senior ML Storage Engineer - GPU Clusters

    NVIDIA (Santa Clara, CA)
    …to run their flows on our clusters including performance analysis and optimizations of deep learning workflows and participate in the team's on-call rotation to ... are seeking a highly skilled and experienced Site Reliability Engineer to design, deploy, and manage high speed storage...container networking and storage architecture. + Experience with Machine Learning and Deep Learning concepts,… more
    NVIDIA (07/31/25)
  • Software Development Engineer, AI/ML, AWS…

    Amazon (Cupertino, CA)
    …Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures, where… more
    Amazon (09/04/25)
  • Principal Software Engineer - AI Ads

    Microsoft Corporation (Mountain View, CA)
    …This is a rare opportunity to work on cutting-edge AI, big data, and deep learning systems while collaborating with world-class scientists and engineers to ... Product Insights, Selection, Relevance, Modeling, and Personalization. The team leverages deep learning, LLMs/SLMs, AI, NLP, information retrieval, big data,… more
    Microsoft Corporation (09/19/25)
  • Senior DL Algorithms Engineer - Cosmos

    NVIDIA (Santa Clara, CA)
    …for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and deploying ... this role, you will focus on optimizing and deploying deep learning models for efficient and fast...for large-scale video data processing and model post-training. + Deep understanding of distributed systems for large-scale… more
    NVIDIA (08/08/25)
  • Senior HPC and AI Networking Performance Research…

    NVIDIA (Santa Clara, CA)
    …profile and analyze AI workloads on large GPUs and CPUs scale clusters for distributed Deep Learning LLM training focused on collectives communication and ... AI workloads and DL models specifically tailored for large-scale deep learning LLM training on NVIDIA supercomputers and distributed systems focusing on… more
    NVIDIA (09/03/25)
  • Staff Software Engineer, Infrastructure…

    Coinbase (Sacramento, CA)
    …track record of building and operating scalable, fault-tolerant, and highly available distributed systems. * You have deep, hands-on experience with incident ... Datastores team is the architect and guardian of that foundation. We engineer the critical database infrastructure that powers every product, every transaction, and… more
    Coinbase (09/12/25)