- Sesame (San Francisco, CA)
- …like vLLM and SGLang to take advantage of the latest techniques in high‑performance model serving. Work with the training team to identify opportunities to ... variety of LLM, speech, and vision models. Partner with ML infrastructure and training engineers to build a fast, ... Always up to date on the latest techniques for model serving optimization. Preferred Qualifications: Familiarity with…
- Sesame (San Francisco, CA)
- …its innovative voice companion technologies. You will optimize and build a high-performance ML serving layer, collaborating with engineers to create reliable and ... efficient systems. Ideal candidates will have deep expertise in PyTorch and performance engineering. This role offers comprehensive employee benefits including health coverage, unlimited PTO, and 401k matching.
- Cerebras (Palo Alto, CA)
- A leading technology firm is seeking skilled engineers to develop high-performance ML model serving systems. Candidates should have a strong background in ... algorithms, system design, and experience with tools like PyTorch and TensorFlow. Responsibilities include enhancing Ray Serve, optimizing performance, and improving service availability. This role offers a salary range of $170,112 to $237,000 along with…
- Amazon (San Francisco, CA)
- Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference. Job ID: 3067759 | Amazon.com Services LLC. The Annapurna Labs team at Amazon Web ... machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design,…
- Amazon (San Francisco, CA)
- Software Development Engineer, AI/ML, AWS Neuron, Model Inference. The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime,…
- PriorLabs GmbH (San Francisco, CA)
- …working on structured data, and we're accelerating fast. Our TabPFN v2 model, recently published in Nature, sets the new state-of-the-art for structured data. ... for tabular data, we have several key areas where ML Engineers can make significant contributions. As an early ... you might tackle based on your interests and expertise: Model Engineering & Implementation: Build and improve training pipelines…
- General Motors (Sunnyvale, CA)
- …role, you'll work closely with ML engineers and researchers to ensure efficient model serving and inference in production, for their workflows such as data ... efficiency. About the Role: We are seeking a Staff ML Infrastructure engineer to help build and ... Python, C++ or other relevant coding languages. Expertise in ML inference, model serving frameworks…
- Snap Inc. (Palo Alto, CA)
- …Develop high-performance inference systems to ensure fast and efficient AI model serving. Build infrastructure to perform scalable ML model training, ... more efficient and impactful. We're looking for a Software Engineer, ML Infrastructure to join Snap Inc! ... high-performance inference systems to ensure fast and efficient AI model serving. Build comprehensive data management systems…
- Pathway Genomics Corporation (Palo Alto, CA)
- …headquartered in Palo Alto, California. The opportunity: We are looking for a Senior ML Infrastructure / DevOps Engineer who loves Linux, distributed systems, and ... models and services. Own monitoring, logging, and alerting across training and serving: GPU/CPU utilization, latency, throughput, failures, and data/model drift…
- GEICO (Palo Alto, CA)
- …autoscaling, and resource optimization * Design, implement, and maintain feature stores for ML model training and inference pipelines * Build and optimize LLM ... GEICO. For more information, please . Staff Software Engineer - AI/ML Infra page is ... Platform Engineering * Design and maintain robust CI/CD pipelines for ML model deployment using Azure DevOps, GitHub…
Recent Jobs
- Database Support Analyst (On-site)
- TEKsystems (Florham Park, NJ)
- Software Developer 4
- Oracle (Nashville, TN)
- Solutions Architect
- Uturndata (Chicago, IL)