- DatologyAI (Redwood City, CA)
- …looking for an engineer with deep experience building and operating large-scale training and inference systems. You will design, implement, and maintain the ... researchers to productionize new models and features quickly and safely. Optimize training and inference pipelines for performance, reliability, and cost. Ensure…
- Capital One (Fredericksburg, VA)
- …Java, or Golang Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware ... Lead AI Engineer (FM Hosting, LLM Inference) Overview...support AI software components including foundation model training, large language model inference, similarity search,…
- Amazon (San Francisco, CA)
- Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference. Job ID: 3067759 | Amazon.com Services LLC. The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference…
- Menlo Ventures (San Francisco, CA)
- About This Role: As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What…
- Amazon (San Francisco, CA)
- Software Development Engineer, AI/ML, AWS Neuron, Model Inference. The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and…
- NVIDIA Corporation (Santa Clara, CA)
- Senior Deep Learning Software Engineer, Inference. Locations: US, CA, Santa Clara: ... Requisition ID: JR2002670. NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for...and Python experience is a plus. Prior experience with training, deploying or optimizing the inference of…
- OpenAI (San Francisco, CA)
- …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that…
- Amazon (Seattle, WA)
- …cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... Overview AWS Neuron is the complete software stack for the AWS Inferentia and Trainium...programming language Fundamentals of machine learning models, their architecture, training and inference lifecycles along with work…
- OpenAI (San Francisco, CA)
- …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...work is inherently cross-functional: you'll collaborate directly with researchers training these models and with product teams defining new…
- Google Inc. (Sunnyvale, CA)
- Software Engineer III, Infrastructure, Inference Control Plane. Google, Sunnyvale, CA, USA. Bachelor's degree or equivalent practical ... goes on and is growing every day. As a software engineer, you will work on a...push technology forward. The mission of Vertex AI Online Inference Infrastructure team is to build a model serving…