- Broadcom (Palo Alto, CA)
- …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes-based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
- Amazon (Cupertino, CA)
- …an experienced Technical Product Manager to define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the AWS ... ML performance in the cloud. You will lead runtime and infrastructure requirements working backward from customer needs,...to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the… more
- pony.ai (Fremont, CA)
- …evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public on NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony.ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more
- Amazon (Cupertino, CA)
- …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design, ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that...of the most interesting and impactful infrastructure challenges in AI / ML today. Basic Qualifications - Bachelor's degree… more
- Amazon (Cupertino, CA)
- …use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for ... and performance tuning of a wide variety of LLM model families, including massive scale large language models like...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
- Amazon (Cupertino, CA)
- …use them. This role is for a software engineer in the Machine Learning Inference Model Enablement and Generality team for AWS Neuron at Annapurna Labs. This role is ... and performance tuning of a wide variety of LLM model families, including massive scale large language models like...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
- TP-Link North America, Inc. (Irvine, CA)
- …enable consumers to enjoy a seamless, effortless lifestyle. We are seeking a Senior AI / ML Computer Vision Engineer to drive the development and deployment of ... AI-powered features across our smart home automation product lines,...video processing with hands-on experience in deploying and optimizing ML models on constrained edge devices. Responsibilities + Lead… more
- TP-Link North America, Inc. (Irvine, CA)
- We are seeking a Staff AI / ML Computer Vision Engineer to design and develop cutting-edge AI-powered features for our next-generation smart home ... project execution, mentoring engineers, setting standards for deploying efficient, real-time AI at the edge, and ensuring seamless integration with cloud… more
- Amazon (Cupertino, CA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe, are...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
- Amazon (Cupertino, CA)
- …and hardware acceleration. The ideal candidate will have a solid understanding of ML frameworks, compilation systems, and runtime architectures. They should be ... Product Manager to define and drive product strategy for ML framework integration. You will be part of the...optimization, profiling and tooling - Experience with Deep Learning model training or inference. - Experience with distributed computing… more