- Broadcom (Palo Alto, CA)
- …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
- Amazon (Cupertino, CA)
- …an experienced Technical Product Manager to define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the AWS ... ML performance in the cloud. You will lead runtime and infrastructure requirements working backward from customer needs,...to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the… more
- pony.ai (Fremont, CA)
- …evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony. ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more
- Amazon (Cupertino, CA)
- …use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for ... and performance tuning of a wide variety of LLM model families, including massive scale large language models like...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
- Google (Mountain View, CA)
- …software and systems for power, thermal, and memory footprint constraints. + Experience with AI / ML model architectures, including Generative AI models ... years of experience in system architecture, software engineering, or AI / ML development. + 5 years of experience...SoCs), with experience in Tensor. + Familiarity with on-device AI inference, runtime environments, and model… more
- TP-Link North America, Inc. (Irvine, CA)
- …enable consumers to enjoy a seamless, effortless lifestyle. We are seeking a Senior AI / ML Computer Vision Engineer to drive the development and deployment of ... AI -powered features across our smart home automation product lines,...video processing with hands-on experience in deploying and optimizing ML models on constrained edge devices. Responsibilities + Lead… more
- Amazon (Cupertino, CA)
- …and hardware acceleration. The ideal candidate will have a solid understanding of ML frameworks, compilation systems, and runtime architectures. They should be ... Product Manager to define and drive product strategy for ML framework integration. You will be part of the...optimization, profiling and tooling - Experience with Deep Learning model training or inference. - Experience with distributed computing… more
- Amazon (Cupertino, CA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe, are...side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed… more
- Amazon (Cupertino, CA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... a senior software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role is...side by side with chip architects, compiler engineers and runtime engineers to create , build and tune distributed… more
- Amazon (Cupertino, CA)
- …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, ... for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role is...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
Recent Jobs
-
Intl - LATAM/India - ThoughtSpot Engineer
- Insight Global (San Francisco, CA)
-
Behavioral/ABA Techs - Bosoton
- Amergis (Boston, MA)
-
Senior Wealth Strategist - PNC Private Bank
- PNC (Nashville, TN)
-
R&D Principal Software Engineer - Security Engineering
- Broadcom (Palo Alto, CA)