- Amazon (Palo Alto, CA)
- …You will liaise with internal Amazon partners and work on bringing state-of-the-art LLM /GenAI models to production. You will stay abreast of the latest developments ... in the field of GenAI and identify opportunities to improve the efficiency and productivity of the team. You will define a long-term science vision for our advertising business, driven by our customer's needs, and translate it into actionable plans for our… more
- Amgen (Thousand Oaks, CA)
- …The responsibilities of this position will be partially dependent on the skillset of the successful candidate and may include one or more of the following: + Advising ... **Join Amgen's Mission of Serving Patients** At Amgen, if you feel like you're part of something bigger, it's because you are. Our shared mission-to serve patients living with serious illnesses-drives all that we do. Since 1980, we've helped pioneer the world… more
- Amazon (Cupertino, CA)
- …responsible for development, enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models like the Llama ... family, DeepSeek and beyond, as well as stable diffusion, vision transformers and many more. The Inference Model Enablement team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference solutions with… more
- NVIDIA (Santa Clara, CA)
- …on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT- LLM , TRT Model Optimizer) can maintain and increase its leadership in the ... market. What we need to see: + Masters, PhD, or equivalent experience in Computer Science, AI, Applied Math, or related field. + 8+ years of relevant work or research experience in Deep Learning. + Excellent software design skills, including debugging,… more
- Google (Mountain View, CA)
- …(eg, fine-tuning, RLHF, prompting, agent development). + Experience deploying backend LLM applications. + Proficiency in Python and deep experience with ML/Deep ... Learning frameworks (eg, TensorFlow, PyTorch, JAX, HuggingFace, LangChain). **It'd be great if you also had these:** + Experience with graph-based machine learning techniques. + Meaningful experience or a strong understanding of applying AI/ML in another… more
- NVIDIA (Santa Clara, CA)
- …with Inference deployment and optimization software (ex. vLLM, SGLang, FlashInfer, TensorRT- LLM , Triton, Dynamo, TorchAO, etc.) + Demonstrable knowledge of GenAI or ... machine learning concepts, particularly around performance optimization, and software development and delivery + BS or MS degree in Computer Science, Computer Engineering, or similar experience (or equivalent experience) + 5+ years of technical product… more
- Amazon (Cupertino, CA)
- …simulation tools - Experience is TensorFlow, PyTorch, and/or JAX - Experience in LLM , Vision or other deep-learning models Amazon is an equal opportunity employer ... and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff;… more
- Deloitte (Sacramento, CA)
- …on-prem and cloud deployment, high performance computing, automation, DevOps, LLM /MLOps, data engineering while streamlining IT and infrastructure. Key ... responsibilities: + Work with clients to design, develop, and deploy new architectures to support machine learning & automation applications + Leverage advanced technical skills in modern data architecture, data science engineering, data transformation, and… more
- Walmart (Sunnyvale, CA)
- …the development of deep learning, reinforcement learning, multi-task learning, and LLM -based techniques to improve search and personalization performance. + **Lead ... Model Architecture Discussions** : Define and drive end-to-end ML architecture decisions, ensuring models are scalable, efficient, and aligned with business needs. Collaborate with ML engineers and platform teams to design architectures that integrate… more
- NVIDIA (Santa Clara, CA)
- …stand out from the crowd: + Proven research track record + Experience in LLM inference, AI network and storage needs. + Background in storage and storage ... optimization: file systems, object store, caches, coherency. + Stellar communication skills. Widely considered to be one of the technology world's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As… more