- Amazon (Palo Alto, CA)
- …of multi-modal modeling, few-shot learning, retrieval-augmented generation (RAG), or reinforcement learning from human feedback (RLHF). * Experience with online ... above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others,…
- Amazon (Palo Alto, CA)
- …bring deep expertise in quantitative modeling (forecasting, recommender systems, reinforcement learning, causal inferencing or generative artificial intelligence) to ...
- Amazon (San Diego, CA)
- …in gaming. Our highly skilled, multi-discipline team works across Machine Learning, Reinforcement Learning, and Generative AI to reimagine game development. We work ...
- Amazon (Santa Clara, CA)
- …orchestration, Planning, large multimodal models (especially vision-language models), reinforcement learning (RL) and sequential decision making. Basic ...
- Amazon (Palo Alto, CA)
- …this transformation, tackling complex challenges in natural language processing, reinforcement learning, and causal inference. Your pioneering efforts will directly ...
- Amazon (San Francisco, CA)
- …fulfilled. In particular, our work combines large language models (LLMs) with reinforcement learning (RL) to solve reasoning, planning, and world modeling in both ...
- Amazon (Santa Clara, CA)
- …* Continual learning, multi-task/meta learning * Reasoning, interactive learning, reinforcement learning * Robustness, privacy, model watermarking * Model ...
- Amazon (San Francisco, CA)
- …really excited about the work in combining large language models (LLMs) with reinforcement learning (RL) to solve reasoning and planning, learned world models, and ...
- Amazon (East Palo Alto, CA)
- …modern visual-language models and multi-modal AI systems - Experience implementing reinforcement learning for autonomous agent behavior - Experience in professional ...