- DoorDash (San Francisco, CA)
- …+ High Impact: Contribute to powering multiple business lines with high-quality, low - latency data directly integrated into online systems, driving billions in ... revenue. + Cutting-Edge Technology: Work with advanced open-source technologies such as Apache Spark, Flink, Kafka, Airflow, Delta Lake, and Iceberg. + Scalability: Play a crucial role in evolving our systems to accommodate a 10x scale increase, supporting… more
- Walmart (Sunnyvale, CA)
- …mentor top technical talent. Experience in designing and developing large-scale, low - latency distributed systems Deep understanding of Web technologies. ... Experience in e Commerce domain highly desirable **About Walmart Global Tech** Imagine working in an environment where one line of code can make life easier for hundreds of millions of people. Thats what we do at Walmart Global Tech. Were a team of software… more
- NVIDIA (Santa Clara, CA)
- …developing driver, protocols and application to deliver high efficiency and lowest latency with low CPU utilization! Linux/Android/Windows is your calling then ... you have reached right place! If Wi-Fi and BT excites you then this is the best team to join. Come and take a significant part in architecting, crafting, developing and verifying innovating solutions. If you enjoy working in a relevant, growing and highly… more
- Google (Mountain View, CA)
- …DSP, etc. + Experience with testing, profiling, benchmarking, and optimizing code for latency . + Experience developing for low compute or power constrained ... Android devices. + Experience with C/C++ build systems and tooling. + Passion for productionizing the computational photography technologies. Google's software engineers develop the next-generation technologies that change how billions of users connect,… more
- Cisco (San Jose, CA)
- …computing, and real-time simulation. You will be responsible for developing low -level components that bridge user space and kernel space, optimizing memory ... systems efficiently utilize GPU hardware to its full potential, minimizing latency , maximizing throughput, and improving developer experience at scale. This role… more
- Meta (Sunnyvale, CA)
- …models on hardware to achieve the best performance given various real time latency and power constraints 4. Goal setting related to project impact, AI algorithms, ... more advanced model optimization techniques, including quantization, pruning, distillation, Low -Rank Adaptation (LoRA), Parameter-Efficient Fine-Tuning (PEFT) etc, for cloud… more
- Meta (Sunnyvale, CA)
- …workload analysis for performance across the workloads of interest 7. Drive IP latency hiding features and Quality of Service (QoS) recommendations for each compute ... subsystem architecture 17. 2+ years of experience with power concepts and low power design principles 18. Expertise collaborating and communicating effectively in a… more
- quadric.io, Inc (Burlingame, CA)
- …that is focused on model optimization, will research, prototype, and validate low ‑precision techniques that make neural networks leaner and faster on the Chimera ... + Build calibration datasets; develop Python notebooks/dashboards to track accuracy, latency , power, and memory trade‑offs. + Perform layer‑ and token‑level error… more
- Amazon (Cupertino, CA)
- Description Amazon Web Services (AWS) provides a highly reliable, scalable, and low -cost cloud platform that powers thousands of businesses in over 190 countries. ... We are steadily expanding global infrastructure to help our customers achieve lower latency and higher throughput. As our customers grow their businesses, AWS will… more
- HP Inc. (Palo Alto, CA)
- …Do** + Research and implement model compression techniques including quantization, low -rank factorization, distillation, and pruning + Develop methods to deploy SOTA ... constraints + Lead investigations into hardware-aware training strategies to optimize latency , throughput, and memory usage + Collaborate with software engineers and… more