- FocusKPI Inc. (Mountain View, CA)
- …Design, develop, and deploy on-device machine learning models optimized for Android, ensuring low latency and minimal resource consumption. + Build robust and ... understanding of mobile performance optimization, including model size, memory usage, and latency . + Experience with signal processing and real-time ML logic +… more
- NVIDIA (Santa Clara, CA)
- …responsible for: + Optimizing CPU, GPU and NIC sub-systems for predictable low - latency and maximum efficiency + Crafting and implementing performance ... performance analysis, characterization and optimization. + Experience with programming latency sensitive, real-time, multi-threaded applications on CPUs and one or… more
- Walmart (Sunnyvale, CA)
- …and South Africa to name a few. **What you'll do:** + Design scalable, low - latency services to host models; productionize prototypes on the cloud, including data ... offline evaluation and real-time execution. + Create monitoring dashboards; perform latency tuning of deep learning models, scaling solutions to enterprise level;… more
- LinkedIn (Mountain View, CA)
- …Manage and enhance DNS resolution and traffic routing policies to achieve high performance, low latency , and optimal path selection for end users. Lead the ... Establish robust incident management and troubleshooting processes to address routing, latency , and availability issues in real time. Establish and track key… more
- TE Connectivity (CA)
- …tightly with high-radix ASIC subsystems to enable ultra-high bandwidth and low - latency data transmission. + Define novel, scalable CPO/NPO architectures ... electrical, optical, and hybrid interconnect solutions, balancing performance, power, latency , cost, and manufacturability. + Collaborate with internal R&D, product… more
- SpaceX (Hawthorne, CA)
- …reliable, real-time software that plans and executes network topology for our low - latency , high-bandwidth satellite-based global network in order to connect ... that space and ground hardware is optimally scheduled to minimize latency , maximize constellation uptime, throughput and reliability. Our software engineers are… more
- JPMorgan Chase (Palo Alto, CA)
- …power mission-critical financial applications. You will build scalable, fault-tolerant, and low - latency platforms that handle millions of transactions per ... for real-time transaction processing, ensuring efficient database interactions and minimal latency . + Develop and maintain distributed data pipelines for handling… more
- NVIDIA (Santa Clara, CA)
- …and storage hierarchies, using the NVIDIA Optimized Transfer Library (NIXL) for low - latency , cost-effective data movement. What you'll be doing: + Collaborate ... worker replicas with relevant KV cache data, minimizing re-computation and latency for sophisticated, multi-step reasoning tasks. + Distributed KV Cache Management:… more
- Amazon (Palo Alto, CA)
- …scaling AI/ML solutions for real-time bidding systems that demand high availability and low latency . You will work closely with applied scientists and engineers ... services to support ML model deployment and inference at sub millisecond latency . - Collaborate with scientists to operationalize machine learning models and… more
- Walmart (Sunnyvale, CA)
- …with at least 5 years managing high‑performing teams that build distributed, low ‑ latency systems. + Proven ownership of petabyte‑scale data platforms or ... Ad‑Serving initiative) to handle millions of requests per minute with sub‑100 ms latency . + Drive Delivery - Translate product vision into executable roadmaps; own… more