- Insulet Corporation (San Diego, CA)
- Position Overview: As a Staff SRE in Site Reliability Engineering (SRE) at Insulet, you will play a critical role in architecting, implementing, and maintaining ... highly available and scalable infrastructure and systems. You will lead a team of SRE engineers, ... the adoption of modern technologies and tools to improve system reliability and efficiency. Develop and maintain automation tools… more
- Genesis AI (San Carlos, CA)
- …in both throughput-oriented cluster environments and latency-critical on-device deployments. System-level mindset with a history of tuning hardware-software ... diffusion-based control loops in robotics. Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and… more
- NLP PEOPLE (Mountain View, CA)
- …Ideally, you've worked in a fast‑paced development environment before. We're looking for software engineers who love making both the teams they work with and the ... Maintain observability dashboards to track model performance, data quality, and system metrics. Champion best practices for robust, reproducible, and debuggable ML… more
- Cascading (San Francisco, CA)
- Why Casca? Casca is building AGI for banking. We're replacing decades-old legacy systems with AI-native technology that automates 90% of the manual work humans once ... initiatives to fine-tune controls and communication channels to improve predictability and system reliability. You will use data to ruthlessly prioritize work that… more
- Menlo Ventures (San Francisco, CA)
- …practical experience. Optional: MS or PhD in databases, distributed systems. Comfortable working towards a multi-year vision with incremental deliverables. ... structures and their real-world use cases. Experience with distributed systems, databases, and big data systems (Apache ... Azure Blob Store. Delta Lake: A storage management system that combines the scale and cost-efficiency of data… more
- Genesis AI (San Carlos, CA)
- …pipeline, and model parallelism. System-level mindset with a track record of tuning hardware-software interactions for maximum utilization ... pipelines to GPU kernels. Design, build, and optimize distributed training systems (PyTorch) for multi-node GPU clusters, ensuring scalability, robustness, and high… more
- Ring Inc (San Francisco, CA)
- About the Company Company Size: Industry: Agentic AI, Automation, Enterprise Software Founding Year: 2023 Stage: Early-Stage Startup Tech Stack: Python, AWS, ... Databricks, and Canva, they're moving fast to turn ambitious ideas into real-world systems. As one of the first backend/infra engineers, you'll design and build the… more
- Plaid Inc (San Francisco, CA)
- …As a member of the Consumer team, you will help design and build the systems that power Plaid's first consumer app. You will collaborate with product, design, and ... a senior technical leader, you will guide architectural decisions, scale backend systems, and shape the long-term technical direction of Plaid's consumer platform.… more
- Genesis AI (San Carlos, CA)
- …petabyte scale. Own core data infrastructure: data model, storage systems, ingestion pipelines, transformation frameworks, and orchestration layers. Standardize data ... committed to building general-purpose Physical AI. What You'll Bring: Excellent software engineering skills (Python, Go, or similar). Extensive experience designing,… more
- Fal (San Francisco, CA)
- …their workloads benefit from our accelerator. Requirements: Strong foundation in systems programming with expertise in identifying and fixing bottlenecks. Deep ... architectures. Ideally following closely the developments in all these systems as they happen. Have a fundamental view of the underlying hardware (Nvidia-based systems at the moment), and when necessary go deeper… more