- Meta (Menlo Park, CA)
- …leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI /GPU communication stack. ... learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, performance optimizations, or… more
- NVIDIA (Santa Clara, CA)
- …expertise in different domains, such as storage architecture, high-performance distributed storage, data management, systems , networking, coding, database ... storage solutions, optimizing data placement and access patterns, managing large-scale distributed storage systems , and ensuring low-latency data access for… more
- Ensono (Los Angeles, CA)
- …role is for **builders who thrive at scale** -engineers who can deliver reliable , high-performance software that runs across mainframe, distributed , and cloud ... EnvisionOS with platforms like **ServiceNow** , Snowflake, and monitoring systems . + **Data & AI Productization** -...applications. + **Scalable System Design** - Architect and implement systems that run reliably across distributed , cloud,… more
- NVIDIA (Santa Clara, CA)
- …of orchestration, service modeling, API development, monitoring, and automation + Build highly reliable distributed systems that our customers can depend on ... of fast, efficient, and reliable data transfer systems . The goal is to enable NVIDIA AI...this goal, you should have a strong understanding of distributed systems development, object storage, network file… more
- Ensono (Los Angeles, CA)
- …ELT/ETL pipelines that move, clean, and organize data from ServiceNow, mainframe, distributed , and cloud systems . + **Integration with ServiceNow** - Develop ... (Workday, Concur, etc)** . + Familiarity with observability tooling and distributed data systems . + Knowledge of enterprise data governance, compliance,… more
- Deloitte (San Jose, CA)
- …team of engineers specializing in AI , machine learning, edge computing, and distributed systems . + Develop and architect comprehensive hybrid edge AI ... Qualcomm and NVIDIA) + Proven track record of delivering AI /ML solutions in edge computing, IoT, or distributed...solutions. + 2+ years' experience of edge computing architectures, distributed systems , and real-time data processing. +… more
- Google (Sunnyvale, CA)
- …Search, AI Overviews, and Agentic Workflows. We bridge the gap between generative AI research and production-grade distributed systems . AI will ... in software development, with a focus on Machine Learning, Distributed Systems , or Backend Engineering. + 5...post-production monitoring. + Experience building, optimizing, and deploying Generative AI systems in production (eg, LLMs, RAG,… more
- TP-Link North America, Inc. (Irvine, CA)
- …with proven experience building and operating large scale data pipelines and distributed systems in production, including terabyte scale big data environments. ... Systems Inc. is a global provider of reliable networking devices and smart home products, consistently ranked...Git version control and collaborative development workflows required. + Distributed systems expertise: Deep knowledge of … more
- ServiceNow, Inc. (Santa Clara, CA)
- …/ Kubernetes / Prometheus /Splunk/ GitLab CI); + Strong working experience operating distributed systems built on Linux and J2EE; + Experience with ... to today - ServiceNow stands as a global market leader, bringing innovative AI -enhanced technology to over 8,100 customers, including 85% of the Fortune 500(R). Our… more
- MongoDB (San Francisco, CA)
- …As an SRE on the Fabric team, you will leverage your expertise in networking, distributed systems , and automation to ensure our systems are resilient, ... plays a crucial role in developing and maintaining the reliable and globally connected multi-cloud network that supports MongoDB...6+ years of experience working on software and operating distributed systems , with deep expertise in networking… more