- NVIDIA (Santa Clara, CA)
- …using Ansible and Terraform, ensuring reproducibility and scalability across large-scale, distributed GPU clusters. + Apply SRE principles to diagnose, troubleshoot, ... SRE roles, including 3+ years focused on ML infrastructure or distributed compute systems. + Strong proficiency in Infrastructure-as-Code (IaC) tools, specifically… more
- NVIDIA (Santa Clara, CA)
- …Senior Software Architect for our Observability Platform to architect and implement distributed observability systems for data centers enabling EDA workflows . We ... What We Need to See: + Experience developing large scale, distributed observability systems. + Ability to collaborate with data scientists, researchers,… more
- Amazon (East Palo Alto, CA)
- …mentor, tech lead or leading an engineering team - 4+ years of distributed systems experience, or Bachelor's degree in computer science, engineering, mathematics or ... equivalent - Experience with distributed computing and enterprise-wide systems Proficiency in at least...writing algorithms and creating data structures - Experience with distributed systems at scale - Experience performing live system… more
- MongoDB (San Francisco, CA)
- …in a professional capacity + Experience designing with scalable and highly available distributed systems in the cloud and on-prem + Demonstrated ability to work with ... software. MongoDB's unified database platform-the most widely available, globally distributed database on the market-helps organizations modernize legacy workloads,… more
- Panasonic Avionics Corporation (Irvine, CA)
- …network engineer, or similar role working with OSS/BSS of large-scale distributed systems. + 5+ year's architecture or technical leadership experience with ... Cloud, and Commerce Cloud, and their integration with OSS/BSS systems. + Distributed Systems: Knowledge of distributed systems concepts and architectures,… more
- NVIDIA (Santa Clara, CA)
- …common file formats such as Parquet, ORC and JSON + Collaborate with distributed systems teams to craft solutions to distributed processing problems challenges ... development + Outstanding technical skills in designing and implementing high-quality distributed systems + Excellent programming skills in C++, Java, and/or Scala… more
- Walmart (Sunnyvale, CA)
- …area. + Deep working knowledge of all aspects of Cloud native distributed system development: Azure/ GCP/ WCNP preferred + Experience working with micro-services ... architecture and distributed systems. + Experience managing software development engineers, leaders,...ensure performance and scalability. + Experience collaborating with geographically distributed teams to align on platform goals. + Strong… more
- NVIDIA (Santa Clara, CA)
- …requirements + Deploy new compute farm technologies such as containers, volume cloning, distributed storage and distributed compute at scale + Deploy tracking ... preferred. + Experience with Make based build systems in large, distributed computing environments + Continuous Integration pipeline and/or pre-submit verification… more
- Rubrik (Palo Alto, CA)
- …build upon, including our microservices architecture, Kubernetes deployment system, distributed job workflow engine, database instances(Mysql) and platform cloud ... and innovating on each of our core layers - Kubernetes services, distributed job framework, MySQL and security. **Some challenges include:** **Scalability** : As… more
- LinkedIn (Mountain View, CA)
- …solutions, our AI Infrastructure brings together information retrieval, machine learning, distributed systems, and other fundamental areas of computer science. The ... delightful for our ML Engineers to productively use. To do this, our distributed search platform must scale seamlessly across data and traffic, while enabling… more