• Staff Cloud Engineer - Architecture

    TP-Link North America, Inc. (Irvine, CA)
    …balancing performance, scalability, and budget constraints. + Possess experience in distributed systems and architectural design, building highly available, ... ABOUT US: Headquartered in the United States, TP-Link Systems Inc. is a global provider of reliable...resilient, and distributed architectures that ensure system stability under large-scale, high-concurrency… more
    TP-Link North America, Inc. (10/16/25)
    - Related Jobs
  • Principal Staff Software Engineer, Service…

    LinkedIn (Mountain View, CA)
    …practical experience. + 7+ years of industry experience in software design, distributed systems , or infrastructure engineering. + 7+ years of experience ... years in an architect or technical leadership role. + Experience with distributed systems , networking, or inter-service communication protocols (eg, gRPC,… more
    LinkedIn (10/08/25)
    - Related Jobs
  • Site Reliability Engineer (Senior or Staff),…

    MongoDB (San Francisco, CA)
    …As an SRE on the Fabric team, you will leverage your expertise in networking, distributed systems , and automation to ensure our systems are resilient, ... + Have 6+ years of experience working on software and operating distributed systems , with deep expertise in networking fundamentals and a good understanding of… more
    MongoDB (10/07/25)
    - Related Jobs
  • Sr. Manager, Applied Science, Deep Science…

    Amazon (Santa Clara, CA)
    …world class scientists to work on foundation models, large-scale representation learning, and distributed learning methods and systems . At AWS Deep Science for ... Description AWS Deep Science for Systems & Services is looking for a Sr....efficient model architecture, training objective and curriculum design - Distributed training, accelerated optimization methods - Continual learning, multi-task/meta… more
    Amazon (12/24/25)
    - Related Jobs
  • Site Reliability Manager, Site Reliability…

    Google (Mountain View, CA)
    …topologies and hardware, SDN). + 3 years of experience developing infrastructure, distributed systems /networks. + 2 years of experience with distributed ... and systems engineering to build and run large-scale, massively distributed , fault-tolerant systems . SRE ensures that Google's services-both our internally… more
    Google (12/23/25)
    - Related Jobs
  • Senior, Software Engineer

    Walmart (Sunnyvale, CA)
    …and testing. + Implement and maintain secure, high-performance, and fault-tolerant distributed systems . + Develop reusable frameworks and components for ... while following engineering best practices. + Debug and troubleshoot issues in distributed systems across environments. + Contribute to performance optimization,… more
    Walmart (11/13/25)
    - Related Jobs
  • Senior Research Scientist, Post-Training LLM…

    NVIDIA (Santa Clara, CA)
    …LLMs with novel algorithmic/data pipelines + Experience developing andscaling large distributed systems for deep learning. + Contributions to open-source ... , or related areas. + 2+ years of experiences in machine learning, systems , distributed computing, or large-scale model training. + Proficiency in Python… more
    NVIDIA (11/05/25)
    - Related Jobs
  • Battery Storage Commissioning Engineer

    Generac Power Systems (Corona, CA)
    …AC/DC electrical systems + Background with battery energy storage systems , microgrids, distributed energy resources, power conversion systems , ... electrical and control system engineering expertise to battery energy storage systems (BESS) product deployment projects by interfacing with cross functional teams… more
    Generac Power Systems (10/04/25)
    - Related Jobs
  • Senior Performance and Development Engineer

    NVIDIA (Santa Clara, CA)
    …in distributed environments. + Strong background in parallel programming and distributed systems + Experience analyzing and optimizing large scale ... of training applications using PyTorch or similar framework + Building distributed software applications using collective communication libraries such as MPI or… more
    NVIDIA (11/01/25)
    - Related Jobs
  • Senior Director, Platform Operations, GDC

    Google (Sunnyvale, CA)
    …+ Experience building or operating large-scale infrastructure platforms and distributed systems , with technical knowledge of public/private cloud, ... + Ability to resolve deep systemic operational problems. **About the job** Google Distributed Cloud (GDC) is a cloud-centric platform that enables enterprises to run… more
    Google (12/30/25)
    - Related Jobs