• Software Engineer, SystemML - Scaling…

    Meta (Menlo Park, CA)
    …leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI /GPU communication stack. ... learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, performance optimizations, or… more
    Meta (11/05/25)
    - Related Jobs
  • Senior Storage Production Engineer - DGX Cloud

    NVIDIA (Santa Clara, CA)
    …expertise in different domains, such as storage architecture, high-performance distributed storage, data management, systems , networking, coding, database ... storage solutions, optimizing data placement and access patterns, managing large-scale distributed storage systems , and ensuring low-latency data access for… more
    NVIDIA (11/12/25)
    - Related Jobs
  • Software Engineer

    Ensono (Los Angeles, CA)
    …role is for **builders who thrive at scale** -engineers who can deliver reliable , high-performance software that runs across mainframe, distributed , and cloud ... EnvisionOS with platforms like **ServiceNow** , Snowflake, and monitoring systems . + **Data & AI Productization** -...applications. + **Scalable System Design** - Architect and implement systems that run reliably across distributed , cloud,… more
    Ensono (10/26/25)
    - Related Jobs
  • Senior Software Engineer, Data Ingestion…

    NVIDIA (Santa Clara, CA)
    …of orchestration, service modeling, API development, monitoring, and automation + Build highly reliable distributed systems that our customers can depend on ... of fast, efficient, and reliable data transfer systems . The goal is to enable NVIDIA AI...this goal, you should have a strong understanding of distributed systems development, object storage, network file… more
    NVIDIA (10/15/25)
    - Related Jobs
  • Data Engineer

    Ensono (Los Angeles, CA)
    …ELT/ETL pipelines that move, clean, and organize data from ServiceNow, mainframe, distributed , and cloud systems . + **Integration with ServiceNow** - Develop ... (Workday, Concur, etc)** . + Familiarity with observability tooling and distributed data systems . + Knowledge of enterprise data governance, compliance,… more
    Ensono (10/26/25)
    - Related Jobs
  • Operations Technology Consultant-Specialist Senior

    Deloitte (San Jose, CA)
    …team of engineers specializing in AI , machine learning, edge computing, and distributed systems . + Develop and architect comprehensive hybrid edge AI ... Qualcomm and NVIDIA) + Proven track record of delivering AI /ML solutions in edge computing, IoT, or distributed...solutions. + 2+ years' experience of edge computing architectures, distributed systems , and real-time data processing. +… more
    Deloitte (11/05/25)
    - Related Jobs
  • Staff Software Engineer, Machine Learning, Google…

    Google (Sunnyvale, CA)
    …Search, AI Overviews, and Agentic Workflows. We bridge the gap between generative AI research and production-grade distributed systems . AI will ... in software development, with a focus on Machine Learning, Distributed Systems , or Backend Engineering. + 5...post-production monitoring. + Experience building, optimizing, and deploying Generative AI systems in production (eg, LLMs, RAG,… more
    Google (11/15/25)
    - Related Jobs
  • Sr. Big Data Engineer - Data Infrastructure…

    TP-Link North America, Inc. (Irvine, CA)
    …with proven experience building and operating large scale data pipelines and distributed systems in production, including terabyte scale big data environments. ... Systems Inc. is a global provider of reliable networking devices and smart home products, consistently ranked...Git version control and collaborative development workflows required. + Distributed systems expertise: Deep knowledge of … more
    TP-Link North America, Inc. (09/18/25)
    - Related Jobs
  • Senior Staff Machine Learning Engineer

    ServiceNow, Inc. (Santa Clara, CA)
    …/ Kubernetes / Prometheus /Splunk/ GitLab CI); + Strong working experience operating distributed systems built on Linux and J2EE; + Experience with ... to today - ServiceNow stands as a global market leader, bringing innovative AI -enhanced technology to over 8,100 customers, including 85% of the Fortune 500(R). Our… more
    ServiceNow, Inc. (09/27/25)
    - Related Jobs
  • Site Reliability Engineer (Senior or Staff),…

    MongoDB (San Francisco, CA)
    …As an SRE on the Fabric team, you will leverage your expertise in networking, distributed systems , and automation to ensure our systems are resilient, ... plays a crucial role in developing and maintaining the reliable and globally connected multi-cloud network that supports MongoDB...6+ years of experience working on software and operating distributed systems , with deep expertise in networking… more
    MongoDB (10/07/25)
    - Related Jobs