• Principal Machine Learning Engineer

    General Motors (Sunnyvale, CA)
    …generation of AI-driven driving systems. We're tackling challenges across distributed training, training efficiency, DDP/FSDP, data processing pipelines, and Pytorch ... seamless workflows from research to production. + Drive efficiency in large-scale distributed training and data processing pipelines. + Establish best practices for… more
    General Motors (10/03/25)
    - Related Jobs
  • Software Development Snr Manager

    Oracle (Montpelier, VT)
    **Job Description** Are you interested in building large-scale distributed infrastructure for the cloud? Oracle's Cloud Infrastructure team is building new services ... area that operate at high scale in a broadly distributed multi-tenant cloud environment. Our customers run their businesses...lead and build a team responsible for building in distributed systems and highly available services. If this is… more
    Oracle (01/01/26)
    - Related Jobs
  • Principal Member of Technical Staff-Software…

    Oracle (Atlanta, GA)
    …operate a suite of massive scale, integrated cloud services in a broadly distributed , multi-tenant cloud environment. OCI is committed to providing the best in cloud ... engineers with the expertise and passion to solve difficult problems in distributed highly available services. At every level, our engineers have a significant… more
    Oracle (01/01/26)
    - Related Jobs
  • Principal Site Reliability DevOps Engineer…

    Oracle (Washington, DC)
    …including architecture, provisioning, configuration, deployment, and support Partner with the distributed team in prototyping new platform services Stay informed of ... services Develop designs, architectures, standards, and methods for large-scale distributed systems Facilitate service capacity planning and demand forecasting,… more
    Oracle (01/01/26)
    - Related Jobs
  • Senior Software Engineer, Cloud Performance

    Oracle (Washington, DC)
    …Java runtime, SDKs, and images). Qualifications: + 4 to 5 years distributed service engineering experience in a software development environment + Development ... + Good knowledge of data structures, algorithms, operating systems, and distributed systems fundamentals. + Working familiarity with networking protocols (TCP/IP,… more
    Oracle (01/01/26)
    - Related Jobs
  • Senior Software Engineer, Atlas Clusters Security

    MongoDB (New York, NY)
    …growing product. Atlas allows users to deploy fault-tolerant, secure, globally distributed MongoDB clusters in just minutes. This includes developing software to ... years of professional software development experience + Is skilled at writing large-scale, distributed backend systems in a compiled language (Java, C#, Go, etc.) +… more
    MongoDB (01/01/26)
    - Related Jobs
  • Senior Staff Software Engineer, Site Reliability…

    Google (Sunnyvale, CA)
    …+ 3 years of experience in designing, analyzing, and troubleshooting distributed systems. + Experience with Cloud compute platforms (Kubernetes, Cloud Functions). ... software and systems engineering to build and run large-scale, massively distributed , fault-tolerant systems. SRE ensures that Google Cloud's services-both our… more
    Google (01/01/26)
    - Related Jobs
  • Software Development Engineer - AI/ML, AWS Neuron,…

    Amazon (Seattle, WA)
    …to work at the intersection of machine learning, high-performance computing, and distributed architectures, where you'll help shape the future of AI acceleration ... compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trainium and Inferentia. Experience optimizing inference… more
    Amazon (12/31/25)
    - Related Jobs
  • Software Development Manager, Amazon Leo Identity…

    Amazon (Arlington, VA)
    …software development lifecycle experience - 1+ years of developing large-scale, multi-tiered distributed software systems using Java, C#, or C++ experience - 1+ ... years of developing large-scale, multi-tiered distributed software systems using service-oriented architecture experience - 1+ years of developing large-scale,… more
    Amazon (12/31/25)
    - Related Jobs
  • Software Engineer II

    Microsoft Corporation (Redmond, WA)
    …for large customers, mine insights from telemetry and behavior of large distributed systems, learn and contribute to design of service software stack, datacenter ... Experience and understanding in building highly available, highly scalable, reliable, distributed systems + Knowledge of building a secure service and understanding… more
    Microsoft Corporation (12/30/25)
    - Related Jobs