- Amazon (Seattle, WA)
- …(AGI) team is looking for a passionate, talented, and inventive Senior Research Engineer (RE) with a strong hands-on machine learning background, to lead the ... fundamentals of Computer Science, and practical experience building large-scale distributed systems. This person has thrived and succeeded in delivering high quality…
- Oracle (Olympia, WA)
- …+ Strong technical knowledge in cloud networking, high performance computing, and GPU systems. \#LI-KR4 Oracle is an Equal Employment Opportunity Employer. ... needs. Career Level - IC4 **Responsibilities** As a Principal Network Reliability Engineer, you will be responsible for helping design, build, test, deploy and…
- Oracle (Olympia, WA)
- …and our customers to build and deploy AI at scale. We are looking for a **Senior Software Engineer** to join our growing team and help shape the future of AI ... support the end-to-end lifecycle of AI and machine learning workloads. From GPU infrastructure and training pipelines to model serving and deployment tools, we…
- Oracle (Seattle, WA)
- …tuning performance on distributed systems. + Familiarity with elements of the AI/HPC software stack such as job schedulers (e.g., Slurm); NCCL, RCCL, or MPI; or ML ... AI Infrastructure is at the forefront of building cutting-edge GPU supercomputers that scale to tens of thousands of...validation, including on very novel and not fully understood systems. + Document new tools and procedures to a…
- Amazon (Seattle, WA)
- …powering solutions like Generative AI. Key job responsibilities As an ML Compiler Engineer II on the Neuron Compiler Automated Reasoning Group, you will develop and ... what is known, to best deliver our customers. Strong software development skills using C++/Python are critical to this...(design patterns, reliability and scaling) of new and existing systems experience - 2+ years of experience in developing…
- Microsoft Corporation (Redmond, WA)
- …is of paramount importance. To achieve this goal, the **Cloud Hardware Systems Engineering (CHSE)** team is instrumental in defining and delivering operational ... optimize the Cloud infrastructure. We are looking for a **Principal Hardware Engineer** to join the team. \#SCHIE #azurehwjobs #CHSE **Responsibilities** + Lead…
- Oracle (Olympia, WA)
- …largest AI and HPC customers. These fabrics are the foundation underneath OCI's AI, GPU and HPC services, and support major tier-0 vendors in the generative AI ... the RDMA network underneath your workload. A Principal Network Engineer on our team supports the design, deployment, and...on operation and support of RDMA/RoCE network fabrics and systems, through a combination of a deep network understanding…
- Oracle (Olympia, WA)
- …and our customers to build and deploy AI at scale. We are looking for a Principal Software Engineer to join our growing team and help shape the future of AI ... the future of enterprise AI. **Responsibilities** As a Principal Software Engineer on the team, you will...OCI + Design and build distributed, scalable, fault tolerant software systems + Participate in the entire…
- Oracle (Olympia, WA)
- …embedded software for scalable, high performance, elegantly-designed, and leading-edge GPU and x86 systems. We design and integrate state-of-the-art, secure, ... utilities, and connectors to manage, monitor and configure Oracle's GPU & x86 servers used in Oracle Cloud. All...driver development, and device tree work. - System and software integration: Work with systems, hardware, architecture…
- Oracle (Olympia, WA)
- …and our customers to build and deploy AI at scale. We are looking for a Senior Software Engineer to join our growing team and help shape the future of AI ... end-to-end lifecycle of AI and machine learning workloads. From GPU infrastructure and training pipelines to model serving and...OCI + Design and build distributed, scalable, fault tolerant software systems + Participate in the entire…