- Oracle (Seattle, WA)
- …Proven experience designing, implementing, and managing infrastructure for AI/ML or HPC workloads. + Understanding machine learning frameworks and libraries such as ... TensorFlow, PyTorch, or sci-kit-learn and their deployment in production environments is a plus. + Familiarity with DevOps practices and tools for continuous integration, deployment, and monitoring (eg, Jenkins, GitLab CI/CD, Prometheus). + Strong experience… more
- Oracle (Olympia, WA)
- …and DHCP + Experience in GPU/RDMA network environments, High Performance Compute ( HPC ), or InfiniBand technologies + Experience with network monitoring and telemetry ... solutions, network configuration management, linux systems administration + Experience leading security-related technical troubleshooting calls and performing post-mortem analysis Disclaimer: **Certain US customer or client-facing roles may be required to… more
- Oracle (Seattle, WA)
- …automation, and diagnostic services. These are essential for running distributed AI/ML/ HPC workloads across thousands of GPUs, leveraging technologies like RoCE and ... Infiniband. **Why Join Us?** + Innovative Projects: Build groundbreaking solutions for our customers from the ground up. + Exciting Times: Be part of a young, fast-growing team working on ambitious new initiatives. + Dynamic Environment: Collaborate in a… more
- Oracle (Olympia, WA)
- …the ability to engage credibly with CIOs, Research IT, campus HPC leaders, and Principal Investigators, connecting OCI's capabilities to real-world challenges ... such as scaling AI research, augmenting constrained on-prem clusters, and enabling secure, compliant, and cost-aligned AI adoption across campus. Prior OCI experience is not required, but a strong understanding of research computing, AI workload patterns, data… more
- Oracle (Seattle, WA)
- …Experience in a customer facing role in a tech company. Experience with AI and HPC end customers is a big plus. Manage the development and implementation process of ... a specific company product involving departmental or cross-functional teams focused on the delivery of new or existing products. Plan and direct schedules and monitor budget/spending. Monitor the project from initiation through delivery. Organize the… more
- Oracle (Seattle, WA)
- …(eg, NCCL, Horovod, DeepSpeed) Experience supporting or operating large-scale HPC , AI, or GPU-accelerated clusters in production environments Excellent ... problem-solving skills, with the ability to troubleshoot complex issues and drive resolution in a fast-paced environment Written and verbal communication skills with the ability to present complex information clearly to all audiences Strong documentation… more
- Oracle (Olympia, WA)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/ HPC workloads. This is your chance to be part of the AI ... revolution, working with systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage… more
- Oracle (Olympia, WA)
- …and telemetry pipelines in large-scale environments. + Knowledge of hyperscale networking, HPC , or GPU infrastructure. + Expertise in designing data feedback systems ... that improve AI model performance through continuous learning. + Demonstrated ability to influence technical direction across teams and lead complex cross-functional projects. Disclaimer: **Certain US customer or client-facing roles may be required to comply… more
- Oracle (Olympia, WA)
- …dynamic environment. + You have a deep understanding of Cloud-Based Solutions (IaaS), HPC and GPU Infrastructure, AI Architectures such as RAG, Agentic AI systems, ... and Knowledge Graphs. + You have direct experience with ML Ops, Model Hosting, LLM orchestration, and open and closed models. + You are confident in your knowledge of the enterprise AI market and can draw upon that knowledge to have in-depth conversations with… more
- Oracle (Olympia, WA)
- …and building systems. + Define design envelopes for density growth ( HPC /AI), modularity, and phased expansion; establish redundancy and maintainability strategies ... (N+1/2N). + Create and maintain design guides, specifications, and acceptance criteria aligned with global codes and best practices. + Integrated design and delivery + Lead multidisciplinary design from concept through commissioning (L1-L5). + Validate system… more