- Oracle (Boston, MA)
- …Innovation team is pioneering the creation of next-generation AI/HPC networking for GPU superclusters at massive scale. Our mission is to design and deliver ... changes required across (Kernel, NIC, switch, transport, protocol, storage, GPU comms) + Develop production-grade, high-performance software features with rigorous…
- Deloitte (Boston, MA)
- …Deep knowledge of AI/ML workload characteristics (training vs. inference), GPU/TPU architectures, high-performance interconnects (InfiniBand, NVLink), and ... AI Datacenter & Infrastructure Senior Manager - AI & Engineering Join our AI...architecting, implementing, and optimizing enterprise-scale Hybrid Infrastructure Solutions (i.e., GPU platforms, server hardware, AI Transformation) + 7+ years…
- Oracle (Boston, MA)
- …AI/ML/HPC workloads across thousands of GPUs, leveraging technologies like RoCE and InfiniBand. We're excited to meet a talented Senior Software Engineer ... AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to…
- Oracle (Boston, MA)
- …AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to ... is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automation, and diagnostic services. These…
- Red Hat (Boston, MA)
- …on GitHub. As a Senior Machine Learning Engineer focused ... experience in writing high performance code for GPUs and deep knowledge of GPU hardware + Strong understanding of computer architecture, parallel processing, and…
- Oracle (Boston, MA)
- …GPU/RDMA network environments, High Performance Compute (HPC), or InfiniBand technologies + Experience with network monitoring and telemetry solutions, network ... The OCI Core Services organization is looking for a Senior Principal Security Engineer that will be the technical...cloud platform. You will clearly communicate your ideas to senior executive leadership. Your vision will help shape the…
- Red Hat (Boston, MA)
- …+ Working knowledge of high-performance networking protocols and technologies including UCX, RoCE, InfiniBand, and RDMA is a plus. + Experience with GPU ... performance benchmarking and profiling tools like NVIDIA Nsight or distributed tracing libraries/techniques like OpenTelemetry is a plus. + Excellent communication skills, capable of interacting effectively with both technical and non-technical team members. +…
- Oracle (Boston, MA)
- …Java, GoLang, C#, C++, Python, etc. + Solid understanding of networking (TCP/IP, InfiniBand, RoCE, etc.) + Experience with operating system and system software + ... Knowledge of CPU, GPU architectures, memory coherence and consistency models + Hands-on coding - Ability to write efficient, production-quality code and debug complex…