- Oracle (Albany, NY)
- …systems engineers, and DevOps teams to design and implement robust, high- performance solutions that scale across large, distributed systems. Responsibilities + ... system, and software innovations to significantly enhance AI training and inference performance and efficiency + Guide strategic decisions around Oracle Cloud's AI… more
- BlackRock (New York, NY)
- …of the platform. + Troubleshoot and resolve issues related to platform performance and reliability. + Refine business and functional requirements and translate them ... Knowledge of distributed data processing frameworks (Spark, Dask). + Understanding of GPU orchestration and optimization in Kubernetes. + Familiarity with MLOps and… more
- Oracle (Albany, NY)
- …remediation (CVE), and hardening. + Collaborate with platform, BIOS/EDKII, and GPU teams to deliver cohesive platform management; provide rapid triage for ... contributions where appropriate. + Support sustaining activities: defect resolution, performance /footprint optimizations, diagnostics, telemetry, and field reliability improvements. What… more
- Oracle (Albany, NY)
- …of new optics and transceivers solutions that help connect and drive the GPU , compute, metro and backbone. They write the test and qualification requirements and ... covering numerous areas of network hardware engineering including cabling, low level device performance , thermals, etc. As OCI is a cloud-based network with a global… more
- Oracle (Albany, NY)
- …capabilities for OCI GPU Servers. We are seeking a talented Principal Software Engineer with deep experience in GPU and Networking technologies to help build ... You'll develop secure provisioning and management solutions for OCI GPU -accelerated servers, ensuring robust performance , reliability, and security… more
- Oracle (Albany, NY)
- …for the most demanding enterprise workloads. We are focused on delivering high- performance computing, storage, networking, and platform services at global scale. The ... end-to-end lifecycle of AI and machine learning workloads. From GPU infrastructure and training pipelines to model serving and...at scale. We are looking for a **Senior Software Engineer ** to join our growing team and help shape… more
- Oracle (Albany, NY)
- …We are looking for a highly skilled and motivated distributed systems engineer who can architect solutions to scale and optimize Observability solutions for ... AI infrastructure components like GPU control plane and GPU data plane...AI infrastructure to deliver exceptional customer experience and peak performance . **Responsibilities** **Responsibilities** + Architect solutions to scale and… more
- Oracle (Albany, NY)
- …for the most demanding enterprise workloads. We are focused on delivering high- performance computing, storage, networking, and platform services at global scale. The ... end-to-end lifecycle of AI and machine learning workloads. From GPU infrastructure and training pipelines to model serving and...at scale. We are looking for a Senior Software Engineer to join our growing team and help shape… more
- NVIDIA (NY)
- …and optimization of machine learning/deep learning models to ensure the best performance on current- and next-generation GPU architectures. + Work directly ... software design, programming techniques, and algorithms, alongside experience performing performance optimizations. + Familiarity with NVIDIA GPU architectures.… more
- Oracle (Albany, NY)
- …for the most demanding enterprise workloads. We are focused on delivering high- performance computing, storage, networking, and platform services at global scale. The ... end-to-end lifecycle of AI and machine learning workloads. From GPU infrastructure and training pipelines to model serving and...at scale. We are looking for a Principal Software Engineer to join our growing team and help shape… more