- NVIDIA (Santa Clara, CA)
- …and tune RDMA, NVMe-over-Fabrics, RoCE, InfiniBand, and Ethernet-based fabrics for maximum performance . + Partner with GPU , networking, and systems teams to ... Joining our team as a Storage & Networking Product Engineer involves being part of a group that fosters...a group that fosters the development of highly available, high- performance infrastructure. This role is vital for the flawless… more
- Meta (Menlo Park, CA)
- …& serve new DL/ML model architectures, combined with auto-tuned high performance for production environments across specialized hardware architectures. The compiler ... DL graph optimizations, and kernel authoring for specific hardware, directly impacts performance and deployment velocity of both AI training and inference platforms… more
- Walmart (Sunnyvale, CA)
- …recovery strategies. + Exposure to AI/ML workloads on Scale-Out storage and performance optimization for GPU clusters. + Familiarity with hardware accelerators ... **Position Summary ** We are seeking a highly skilled Principal Engineer (Ceph/Scale-Out Storage) with 10years+ of deep technical experience in distributed storage… more
- General Motors (Sunnyvale, CA)
- **Job Description** **Senior AI/ML Tooling Engineer ** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize distillation, ... develop and enhance GM's internal ML tooling for high performance software by leveraging state of the art tools...+ Experience developing and deploying machine learning models + GPU programming (CUDA) and familiarity with ML SW stack… more
- Meta (Sunnyvale, CA)
- …like PCIe, RoCE, Ethernet, DDR, HBM 17. Experience with micro-architectural performance verification 18. Experience verifying GPU /CPU designs 19. Experience ... **Summary:** Meta is hiring ASIC Design Verification Engineer within the Infrastructure organization. We are looking...Chip (SoC) for data center applications.As a Design Verification Engineer , you will be part of a agile team… more
- SanDisk (Milpitas, CA)
- …our industry-leading portfolio of products that are recognized globally for innovation, performance and quality. Sandisk has two facilities recognized by the World ... and collaborates effectively across cross-functional teams. As a **System Product Engineer ** within the **Advanced Product Development** team, you will play a… more
- NVIDIA (Santa Clara, CA)
- …model training and finetuning with mixed precision recipes and next-gen NVIDIA GPU architectures. + Performance tuning and optimizations of deep learning ... including pretraining, alignment, customization, evaluation, deployment and tooling to optimize performance and user experience. In this critical role, you will… more
- NVIDIA (Santa Clara, CA)
- …pretraining, reasoning, alignment, customization, evaluation, deployment and tooling to optimize performance and user experience. In this critical role, you will ... paradigms, model optimizations, defining robust APIs, meticulously analyzing and tuning performance , and expanding our toolkits and libraries to be more… more
- NVIDIA (Santa Clara, CA)
- …pretraining, reasoning, alignment, customization, evaluation, deployment and tooling to optimize performance and user experience. In this critical role, you will ... paradigms, model optimizations, defining robust APIs, meticulously analyzing and tuning performance , and expanding our toolkits and libraries to be more… more
- General Dynamics Information Technology (San Diego, CA)
- …Required:** Yes **Job Description:** Transform data into decisive advantage as a **AI/ML Engineer ** with GDIT. A career in applied machine learning at GDIT means ... our differentiator. Our work depends on a Senior AI/ML Engineer who can design, deploy, and sustain models at...secure data pipelines (ETL/ELT) from CANES telemetry, logs, and performance counters; develop features for time-series, graph, and NLP… more