- Meta (New York, NY)
- …working on high performance computing (HPC) and AI/ML systems, including: GPU /ASIC-based kernel development and optimization (eg CUDA, ROCm), distributed systems for ... and serving, and systems architecture and performance 11. Accelerator ( GPU /ASIC) kernel development and optimization 12. Experience in accelerating libraries… more
- Meta (Albany, NY)
- …standards, especially in the field of 3D models, Graphics APIs and GPU programming 15. 3. Developing high-performance rendering solutions with a modern graphics ... API including Metal and Vulkan 16. 4. Writing high-performance GPU programs with a GPU programming language (shading language) 17. 5. Experience owning a… more
- NVIDIA (NY)
- …learning models to ensure the best performance on current- and next-generation GPU architectures. + Work directly with client ML researchers and developers/engineers ... algorithms, alongside experience performing performance optimizations. + Familiarity with NVIDIA GPU architectures. + GPU Development experience through NVIDIA… more
- Capital One (New York, NY)
- …years of experience in LLM model training, evaluation, inference optimization and parallelization in GPU cluster + At least 5 years of experience working with AWS or ... equivalent GPU Clusters + At least 5 years of experience in PyTorch/Tensorflow Capital One will consider sponsoring a new qualified applicant for employment… more
- IBM (Yorktown Heights, NY)
- …Kubernetes, Kserve, and explores optimizations across the entire stack from GPU networking, model scheduling serving, AI platform optimization including inference ... like vllm, llm-d, and KServe * Research or development experience in GPU networking and large-scale acceleration * Research or development experience in Inference… more
- Capital One (New York, NY)
- …years of experience in LLM model training, evaluation, inference optimization and parallelization in GPU cluster + At least 3 years of experience working with AWS or ... equivalent GPU Clusters + At least 5 years of experience in PyTorch/Tensorflow Capital One will consider sponsoring a new qualified applicant for employment… more
- General Motors (Albany, NY)
- …modern C++ or Python **Bonus:** + Experience with profiling CPU and/or GPU software, process scheduling, and prioritization + Passionate about self-driving car ... in modern C++ or Python + Experience with profiling CPU and/or GPU software, process scheduling, and prioritization + Passionate about self-driving car technology… more
- Teradata (Albany, NY)
- …with AI infrastructure - including distributed training, inference optimization, GPU /accelerator integration, and workload orchestration. + Experience defining APIs, ... ensuring cost-efficiency and performance at scale. + Hands-on knowledge of GPU , TPU, and emerging accelerator technologies for training, inference, and fine-tuning… more
- Guardian Life (New York, NY)
- …latency/cost) with automated regression tests. + Optimize compute and inference ( GPU /distributed, caching/batching, model routing) for cost and performance. + Embed ... and proficiency in data/ML engineering patterns (eg, Spark or Ray) and ** GPU /distributed computing** . + **MLOps proficiency** : CI/CD for models and prompts,… more
- Deloitte (New York, NY)
- …controls throughout the AI/ML lifecycle (data handling, training with GPU isolation, deployment, monitoring, versioning, provenance). Integrate SAST/DAST for ML ... Vault, Azure ML security). + Securing MLOps/LLMOps pipelines (data versioning, provenance, GPU isolation). + Security frameworks (OWASP AI Security & Privacy Guide).… more