- NVIDIA (Santa Clara, CA)
- …of experience crafting and operating large scale compute infrastructure. + Experience with AI / HPC job schedulers and orchestrators, such as Slurm, K8s or LSF. ... Applied experience with AI / HPC workflows that use MPI and NCCL. + Proficient in using Linux including Centos/RHEL and/or Ubuntu Linux distributions. A solid… more
- NVIDIA (Santa Clara, CA)
- …, HW, and SW engineering and research teams to define a vision and roadmap for AI / HPC cluster observability. + Architect and lead teams to develop, test, and ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Princip al Data and Observability Architect....vision and roadmap for distributed observability systems for large-scale AI and HPC clusters and workloads and… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior HPC Engineer to join its...the team building many of the largest and fastest AI / HPC systems in the world! NVIDIA is ... customers, partners and internal teams to analyze, define, and implement large-scale AI / HPC projects. These efforts include a combination of networking, system… more
- NVIDIA (Santa Clara, CA)
- …a variety of HPC or EDA workloads. + Solid understanding of cluster configuration managements tools such as Ansible. + Proficiency in Perl for maintaining legacy ... NVIDIA is the leader in AI , machine learning and datacenter acceleration. NVIDIA is...and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI . We are currently ... Senior Software Engineer to lead the development of AI software resiliency for the most powerful AI...GPUs. Your expertise will be crucial in driving down cluster downtime towards zero, ensuring that our AI… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the ... works on multimodal foundation models, large-scale robot learning, embodied AI , and physics simulation. Our past projects include Eureka… more
- NVIDIA (Santa Clara, CA)
- …Python, Rust, Angular, React. Ways to stand out from the crowd: + Experience in HPC and/or AI training. + Knowledge of LLMs and agentic workflows. + Have ... We are now looking for a Senior Software Architect. Do you love to provide...in our journey of building software for most performant AI servers. What you'll be doing: + Research, design… more
- NVIDIA (Santa Clara, CA)
- …telemetries, scale out cluster , test plan development, track record in developing AI tools and NLP, DevOps, CI/CD experience to join our platform SWQA team. What ... We are passionate about markets include gaming, automotive, vision, HPC , datacenters and networking in addition to our traditional...OEM business. NVIDIA is also well positioned as the ' AI Computing Company', and NVIDIA GPUs are the brains… more
- NVIDIA (Santa Clara, CA)
- …directly impact NVIDIA's ability to deliver robust, secure, and high-performing solutions for AI , HPC , and cloud-scale systems. You will: + Define End-to-End ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...world! We are seeking a highly skilled and hard-working Senior Test Architect to join our multifaceted Enterprise Software… more
Recent Jobs
-
Anesthesia Technician (Full time, day shift)
- Penn Medicine (Lancaster, PA)
-
Compliance Coordinator
- Brookfield Properties (Charleston, SC)
-
Quality Auditor & Educator
- AdventHealth (Daytona Beach, FL)