-
Senior HPC Support Engineer, InfiniBand - NVLink
- NVIDIA (Seattle, WA)
-
We are seeking a motivated Senior HPC Technical Support Engineer - AI Infrastructure focusing on InfiniBand, NVLink and AI GPU Cluster technology, passionate about data center and networking technologies, to provide comprehensive solutions for sophisticated installations, maintenance, or operations for a broad scope of groundbreaking networking products. As a primary point of contact for our customers; assisting them with technical questions, debugging and resolving their issues. As a member of our Technical Support team, you are a conscientious, proficient communicator who is fundamentally interested in taking ownership in resolving issues, while ensuring a high level of customer satisfaction is maintained and delivered. Significant part of the role is also to interact with Engineering, Marketing, and Support teams regularly on technical issues
What you will be doing:
+ Ability to resolve sophisticated customer concerns and technical issues through meticulous research, reproduction, and solving problems for customers installing our products and supporting systems using Linux Operating Systems (Multi-distro), with the focus on NVIDIA InfiniBand, NVLink and GPU Technology and our End-to-End Solutions
+ Responding to customer product support inquiries via telephone, email, or conference calls
+ Resolving customer issues during installation, operation, maintenance or product application or interoperability with other vendors
+ Participate in multi-functional team meetings and giving feedback to engineering and marketing regarding product requirements, customer experience, support tools, etc.
+ Being a technical resource, develop, re-define and document standard methodologies to share with internal teams (Support/R&D) for support processes and improvements
+ Site visits and conference calls with customers
What we need to see:
+ 5+ years in providing in-depth Customer Support and debugging for hardware and software products.
+ Exceptional interpersonal skills with the ability to maintain and own the overall resolution for any critical issue raised by our customers, under all circumstances.
+ Linux OS including System Administration and Networking on a LFCS/RHCSA level
+ Networking Technology, protocols and routing including IP, L2 and L3 on a CCNP/CompTIA Networking+ and Cloud+ level
+ Containerized solutions experience on a level of DCA and/or CKA, Virtualization and (KVM/ESXi) and Cloud Infrastructure (AWS/OCI) Technologies
+ Able to debug networking protocols using tools such as TCPDUMP and Wireshark or similar packet generation and analysis tools
+ Bash/Python scripting abilities
+ Strong organizational skills and able to prioritize/multi-task easily with limited supervision
+ Integrating AI tools (Cursor, Gemini, ChatGPT, Copilot, Glean, etc.) into your daily workflow.
+ Four-year degree from an accredited University, College, or equivalent experience in Computer Science, or Electrical or Computer Engineering
Ways to stand out from the crowd:
+ NVIDIA Certifications related to AI Infrastructure, Operations and Networking
+ InfiniBand, RDMA, NVLink and NVIDIA GPU Technology
+ Clustering or HPC Data-Center technologies including Upper Layer Protocols (i.e., MPI, NCCL)
+ Additional Operating Systems such as Microsoft Windows, VMware, Unix
+ Configuration and operational expertise with traditional network switch/router and Open platforms
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 108,000 USD - 172,500 USD for Level 3, and 120,000 USD - 201,250 USD for Level 4.
You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) .
Applications for this job will be accepted at least until September 12, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
-
Recent Searches
- Analyst Influencer Relations Manager (United States)
- Remittance Processing Specialist II (United States)
Recent Jobs
-
Senior HPC Support Engineer, InfiniBand - NVLink
- NVIDIA (Seattle, WA)
-
Senior Quality Assurance Automation Engineer
- City of New York (New York, NY)
-
Training and Technical Assistant Program Manager
- Morehouse School Of Medicine (Atlanta, GA)
-
Licensed Practice Manager- Mt Sinai Hospital Outpt Physical Rehabilitation Dept- FT- Days
- Mount Sinai Health System (New York, NY)