- NVIDIA (Santa Clara, CA)
- …workloads in a batch computing environment and a deep understanding of distributed system principles. + Strong programming and debugging skills with C/C++, Python, ... and Perl on UNIX. + A passion for improving engineering productivity and efficiency via data-driven philosophy. #LI-Hybrid Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary… more
- NVIDIA (Santa Clara, CA)
- …project (class, hackathon, or personal) involving AI, ML, data science, distributed systems, or backend infrastructure + Familiarity with cloud environments (AWS, ... GCP, or Azure), including deploying or maintaining small services or apps. + Proficiency in AI assisted code tooling like Cursor, Windsurf, Claude Code, etc. Ways to stand out from the crowd: + Genuine excitement towards the software you will be building. +… more
- Oracle (Sacramento, CA)
- …triage automation, and diagnostic services. These are essential for running distributed AI/ML/HPC workloads across thousands of GPUs, leveraging technologies like ... RoCE and Infiniband. We're excited to meet a talented Senior Software Engineer like you, who shares our enthusiasm for innovation and excellence. As a valued member of our software engineering division, you'll have the opportunity to shape the future of our… more
- NVIDIA (Santa Clara, CA)
- …In this groundbreaking role, you will drive performance and scalability in distributed AI systems. You will focus on optimizing and innovating how large ... language models and generative AI workloads move and process data at scale. What you'll be doing: + Research, design, and implement advanced AI networking technologies with an emphasis on accelerating inference workloads using NVIDIA Inference Xfer Library… more
- RTX Corporation (El Segundo, CA)
- …coordinated teams of UAS platforms. You will be part of a distributed team composed of professionals from several disciplines (software, systems, hardware, autonomy ... algorithms, flight controls, etc.). _This position will work onsite located at one of our following locations: Ft Wayne, IN, McKinney, TX, Cedar Rapids, IA or El Segundo, CA._ **What You Will do:** + Deployment and test of software builds to the real system… more
- NVIDIA (Santa Clara, CA)
- …using NVIDIA's NIMs and Blueprints. + Providing architectural input for distributed applications and Smart Factory standards to improve scalability, reliability, and ... availability. + Collaborating with robotics teams to build and validate autonomous systems before deployment. + Establishing processes for compliance, including functional validation and enterprise security. + Being responsible for the Engineering roadmap for… more
- NVIDIA (Santa Clara, CA)
- …confidential or sensitive information + Be well-versed in managing distributed teams, fostering remote employee engagement, and addressing associated challenges. ... + Ability and talent with using AI to solve people problems. Fluency in MS Word, Excel and PowerPoint and HR systems (preferably Workday) Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The… more
- Palo Alto Networks (Santa Clara, CA)
- …**Your Career** You will build machine learning models and develop big data and distributed systems that use the models to analyze and categorize an enormous amount ... of URLs. You will be a key person in transforming ideas into products which are part of the next generation security platform. The Internet Security Research Team is responsible for innovating new security techniques. **Your Impact** + Design, build, and… more
- NVIDIA (Santa Clara, CA)
- …GPU/accelerated compute architectures and their contributions to AI, HPC, and distributed storage systems is necessary. + Experience with storage, security, ... networking, and high-performance computing workflows is also required. + Shown success in moving products from concept to launch and broad customer adoption is important. + Excellent interpersonal skills and cross-functional collaboration abilities are needed.… more
- NVIDIA (Santa Clara, CA)
- …MLOps and AI infrastructure. + Proven experience designing and optimizing distributed training systems with frameworks like PyTorch, JAX, or TensorFlow. + ... Deep familiarity with reinforcement learning algorithms like PPO, SAC, or Q-learning, including experience tuning hyperparameters and reward functions. + Familiarity with common policy learning techniques like reward shaping, domain randomization, curriculum… more