- pony.ai (Fremont, CA)
- …and optimized ML operator libraries. + Work across the entire ML framework/compiler stack (e.g., Torch, CUDA, and TensorRT), and system-efficient deep learning models. ... libraries. + Deep knowledge of system performance, GPU optimization, or ML compiler. Compensation and Benefits Base Salary Range: $140,000 - $250,000 Annually… more
- Amazon (Cupertino, CA)
- …The ML Distributed Training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training ... building distributed training support into PyTorch, TensorFlow using XLA and the Neuron compiler and runtime stacks. This role will help tune these models to ensure… more
- NVIDIA (Santa Clara, CA)
- …systems for the CUDA ecosystem. We build innovative agentic runtimes and compiler-integrated orchestration that work together with NVIDIA's software stack to provide ... team, you will develop new agent abstractions, GPU-centric runtimes, and compiler- or runtime-driven system solutions to accelerate agent planning, tool-use, code… more
- Amazon (Cupertino, CA)
- …more. The Distributed training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training ... building distributed training support into PyTorch and JAX using XLA and the Neuron compiler and runtime stacks. This role will help tune these models to ensure… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …communities, and each other, every day. Job Responsibilities: + Re-architecting Protium compiler for very high-performance system + Work includes writing efficient ... algorithm + Reading using timing annotations & incorporating in the Protium Compiler + Write Design Spec & Unit Tests Position Requirements/Qualifications: +… more
- NVIDIA (Santa Clara, CA)
- …be doing: + Orchestrate the integration of new hardware functionalities into TensorRT's compiler and runtime. + Work closely with teams and stakeholders across the ... environment. + Background with systems programming, embedded systems, and/or compiler development. + Experience in software performance benchmarking, profiling, and… more
- Amazon (Cupertino, CA)
- …The ML Distributed Training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training ... build distributed training support into PyTorch and JAX using XLA, the Neuron compiler, and runtime stacks. You will optimize models to achieve peak performance and… more
- NVIDIA (Santa Clara, CA)
- …Processing, Neural Network Architectures, GPU Acceleration, Deep Learning Neural Networks, Compiler Programming + Performance Modeling, Profiling, Optimizing, ... and/or Analysis Depending on the internship role, prior experience or knowledge requirements could include the following programming skills and technologies: + C, C++, Python, Perl, GPU Computing (CUDA, OpenCL, OpenACC), Deep Learning Frameworks (PyTorch,… more
- NVIDIA (Santa Clara, CA)
- …C, C++, TCL, SPICE, Linux, Verilog, SKILL, Make, ICC2, Design Compiler, PrimeTime (Synopsys, First Encounter), Innovus, Virtuoso (Cadence) Click here ... (http://www.nvidia.com/content/dam/en-zz/Solutions/about-nvidia/careers/UR-Student-Resources.pdf) to learn more about NVIDIA, our early talent programs, benefits offered to students and other helpful student resources related to our latest technologies and… more
- Broadcom (San Jose, CA)
- …and Python. Proficiency in developing optimized code in both x86 and ARM64 compiler toolchains. 8. Strong analytical, problem solving and debugging skills in a ... combined software and hardware environments. 9. Excellent written and verbal communication skills, ability to efficiently collaborate with multiple teams across geographically diverse areas. **Additional Job Description:** **Compensation and Benefits** The… more