- Meta (Menlo Park, CA)
- …hardware. As part of the AI acceleration software stack, we develop PyTorch compiler frontend for MTIA, PyTorch runtime for inference & training, high performance ... runtime and kernel libraries exploiting various hardware architectural features and tooling. We are looking for an engineering manager to support MTIA software stack development for training and inference platform. **Required Skills:** Software Engineering… more
- NVIDIA (Santa Clara, CA)
- …the crowd: + Prior experience with an LLM framework or a DL compiler in inference, deployment, algorithms, or implementation + Prior experience with performance ... modeling, profiling, debug, and code optimization of a DL/HPC/high-performance application + Architectural knowledge of CPU and GPU + GPU programming experience (CUDA or OpenCL) NVIDIA is widely considered to be one of technology's most desirable employers. We… more
- NVIDIA (Santa Clara, CA)
- …inference frameworks engineering, focusing on SGLang. + Partner with internal compiler, libraries, and research teams to deliver end-to-end optimized inference ... pipelines across NVIDIA accelerators. + Oversee performance tuning, profiling, and optimization of large-scale models for LLM, multimodal, and generative AI applications. + Guide engineers in adopting best practices for CUDA, Triton, CUTLASS, and multi-GPU… more
- NVIDIA (Santa Clara, CA)
- …Ways to stand out from the crowd: + Background in domain-specific compiler and library solutions for LLM inference and training (e.g., FlashInfer, Flash Attention) ... + Expertise in inference engines like vLLM and SGLang + Expertise in machine learning compilers (e.g., Apache TVM, MLIR) + Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar) + Open… more
- Amazon (Seattle, WA)
- …and vision generative AI models * Collaborate directly with silicon architects and compiler teams to push the boundaries of AI acceleration * Drive performance ... benchmarking and tuning that directly impacts millions of inference calls globally Key job responsibilities You will drive the Evolution of Distributed AI at AWS Neuron As a Technical Leader at the forefront of AWS's AI Accelerator, you'll architect the bridge… more
- General Motors (Jefferson City, MO)
- …tooling, build systems, C++, Python, etc. + Experience with / knowledge of compiler toolchain integrations (clang, gcc, nvcc, etc.). + Experience leading teams with ... broad, company-wide impact. + Attention to detail, and a desire to improve processes & systems around you. + Deep understanding of the business and operational impact for different technology tradeoffs. + Passion for developing and growing individual… more
- NVIDIA (Santa Clara, CA)
- …such as TensorFlow/PyTorch. + Review, design, and implement features to enhance compiler features to support the NVIDIA networking ecosystem. + Research, design and ... develop hardware features relevant to scientific, deep learning, and data-intensive workloads. What we need to see: + A Ph.D. or Master's in computer science, computer engineering, or a closely related field or equivalent experience. + 5+ years of experience in… more
- Amazon (Sunnyvale, CA)
- …workloads to run efficiently on our accelerator * Collaborate closely with compiler engineers, model developers, hardware architects and product teams to build the ... best ML centric hardware and software solutions for our devices * Deliver hardware architecture, microarchitecture and other design collateral for our next generation ML accelerators * Build tools for modeling and performance evaluation to enable power,… more
- Amazon (Cupertino, CA)
- …- Demonstrated level of expertise in PD tools such as Innovus, ICC2, Fusion Compiler, STA, and Sign-Off. - Proven track record of delivering metric driven PPA flow ... development and support. Preferred Qualifications - Expertise in high-performance, low-power physical design, and implementation techniques with industry standard synthesis, PnR, or Signoff tools. - Excellent programming skills in languages like Python, Perl,… more
- Micron Technology, Inc. (San Jose, CA)
- …design and validation; 5. Design/Verification CAD tools, including NCSim; 6. Design Compiler, Formality, or PrimeTime; 7. TCL, PERL, C/C++ or Python; 8. NAND ... Memory Design. The US base salary range that Micron Technology, Inc. estimates it could pay for this full-time position is $148,869.22 - $197,500 per year. For additional pay & benefits information, please refer to the requisition at Micron.com/careers. As a… more