• Software Engineer, Systems ML

    Meta (Menlo Park, CA)
    …15. 4. GPU, CPU, or AI hardware accelerator architectures 16. 5. Compiler optimizations such as loop optimizations, vectorization, parallelization, AND 17. 6. System ... performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis and associated tooling development **Public Compensation:** $178,360/year to $200,200/year + bonus + equity + benefits **Industry:**… more
    Meta (10/20/25)
    - Related Jobs
  • Data Scientist - Model Optimization

    quadric.io, Inc (Burlingame, CA)
    …layer‑ and token‑level error analysis to guide numerical‐format choices. + Partner with compiler team to convert your findings into turnkey SDK flows and reference ... configs. + Publish internal whitepapers, external benchmarks, and present results to customers and at industry events. + Monitor academic literature in compression and efficient inference; translate promising ideas into reproducible prototypes. Requirements +… more
    quadric.io, Inc (10/19/25)
    - Related Jobs
  • Software Engineer Intern - AI Accelerators…

    IBM (San Jose, CA)
    …examples of job responsibilities include (a) Develop software optimization in the compiler for AI accelerators, (b) Investigate and prototype AI model optimizations ... for their execution in accelerators, (c) Contribute to the development of cutting-edge demonstrations with emerging AI accelerators **Required technical and professional expertise** * Have strong background on AI accelerators and/or AI accelerator-associated… more
    IBM (10/19/25)
    - Related Jobs
  • Director / Sr Program Manager, AI Accelerator

    quadric.io, Inc (Burlingame, CA)
    …embedded processing domain. Highly Desired Skills and Experience (Pluses) + AI Compiler Knowledge: Good understanding and working knowledge of AI SW compilers (eg, ... TVM, MLIR, LLVM, IREE). + AI Frameworks and Kernels: Knowledge of AI kernel development and familiarity with popular deep learning frameworks like JAX, PyTorch, ONNX, and TensorFlow (TF). + Professional Certification: Current Program Management (PM)… more
    quadric.io, Inc (10/18/25)
    - Related Jobs
  • Software Engineer, PhD, Early Career, AI/Machine…

    Google (San Bruno, CA)
    …working across the full stack, from low-level hardware acceleration and compiler optimizations to high-level model architecture and production APIs, transforming ... your research expertise into robust, scalable products. + Optimize complex system performance by analyzing and fixing performance bottlenecks, memory inefficiencies, and errors in production systems to meet stringent customer goals. + Elevate engineering… more
    Google (10/01/25)
    - Related Jobs
  • Lead C++ Software Engineer

    Cadence Design Systems, Inc. (San Jose, CA)
    …algorithms and optimizations for QoR (Quality of Results) and performance for the Protium Compiler working with a small team of super star engineers to develop our ... next generation FPGA based verification platform. Responsibilities: + Implement new algorithm and enhancements in C/C++ based code to implement the software stack for the FPGA based platform with special focus on synthesis / technology mapping. + Develop the… more
    Cadence Design Systems, Inc. (09/30/25)
    - Related Jobs
  • Senior Deep Learning Architect, LLM Inference

    NVIDIA (Santa Clara, CA)
    …knowledge of deep learning inference serving, PyTorch programming, profiling, and compiler optimizations. + Proficiency in Python and C++ programming languages and ... familiarity with CUDA. + Experience with LLMs and their performance challenges and opportunities. + Solid understanding of CPU and GPU microarchitecture and performance characteristics. + Experience with complex software projects like frameworks, compilers, or… more
    NVIDIA (09/24/25)
    - Related Jobs
  • Senior System Software Engineer - AI Performance…

    NVIDIA (Santa Clara, CA)
    …analysis for training/inference workload + Knowledge of Linux device drivers and/or compiler implementation + Knowledge of GPU and/or CPU architecture and general ... computer architecture principles #LI-Hybrid Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for… more
    NVIDIA (09/19/25)
    - Related Jobs
  • Senior Software Development Engineer, AI/ML, AWS…

    Amazon (Cupertino, CA)
    …many more. The Inference Model Enablement team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference ... solutions with Trainium and Inferentia. Experience optimizing inference performance for both latency and throughput on these large models using Python, Pytorch or JAX is a must. Experience with Deepspeed and other distributed inference libraries is a bonus, as… more
    Amazon (09/07/25)
    - Related Jobs
  • Software Development Manager, LLM Inference Model…

    Amazon (Cupertino, CA)
    …of a vertically integrated system stack consisting of the PyTorch inference library, Neuron compiler , runtime, and collectives. A day in the life You will work with ... your senior management and technical leaders to define the model enablement and performance optimization for the latest SOTA LLMs, build and deliver them to customers. Meanwhile, lead the team to continue improving the model onboarding experience, as well as… more
    Amazon (09/06/25)
    - Related Jobs