• Senior Performance and Resilience Engineer…

    Red Hat (Sacramento, CA)
    …at internal/external forums **What you will bring:** + 3+ years in reliability , and/or performance engineering on large-scale distributed systems + Expertise in ... systems‑level software design + Expertise with Kubernetes and modern LLM inference server stack (eg, vLLM, TensorRT-LLM, TGI) + Observability & forensics skills with experience with Prometheus/Grafana, OpenTelemetry tracing, eBPF/BPFTrace/perf, Nsight Systems,… more
    Red Hat (08/28/25)
    - Related Jobs
  • Senior Analyst, Account Management

    CVS Health (Sacramento, CA)
    …of customer service through the execution of accuracy, responsiveness, reliability , and professionalism on all interactions. Examines sales, account management, ... and business retention metrics for products and services in support of profitable growth and other business objectives. Controls strategic business plans for accounts and customer relationships, focusing on revenue growth, member retention, and achievement of… more
    CVS Health (08/27/25)
    - Related Jobs
  • Senior Software Engineer

    Amazon (San Francisco, CA)
    …language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a ... mentor, tech lead or leading an engineering team - 6+ years of professional software development or equivalent expertise - Strong background in Golang/Go - BA or BS in Computer Science or a related discipline, or equivalent years of experience - Built and… more
    Amazon (08/27/25)
    - Related Jobs
  • Senior Staff Software Engineer, Software…

    Google (Sunnyvale, CA)
    …who use Google services around the world. We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running ... a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud's Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers. The US base… more
    Google (08/27/25)
    - Related Jobs
  • Senior Principal Mechanical Engineer…

    RTX Corporation (Fairfield, CA)
    …+ Support establishing plans and strategy for requirements management, reliability , logistics, coordination of different teams, evaluation measurements, and other ... disciplines for large or complex projects. + Ensures that all likely aspects of a project or system are considered and integrated into a whole. + Recommends investments or changes in technology, resources, procedures, equipment, systems, or other assets to… more
    RTX Corporation (08/26/25)
    - Related Jobs
  • Senior Software Engineer, TPU Performance,…

    Google (Sunnyvale, CA)
    …who use Google services around the world. We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running ... a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud's Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers. The US base… more
    Google (08/26/25)
    - Related Jobs
  • Senior Software Engineer

    ServiceNow, Inc. (Santa Clara, CA)
    …frameworks to distributed runtime services to system-level components that ensure reliability , scalability, and performance. You should be someone who enjoys solving ... hard engineering problems, is passionate about platform engineering, and is committed to building high-quality, resilient software while driving operational excellence. By being part of this team, engineers can implement software engineering best practices and… more
    ServiceNow, Inc. (08/26/25)
    - Related Jobs
  • Senior , Software Engineer, ML Ops

    Walmart (Sunnyvale, CA)
    …pipelines, data storage, and data processing systems + Ensure the scalability and reliability of ML models and pipelines, troubleshooting issues as needed + Work ... with cross-functional teams to integrate ML models with large language models (LLMs) and other AI/ML technologies **What you'll bring:** + Bachelor's degree in Computer Science, Engineering, or related field + 3+ years of experience in ML Ops, data… more
    Walmart (08/24/25)
    - Related Jobs
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …failures and infrastructure issues and optimize CI/CD workflows for efficiency and reliability . + Comprehensive Testing & OS Vetting: Develop and enhance automated ... frameworks for System-on-Chip (SOC) validation, including daily sanity and regression testing. Integrate testing scripts into CI/CD pipelines to ensure continuous quality. Perform thorough sanity testing of various System Windows and Linux hardware and… more
    NVIDIA (08/22/25)
    - Related Jobs
  • Senior DGX Cloud Performance Engineer

    NVIDIA (Santa Clara, CA)
    …scale and make them more easily consumable by users (via improved scalability, reliability , cleaner abstractions, etc). What you will be doing: + Develop benchmarks, ... end to end customer applications running at scale, instrumented for performance measurements, tracking, sampling, to measure and optimize performance of meaningful applications and services; + Construct carefully designed experiments to analyze, study and… more
    NVIDIA (08/22/25)
    - Related Jobs