• Exempt to Permanent - Program Specialist…

    City and County of San Francisco (San Francisco, CA)
    …contributes to short and long-term projects and workgroups. + Provides technical support for assigned program management systems to ensure smooth functionality ... (WDD), Family and Children's Services (FCS), Investigations, SF Benefits Net, Program Support Operations, Alignment & Guidance as well as the Department of… more
    City and County of San Francisco (08/26/25)
    - Related Jobs
  • Sr Principal Site Reliability Engineer (Sase)

    Palo Alto Networks (Santa Clara, CA)
    …SLOs, and SLAs and experience implementing them. + Hands-on experience with incident management protocols and participating in on-call rotations. + Familiarity ... Incident Response: Act as a key leader during production incidents, driving resolution, and conducting blameless postmortems to...to build tools, frameworks, and cloud platforms that will support our company's growth over the next decade. If… more
    Palo Alto Networks (08/16/25)
    - Related Jobs
  • Food Service Team Leader (Hourly Starbucks…

    Target (San Jose, CA)
    …Beverage business fundamentals: department sales trends, freshness and quality, inventory management , guest shopping patterns and pricing and promotions strategies + ... Planning department(s) daily/weekly workload to support business priorities and deliver sales + Leading a...direct leader. + Assess Food Service back of house, production areas, dining spaces and merchandising spaces to ensure… more
    Target (09/07/25)
    - Related Jobs
  • Senior AI Infrastructure Engineer - DGX Cloud

    NVIDIA (Santa Clara, CA)
    incident response and blameless postmortems + Be part of an on call rotation to support production systems What We Need To See: + BS degree in Computer ... group. This engineering role will design, build and maintain large scale production systems with high efficiency and availability using the combination of software… more
    NVIDIA (08/08/25)
    - Related Jobs
  • Senior Network Operations Engineer - Layer 4-7…

    ServiceNow, Inc. (Santa Clara, CA)
    …similar languages for automation and tooling. + Experience with change management processes in high-availability production environments. + Excellent ... + Perform software upgrades, version control, and security patching across production systems. + Proactively analyze network metrics such as capacity, latency,… more
    ServiceNow, Inc. (08/21/25)
    - Related Jobs
  • Software Engineer III ATI

    Robert Half-Robert Half Corporate (San Ramon, CA)
    …+ Leads the analysis and resolution of moderate to complex issues in production platforms, defining incident response approaches and resolution playbooks. + ... Provides Level III support critical production issues, collaborating across development,...tools such as Autosys. + Solid understanding of project management principles and methodologies. + Strong analytical and problem-solving… more
    Robert Half-Robert Half Corporate (08/13/25)
    - Related Jobs
  • Environmental Health & Safety Specialist

    Ensign-Bickford Aerospace and Defense (Moorpark, CA)
    …floor on a regular basis to establish EH&S presence and provide collaborative support with factory stakeholders to ensure production & operational changes ... reviews, general risk assessments and other safety assessments to support Environmental, Health & Safety management . +...programs. Lead weekly new hire orientation safety training. + Support with the analysis of incident trends… more
    Ensign-Bickford Aerospace and Defense (08/28/25)
    - Related Jobs
  • Senior System Engineer - DGX Cloud Lepton

    NVIDIA (Santa Clara, CA)
    …and release practices that ensure traceability and integrity of what runs in production . + Monitoring & incident practice: establish health signals and SLOs; ... the autonomy to drive meaningful projects with strong mentorship and support . We practice blameless postmortems, iterate continuously, and encourage thoughtful… more
    NVIDIA (08/16/25)
    - Related Jobs
  • Senior ML Storage Engineer - GPU Clusters

    NVIDIA (Santa Clara, CA)
    …and quality of service (QoS) through operational excellence, proactive monitoring, and incident resolution. + Support a globally distributed on premise and ... performance and cost-effectiveness. + Continuously improve storage infrastructure provisioning, management , observability and day to day operation through automation.… more
    NVIDIA (07/31/25)
    - Related Jobs
  • Site Reliability Engineering Manager

    Two95 International Inc. (Sacramento, CA)
    …contracted service-level agreements. Manage and coordinate operational components of incident management , including detection, response and reporting. Maintain ... performance. The ISM's job is composed of a broad range of activities in support of IT program initiatives, including: + Strategic support + Reliability liaison… more
    Two95 International Inc. (06/09/25)
    - Related Jobs