• Observability Engineer - New Relic

    TEKsystems (Westlake Village, CA)
    …+ 7+ years of experience in a Cloud Engineering role (Observability, DevOps, SRE, etc.). + Proven New Relic Expertise: 3+ years of hands-on experience with ... the New Relic platform, including deep knowledge of Dashboards, NRQL, APM and setting up effective alerting. + Strong IaC Proficiency: 3+ years of experience managing infrastructure and configurations with IaC tools like Terraform/OpenTofu (preferred), AWS…
    TEKsystems (01/09/26)
  • Senior Director of Site Reliability…

    JPMorgan Chase (Seattle, WA)
    …AWS, Azure, Google Cloud) and their services. + Experience in implementing SRE principles and practices to improve system reliability and availability. + Proficiency ... in SQL, NoSQL databases, and data warehousing solutions + Experience hiring, developing, and recognizing talent + Demonstrated prior experience influencing across highly matrixed, complex organizations and delivering value at scale + Experience leading complex…
    JPMorgan Chase (01/07/26)
  • Site Reliability Engineer Lead Analyst Vice…

    Citigroup (Tampa, FL)
    …issues and develop innovative solutions + Serve as advisor or coach to junior SRE engineers, allocating work as necessary + **Automation** - SREs' role is to develop ... and maintain automated tools and systems to manage and monitor the infrastructure. Reduce manual intervention, human errors and the time it takes to perform routine tasks. + **Capacity Planning and Scalability** - periodically assess the capacity needs of…
    Citigroup (01/07/26)
  • Senior Site Reliability Engineer

    Centene Corporation (Austin, TX)
    …maintaining optimum platform infrastructure performance, reliability, and security using **SRE practices**, observability tools, manual and automated procedures, ... documentation, people and processes and continuous delivery (CI/CD) tools, processes, and designs. Develops complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability…
    Centene Corporation (01/06/26)
  • Manager Software Engineer

    General Motors (Austin, TX)
    …leadership in production support, root cause analysis, and incident resolution (SRE principles). **Required Qualifications** + Bachelor's degree in Computer Science ... or related field (or equivalent experience). + 5+ years of managerial experience with eCommerce solutions. + 3+ full lifecycle implementations of eCommerce platforms. + Strong software engineering background with hands-on coding experience. + Experience with…
    General Motors (01/06/26)
  • Big Data Support Engineer - Assistant Vice…

    Citigroup (Irving, TX)
    …conflicting situations using multiple sources of information. + Ensure compliance with SRE best practices. + Timely and effectively escalate complex or unresolved ... issues to Level 2 SREs, development teams, or other specialized support groups, providing comprehensive handover information. + Applies in-depth disciplinary knowledge, contributing to the development of new techniques and the improvement of processes and…
    Citigroup (12/30/25)
  • Lead Site Reliability Engineer, AI/ML…

    JPMorgan Chase (Jersey City, NJ)
    …qualification with 5+ years professional experience. + Expertise in SRE principles, reliability, scalability and performance of application and infrastructure. ... + Have hands-on experience with cloud platforms (AWS, GCP, Azure) and IaC tools (Terraform, Ansible). + Extensive experience implementing advanced observability using tools like OpenTelemetry, Dynatrace, Grafana, and/or cloud-native services. + Experience in…
    JPMorgan Chase (12/25/25)
  • (USA) Principal, Software Engineer

    Walmart (Sunnyvale, CA)
    …for AI/ML and agentic systems. + Collaborate with data scientists, ML engineers, SRE, and product teams to operationalize AI/ML models and integrate them into ... production. + Mentor engineers, foster a culture of continuous learning, and contribute to internal platform standards and engineering playbooks. + Drive experimentation (A/B testing, multi-armed bandits, causal inference) and champion innovation. **Product…
    Walmart (12/24/25)
  • Service Engineer

    Microsoft Corporation (Redmond, WA)
    …Qualifications:** + Experience in cloud operations, technical communications, incident response, or SRE roles in platforms like Azure, AWS, or GCP. + Experience in ... a 24x7x365 enterprise environment. + Understanding of incident management frameworks (e.g., ITIL) and customer communication strategies during high-impact events. + Experience with service health platforms and tooling for communicating incident status at scale…
    Microsoft Corporation (12/21/25)
  • Senior Staff Engineer

    Nutanix (San Jose, CA)
    …+ Proven ability to work across cross-functional engineering, product, and SRE teams. + Excellent system design documentation and architecture diagramming skills. ... + Strong problem-solving mindset and ability to think at platform scale. Qualifications and Experience: + Bachelor's, Master's, or PhD in Computer Science or a related technical field. + 15+ years of relevant software development experience, with a proven…
    Nutanix (12/18/25)