- Walmart (Sunnyvale, CA)
- …for AI/ML and agentic systems. + Collaborate with data scientists, ML engineers, SRE , and product teams to operationalize AI/ML models and integrate them into ... production. + Mentor engineers, foster a culture of continuous learning, and contribute to internal platform standards and engineering playbooks. + Drive experimentation (A/B testing, multi-armed bandits, causal inference) and champion innovation. **Product… more
- Microsoft Corporation (Redmond, WA)
- …Qualifications:** + Experience in cloud operations, technical communications, incident response, or SRE roles in platforms like Azure, AWS, or GCP. + Experience in ... a 24x7x365 enterprise environment. + Understanding of incident management frameworks (eg, ITIL) and customer communication strategies during high-impact events. + Experience with service health platforms and tooling for communicating incident status at scale… more
- Nutanix (San Jose, CA)
- …+ Proven ability to work across cross-functional engineering, product, and SRE teams. + Excellent system design documentation and architecture diagramming skills. ... + Strong problem-solving mindset and ability to think at platform scale. Qualifications and Experience: + Bachelor's, Master's, or PhD in Computer Science or a related technical field. + 15+ years of relevant software development experience, with a proven… more
- Microsoft Corporation (Redmond, WA)
- …and brings them to the attention of their Site Reliability Engineering ( SRE ) and/or product engineering teams. + Utilizes insights from performance and resource ... monitoring tools to identify whether there is a need to optimize the efficiency of component and feature code, or if changes to compute resources are required. Models the predicted effect of changes to code and/or compute resources across components or… more
- Honeywell (Atlanta, GA)
- …cloud security practices. + Support and work alongside the CTO and SRE to enhance best-in-class cloud posture in a multi-cloud environment. + Partner ... with Honeywell Global Security to understand and influence cloud security baselines, providing practical solutions that incorporate engineering considerations without introducing risk. + Drive the establishment of cloud security baselines through policy… more
- Citigroup (Irving, TX)
- …+ Ensure best engineering standards are followed by team including DevOps and SRE . + Be the second level reviewer for the application design and implementation. ... + Provide inputs, review the test plans and test cases for adequate coverage to ensure the product quality. + Be accountable for the releases to go smoothly. + Be single point of contact for production incidents at L3 level, troubleshoot, perform root cause… more
- Amazon (Herndon, VA)
- …systems - 5+ years of Systems Engineering, DevOps, Site Reliability Engineering ( SRE ) or Enterprise Production experience in Windows / Linux or similar environments ... - 3+ years' experience operating in a 24/7 production environment. - 3+ years experience with a scripting language: Perl, Python, Ruby, PowerShell or similar languages Preferred Qualifications - Bachelor's Degree in Computer Information Systems, Computer… more
- Insight Global (San Jose, CA)
- …new products or solutions. Understand the concepts of Site Reliability Engineering ( SRE ) to maximize automation, reduce waste, increase scale and apply systemic ... thinking Working with Azure cloud native tooling Passionate about driving new technology solutions (change) Good communication and presentation skills Team player Able to express ideas effectively in individual and group situations (including non-verbal… more
- MetLife (Cary, NC)
- …automation programs within large, regulated, global enterprises. * Exposure to SRE , MLOps, or AI-driven operational analytics. * Certifications in relevant ... infrastructure domains (eg, AWS/Azure Architect, ITIL). * Strategic thinker with the ability to translate complex technical initiatives into measurable business outcomes. Success Indicators * Early detection of issues before impact. * Reduced MTTR and volume… more
- CareFirst (Reston, VA)
- …Datacenter migration and Enterprise Cloud transformation efforts. + Experience with SRE principles and transformation. + 3+ years of experience with implementation ... of Containerization (Kubernetes), Cloud technologies (AWS, Azure, or Google, etc.), DevOps tool chain (Ansible, Jenkins, Artifactory, bitbucket, etc.), and technical patterns (IaC, Automated Provisioning/Release, CI/CD, etc.). + Solid understanding of Software… more
Recent Jobs
-
Sr. SDET
- ForeScout Technologies, Inc. (Dallas, TX)
-
The BEAT Rowdy Runners - Project Based Internship
- BEAT LLC (DE)