- Microsoft Corporation (Mountain View, CA)
- …seeks new knowledge that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in ... monitoring and operations at scale. **Qualifications** **Required Qualifications:** + Enrolled in a full-time bachelor's or master's program in Computer Science,…
- Walmart (Sunnyvale, CA)
- …automation. + Deploy and monitor products on **cloud platforms with agent observability**, telemetry, and auditability in mind. + Develop and implement ... best-in-class **data health monitoring, traceability, and context enrichment** processes to ensure data used by agents is reliable and governed. + Lead technical…
- Rubrik (Sacramento, CA)
- …and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and enable proactive identification ... of issues. * Coordinate and manage incidents, upgrades and changes for InfoSec's applications and services * Drive post-incident analysis with partner teams and/or vendors to identify root cause and ensure preventative measures are implemented promptly *…
- Coinbase (Sacramento, CA)
- …Coinbase. * *We build infrastructure to provide the most secure and highest uptime*: Observability and monitoring is a cornerstone of the team's philosophy in ... order to ensure top-tier uptime and performance. * *We constantly innovate to bring the best yield for our customers*: We are always on the lookout for creative ways to optimize our operations as we continue to scale. * *We empower best in class staking…
- ServiceNow, Inc. (San Diego, CA)
- …years of experience leading and delivering data-intensive applications in application performance monitoring, observability, or AI monitoring. + Master's ... sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how...data needs. + Strong domain knowledge of application performance monitoring and observability tools (e.g., New Relic,…
- PennyMac (Westlake Village, CA)
- …maintaining service level agreements (SLAs) that meet or exceed business requirements. + Monitoring & Observability - Lead the development and implementation of ... comprehensive monitoring and observability practices using New Relic...capacity. + Advanced AWS certifications (Solutions Architect Professional, DevOps Engineer Professional, or similar). + Advanced knowledge and experience…
- EPAM Systems (San Jose, CA)
- …future-ready service landscapes that leverage GenAI, AIOps, and advanced observability across cloud and on-premise environments while scaling our capabilities ... incident management; drive system performance improvements; and establish advanced monitoring and alerting capabilities + Architect comprehensive automation strategies…
- NVIDIA (Santa Clara, CA)
- …a secure operational environment. + Lead initiatives to improve network observability by integrating advanced monitoring and alerting systems, collaborating ... GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader who…
- McAfee, Inc. (San Jose, CA)
- …Terraform, CloudFormation, Ansible, Python, Bash, or PowerShell + Implement automated monitoring, logging, and alerting solutions to maintain system health and ... of cloud operations. + Develop and implement strategies for cost forecasting, monitoring, and optimization of AWS resources. + Continuously analyze and optimize AWS…
- Deloitte (Costa Mesa, CA)
- …models, you'll be empowered to think like an entrepreneur and deliver like an engineer. + Build agentic workflows powered by AI that act autonomously with human ... tools using large language models (LLMs). + Serve as a Forward Deployed Engineer: embed with client teams to understand real-world challenges, co-create tailored AI…