- Dynatrace (Mountain View, CA)
- …you will love being a Dynatracer** + Dynatrace is a leader in unified observability and security. + We provide a culture of excellence with competitive compensation ... packages designed to recognize and reward performance. + Our employees work with the largest cloud providers, including AWS, Microsoft, and Google Cloud, and other leading partners worldwide to create strategic alliances. + The Dynatrace platform uses… more
- Cisco (CA)
- …a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers ... love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our work with kindness. So bring your work experience, problem-solving… more
- Aeris Communications (San Jose, CA)
- …non-functional requirements such as high-availability, scalability, security and observability . Plan development activities, develop accurate schedule estimates and ... provide daily progress updates in a stand-up meeting. Deliver high quality software in a predictable and reliable manner. Collaborate actively with other developers and other cross-functional teams like QA, SRE, and Operations. Assist in support of the… more
- Amazon (Sunnyvale, CA)
- …and enforce engineering best practices (code quality, testing, CI/CD, logging, observability ). - Mentor and guide other engineers through code reviews, technical ... discussions, and pair programming. - Lead/Implement client and server-side performance optimizations for large-scale usage. - Stay up to date with evolving LLM capabilities, frontend frameworks, and backend technologies. - Propose and prototype new ideas to… more
- LinkedIn (Mountain View, CA)
- …traffic distribution, load balancing, and fault tolerance. Drive automation, observability , and fault tolerance initiatives, reducing downtime and improving MTTR ... (Mean Time to Recovery). Analyze network traffic telemetry to optimize load balancing, manage traffic spikes, and plan for future capacity needs. Establish and monitor key performance metrics, SLAs, and SLOs for DNS and traffic routing systems. Act as a… more
- DoorDash (San Francisco, CA)
- …powers all of DoorDash's business. + Improve the reliability, scalability, and observability of our training and inference infrastructure. We're excited about you ... because + BS, MS, or PhD. in Computer Science or equivalent + Exceptionally strong knowledge of CS fundamental concepts and OOP languages + 6+ years of industry experience in software engineering + Prior experience building machine learning systems in… more
- ServiceNow, Inc. (San Diego, CA)
- It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today - ... workflows. + Partner with DevOps and architecture teams to evolve observability standards and instrumentation practices. Customer & Stakeholder Engagement + Actively… more
- PennyMac (Westlake Village, CA)
- …agreements (SLAs) that meet or exceed business requirements. + Monitoring & Observability - Lead the development and implementation of comprehensive monitoring and ... observability practices using New Relic and other tools to...capacity. + Advanced AWS certifications (Solutions Architect Professional, DevOps Engineer Professional, or similar). + Advanced knowledge and experience… more
- Rubrik (Palo Alto, CA)
- …traffic routing, and policy enforcement across clouds. + Implement network observability (end-to-end traffic visibility, flow tracing, correlation ID propagation) to ... skills (Python, Bash, or similar) for automation. + Experience with observability and telemetry tools (OpenTelemetry, FluentBit, Prometheus, Grafana, Datadog). +… more
- NVIDIA (Santa Clara, CA)
- GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader who ... challenges through active troubleshooting and a commitment to network automation, observability , documentation, and operational excellence. What you'll be doing: +… more