- 
        Senior Manager, Network Site Reliability - GeForce…
- NVIDIA (Santa Clara, CA)
- 
             GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader who is dedicated to optimizing network performance and ensuring a smooth user experience. The position focuses on managing Network SRE to streamline network operations, minimize manual tasks, and achieve service level objectives (SLOs). In this position, you will have the opportunity to tackle challenges through active troubleshooting and a commitment to network automation, observability, documentation, and operational excellence. What you'll be doing: + Cultivate a top-performing team of Network Site Reliability Engineers through encouraging a culture of collaboration, accountability, and technical excellence, along with offering mentorship. + Manage the design, implementation, and maintenance of robust and scalable network infrastructure across data centers, cloud environments, and edge locations to ensure consistent connectivity and performance. + Apply proactive reliability engineering techniques to reduce network disruptions and decrease Mean Time to Recovery (MTTR), improving overall service reliability and user satisfaction. + Work closely with Security and Compliance teams to ensure that all network infrastructure meets regulatory standards and internal policies, maintaining a secure operational environment. + Lead initiatives to improve network observability by integrating advanced monitoring and alerting systems, collaborating with multi-functional teams to implement network solutions that support business objectives and enhance user experiences. What we need to see: + Bachelor’s or Master’s degree in Computer Science or a related field, or equivalent experience. + 12+ overall years of proven experience in host and infrastructure networking + 6+ years in leadership roles managing teams focused on high-performance Software Defined Networking (SDN) solutions. + Strong understanding of networking protocols, with hands-on experience in kernel development and key technologies like routing, switching, load balancers, firewalls, VPNs, and cloud platforms such as AWS, GCP, and Azure. + Skilled in Infrastructure as Code (IaC) using automation tools like Ansible and Terraform, along with monitoring tools such as Prometheus, Grafana, and NetBox to improve network performance. + Proven ability to design network architectures for cloud and distributed systems, with practical experience in large-scale configurations and familiarity with SR-IOV, Xen virtualization, and Open Virtual Switch or similar SDN technologies. Ways to stand out from the crowd: + Extensive experience in managing hybrid cloud environments and large-scale distributed systems, showcasing effective infrastructure management skills. + Strong understanding of Site Reliability Engineering (SRE) concepts, including SLAs, SLOs, and incident management best practices. + Proven ability to use operational signals like SNMP, Syslog, and Streaming Telemetry for efficient issue identification and resolution. + Comprehensive knowledge of Open Virtual Switch (OVS) and SR-IOV RDMA for effective network management and optimization. + Experience in debugging and improving code, automating repetitive tasks, and working with Mellanox/Cumulus Linux, Palo Alto firewalls, and Netscaler load balancers With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most forward-thinking and versatile people in the world working with us, and our engineering teams are growing fast in some of the most impactful fields of our generation: Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're a creative engineer who enjoys autonomy and shares our passion for technology, we want to hear from you. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 248,000 USD - 396,750 USD. You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) . Applications for this job will be accepted at least until August 8, 2025. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. 
 
 
- 
        
Recent Searches
- Senior Program Manager Quality (Minnesota)
- Oracle Financials Developer (Pennsylvania)
- Senior CCaaS UCaaS Engineer (Michigan)
- manager provider engagement remote (United States)
Recent Jobs
- 
                
                    Senior Manager, Network Site Reliability - GeForce Now
                
                - NVIDIA (Santa Clara, CA)
- 
                
                    Procurement Data Product Manager
                
                - ThermoFisher Scientific (Waltham, MA)
- 
                
                    Facilities Maintenance Technician
                
                - C&W Services (Midland, TX)