"Alerted.org

Job Title, Industry, Employer
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Advanced Search

Advanced Search

Cancel
Remove
+ Add search criteria
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Related to

  • Distinguished Software Engineer, Reliability Infra

    LinkedIn (Mountain View, CA)



    Apply Now

    LinkedIn is the worlds largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day. Were also committed to providing transformational opportunities for our own employees by investing in their growth. We aspire to create a culture thats built on trust, care, inclusion, and fun where everyone can succeed.

     

    At LinkedIn, our approach to flexible work is centered on trust and optimized for culture, connection, clarity, and the evolving needs of our business. The work location of this role is hybrid, meaning it will be performed both from home and from a LinkedIn office on select days, as determined by the business needs of the team.

     

    This role will be based in Sunnyvale, CA or San Francisco, CA.

    Responsibilities

    + Serve as a senior technical leader driving the long-term reliability and observability strategy across LinkedIn's infrastructure

    + Re-architect LinkedIn's backend systems to enable granular failure domains and reduce the blast radius of incidents

    + Design and implement next-generation failure mitigation strategies that avoid full-region or full-datacenter failovers

    + Partner closely with across many different types of engineers to raise the bar for operational excellence and incident response

    + Define and build frameworks to improve monitoring, alerting, and observability across hundreds of services and systems

    + Define and own the roadmap of bringing observability to critical user journeys for LinkedIn's products to help capture and improve the experience of LinkedIn's members/customers

    + Spearhead a multi-year initiative to transition LinkedIn's infrastructure to a regionalized model with localized failover, enhancing both scalability and availability

    + Lead technical discussions on the future of Engineering at LinkedIn, what the function should evolve into over the next 3- 5 years

    + Deliver key insights, executive level reporting across the cross-functional engineering teams to enable the right business decisions around improving quality and reliability of our services and products

    + Act as a force multiplier by mentoring engineers, influencing technical direction across orgs, and contributing deeply to culture, hiring, and technical excellence

    + Lead incident response and post-incident reviews to identify root causes and implement preventive measures.

    + Develop and maintain incident management processes and procedures to ensure timely resolution of issues and minimize impact on customers

    Basic Qualifications

    + 15+ years of software engineering experience

    + 8+ years focused on infrastructure, reliability focused engineering, or distributed systems

    Preferred Qualifications

    + Hands-on experience with large-scale incident response, root cause analysis, and resiliency engineering

    + Strong communication and cross-functional collaboration skills, with experience influencing across multiple orgs and leadership levels

    + Proven success designing and leading architectural transformations at internet-scale companies

    + Deep knowledge of systems reliability, observability frameworks, and fault-tolerant architecture design

    + Experience with multi-region architecture, capacity planning, and failover strategies in large-scale cloud or hybrid environments

    + Background in CI/CD, platform reliability, and automation of ops-heavy systems.

    + Familiarity with modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana) and service mesh architecture

    + Track record of setting long-term technical strategy and driving systemic improvements in availability and performance

    + Previous experience in a Distinguished Engineer or equivalent role at a high-growth or web-scale technology company

    Suggested Skills

    + Site Reliability Engineering (SRE)

    + Leadership

    + Large scale infrastructure

     

    LinkedIn is committed to fair and equitable compensation practices. The pay range for this role is $238,000 to $390,000. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to skill set, depth of experience, certifications, and specific work location. This may be different in other locations due to differences in the cost of labor. The total compensation package for this position may also include annual performance bonus, stock, benefits and/or other applicable incentive compensation plans. For more information, visit https://careers.linkedin.com/benefits

     

    Equal Opportunity Statement

     

    We seek candidates with a wide range of perspectives and backgrounds and we are proud to be an equal opportunity employer. LinkedIn considers qualified applicants without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.

     

    LinkedIn is committed to offering an inclusive and accessible experience for all job seekers, including individuals with disabilities. Our goal is to foster an inclusive and accessible workplace where everyone has the opportunity to be successful.

     

    If you need a reasonable accommodation to search for a job opening, apply for a position, or participate in the interview process, connect with us at [email protected] and describe the specific accommodation requested for a disability-related limitation.

     

    Reasonable accommodations are modifications or adjustments to the application or hiring process that would enable you to fully participate in that process. Examples of reasonable accommodations include but are not limited to:

     

    + Documents in alternate formats or read aloud to you

    + Having interviews in an accessible location

    + Being accompanied by a service dog

    + Having a sign language interpreter present for the interview

    A request for an accommodation will be responded to within three business days. However, non-disability related requests, such as following up on an application, will not receive a response.

     

    LinkedIn will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by LinkedIn, or (c) consistent with LinkedIn's legal duty to furnish information.

     

    San Francisco Fair Chance Ordinance ​

     

    Pursuant to the San Francisco Fair Chance Ordinance, LinkedIn will consider for employment qualified applicants with arrest and conviction records.

     

    Pay Transparency Policy Statement ​

     

    As a federal contractor, LinkedIn follows the Pay Transparency and non-discrimination provisions described at this link: https://lnkd.in/paytransparency.

     

    Global Data Privacy Notice for Job Candidates ​

     

    Please follow this link to access the document that provides transparency around the way in which LinkedIn handles personal data of employees and job applicants: https://legal.linkedin.com/candidate-portal.

     


    Apply Now



Recent Searches

[X] Clear History

Recent Jobs

  • Distinguished Software Engineer, Reliability Infra
    LinkedIn (Mountain View, CA)
  • Intern - Yield Technology Equipment
    Micron Technology, Inc. (Boise, ID)
  • Emergency Management Planning Cadre
    AC Disaster Consulting (Portland, OR)
  • Engineer Intern
    Comfort Systems (Lexington, KY)
[X] Clear History

Account Login

Cancel
 
Forgot your password?

Not a member? Sign up

Sign Up

Cancel
 

Already have an account? Log in
Forgot your password?

Forgot your password?

Cancel
 
Enter the email associated with your account.

Already have an account? Sign in
Not a member? Sign up

© 2025 Alerted.org