"Alerted.org

Job Title, Industry, Employer
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Advanced Search

Advanced Search

Cancel
Remove
+ Add search criteria
City & State or Zip Code
20 mi
  • 0 mi
  • 5 mi
  • 10 mi
  • 20 mi
  • 50 mi
  • 100 mi
Related to

  • Site Reliability Engineer

    IBM (Jersey City, NJ)



    Apply Now

    Introduction

    A career in IBM Software means you'll be part of a team that transforms our customer's challenges into industry-leading solutions. We are an infinitely curious team, always seeking new possibilities, and dedicated to creating the world's leading AI-powered, cloud-native software solutions. Our renowned legacy creates endless global opportunities for our network of IBMers. We are a team of deep product experts, ensuring exceptional client experiences, with a focus on delivery, excellence, and obsession over customer outcomes. This position involves contributing to HashiCorp's offerings, now part of IBM, which empower organizations to automate and secure multi-cloud and hybrid environments. You will join a team managing the lifecycle of infrastructure and security, enhancing IBM's cloud solutions to ensure enterprises achieve efficiency, security, and scalability in their cloud journey.

     

    Your role and responsibilities

     

    Our Team

     

    The Infrastructure Services team builds and maintains the backbone of HashiCorp’s cloud products. We focus on creating reliable, scalable, and secure infrastructure services that enable engineering teams to transition quickly without breaking things. Instead of just keeping the lights on, we’re constantly improving automation, reducing toil, and making infrastructure more self-service and developer-friendly.

     

    We work with Nomad, Consul, Vault, Terraform, and AWS services to power HashiCorp’s cloud offerings. Our mission is to provide infrastructure that’s easy to use, resilient, and secure by default so product teams can focus on delivering great experiences to customers.

     

    About this Role

     

    As a Site Reliability Engineer II on the Infrastructure Services team, you will help build, maintain, and improve the infrastructure that supports all HashiCorp cloud products. You will work alongside skilled engineers to ensure our systems are reliable, scalable, and secure while gaining hands-on experience in operating and automating cloud infrastructure. This role is ideal for an engineer looking to deepen their expertise in site reliability engineering, learn from senior engineers, and take on increasing responsibility over time.

    In this role, you can expect to:

    Contribute to the development and maintenance of core infrastructure services, ensuring reliability, scalability, and security

     

    Implement automation to improve operational efficiency and reduce manual toil

     

    Assist in monitoring, alerting, and logging improvements to enhance system observability

     

    Debug and address medium-complexity infrastructure issues with guidance from senior engineers

     

    Participate in on-call rotations after an initial onboarding period, learning incident response best practices

     

    Work within established team practices, exercising self-directed judgment on tasks while seeking guidance when necessary

     

    Propose and implement improvements to existing infrastructure components and deployment processes

     

    Write and maintain documentation for infrastructure configurations, procedures, and troubleshooting guides

     

    Collaborate with other teams to understand infrastructure needs and contribute to solutions

     

    Shadow interviews for entry-level candidates and participate in discussions on hiring evaluations

     

    This job can be performed from anywhere in the US

    Required technical and professional expertise

    * Have experience in site reliability engineering, cloud infrastructure management, or systems administration

    * Familiar with cloud platforms such as AWS and infrastructure as code tools like Terraform

    * Have some experience with observability tools such as Datadog, Prometheus, or Grafana

    * Enjoy problem-solving and working through operational challenges

    * Are comfortable writing scripts or simple automation in languages such as Python, Go, or Bash

    Preferred technical and professional experience

    * Communicate skillfully and collaborate well in a team environment

    * Are interested in growing into a senior SRE role and learning from skilled engineers

    * Have a growth mindset and seek continuous improvement in processes and technical skills

    * Knowledge and familiarity for HashiCorp and IBM products

     

    IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

     


    Apply Now



Recent Searches

[X] Clear History

Recent Jobs

  • Site Reliability Engineer
    IBM (Jersey City, NJ)
  • Ping engineer/Developer
    NTT DATA North America (Dallas, TX)
  • Senior Pre-Sales Solutions Architect
    MongoDB (Seattle, WA)
[X] Clear History

Account Login

Cancel
 
Forgot your password?

Not a member? Sign up

Sign Up

Cancel
 

Already have an account? Log in
Forgot your password?

Forgot your password?

Cancel
 
Enter the email associated with your account.

Already have an account? Sign in
Not a member? Sign up

© 2025 Alerted.org