- NTT America, Inc. (Plano, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required. **Automation &… more
- NTT America, Inc. (Plano, TX)
- …system monitoring, alerting, and logging solutions to ensure high availability and reliability . + Lead root cause analysis and document post-incident reviews ... for major Linux-related issues. + Execute patch management, OS and kernel upgrades, and regular system maintenance. + Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure. + Participate in on-call rotation and… more
- NTT America, Inc. (Plano, TX)
- …system monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and document post-incident reviews for ... major Linux-related issues. Execute patch management, OS and kernel upgrades, and regular system maintenance. Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure. Participate in on-call rotation and after-hours… more
- NTT America, Inc. (Plano, TX)
- …system monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and document post-incident reviews for ... major Linux-related issues. Execute patch management, OS and kernel upgrades, and regular system maintenance. Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure. Participate in on-call rotation and after-hours… more
- NTT DATA North America (Austin, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required. Automation &… more
- NTT DATA North America (Austin, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability . + Lead root cause analysis and post-mortem documentation for ... major incidents. + Execute patch management, upgrades, and regular maintenance activities. + Develop and maintain backup, disaster recovery, and failover strategies and operations. + Participate in on-call rotation and after-hours support as required.… more
- General Motors (Austin, TX)
- …and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General motors, you'll join a dedicated team focused ... on enhancing the reliability , efficiency, and scalability of our distributed systems. We..., and reduce manual intervention. + Observability and Monitoring: Lead , Implement and improve monitoring and observability frameworks, enabling… more
- WestRock Company (Evadale, TX)
- Position: Senior Electrical Reliability Engineer Job Code: MEREP3 + Sr. Eng, Reliability Eng Location: Evadale, TX The Opportunity: As the Sr Engineer, Elec Rel, ... Mill, your primary responsibility is investigating and resolving electrical reliability issues within the facility. Maintain close working interaction with the… more
- Trane Technologies (Tyler, TX)
- …On Fridays, choose your work location, balancing what your work requires! As a ** Reliability Engineer** , you will play a key role in ensuring the reliable ... be to develop and implement strategies that enhance the reliability and performance of our products and critical components....failures (MTBF) and mean time to failure (MTTF). + Lead continuous improvement projects aimed at reducing failures and… more
- IKO (Ennis, TX)
- …by hiring people who hold these values. People like you! Job Description Role: Reliability Engineer *This is a Safety Sensitive position. * Job Summary: The ... Reliability Engineer is responsible for improving equipment performance, reducing...Prodac/Maximo interface, production loss and Maintenance cost data to lead Problem Solving / troubleshooting teams prioritized to generate… more