- NTT America, Inc. (Plano, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability. Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required. Automation &…
- NTT America, Inc. (Plano, TX)
- …system monitoring, alerting, and logging solutions to ensure high availability and reliability. Lead root cause analysis and document post-incident reviews for ... major Linux-related issues. Execute patch management, OS and kernel upgrades, and regular system maintenance. Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure. Participate in on-call rotation and after-hours…
- NTT DATA North America (Austin, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability. Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required. Automation &…
- NTT DATA North America (Austin, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability. Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required.…
- General Motors (Austin, TX)
- …and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General Motors, you'll join a dedicated team focused ... on enhancing the reliability, efficiency, and scalability of our distributed systems. We..., and reduce manual intervention. Observability and Monitoring: Lead, implement, and improve monitoring and observability frameworks, enabling…
- General Motors (Austin, TX)
- …and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General Motors, you'll join a dedicated team focused ... on enhancing the reliability, efficiency, and scalability of our distributed systems. We...and reduce manual intervention. **Observability and Monitoring**: Lead, implement, and improve monitoring and observability frameworks, enabling…
- Halliburton (Odessa, TX)
- …analytical, problem-solving, communication, and documentation skills. Ability to lead cross-functional efforts and drive reliability improvements across ... Reliability Engineer (Principal - Advisor) Artificial Lift Date:...- people who want to innovate, achieve, grow and lead. We attract and retain the best talent by…
- WestRock Company (Evadale, TX)
- Position: Senior Electrical Reliability Engineer Job Code: MEREP3 + Sr. Eng, Reliability Eng Location: Evadale, TX The Opportunity: As the Sr Engineer, Elec Rel, ... Mill, your primary responsibility is investigating and resolving electrical reliability issues within the facility. Maintain close working interaction with the…
- IKO (Ennis, TX)
- …by hiring people who hold these values. People like you! Job Description Role: Reliability Engineer *This is a Safety Sensitive position.* Job Summary: The ... Reliability Engineer is responsible for improving equipment performance, reducing...Prodac/Maximo interface, production loss and Maintenance cost data to lead Problem Solving / troubleshooting teams prioritized to generate…
- Trane Technologies (Tyler, TX)
- …On Fridays, choose your work location, balancing what your work requires! As a **Reliability Engineer**, you will play a key role in ensuring the reliable ... be to develop and implement strategies that enhance the reliability and performance of our products and critical components....failures (MTBF) and mean time to failure (MTTF). Lead continuous improvement projects aimed at reducing failures and…