- NTT America, Inc. (Plano, TX)
- …system monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and document post-incident reviews for ... major Linux-related issues. Execute patch management, OS and kernel upgrades, and regular system maintenance. Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure. Participate in on-call rotation and after-hours… more
- NTT America, Inc. (Plano, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required. Automation &… more
- NTT America, Inc. (Plano, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required. **Automation &… more
- NTT America, Inc. (Plano, TX)
- …system monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and document post-incident reviews for ... major Linux-related issues. Execute patch management, OS and kernel upgrades, and regular system maintenance. Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure. Participate in on-call rotation and after-hours… more
- NTT DATA North America (Austin, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability . Lead root cause analysis and post-mortem documentation for ... major incidents. Execute patch management, upgrades, and regular maintenance activities. Develop and maintain backup, disaster recovery, and failover strategies and operations. Participate in on-call rotation and after-hours support as required. Automation &… more
- NTT DATA North America (Austin, TX)
- …maintain monitoring, alerting, and logging solutions to ensure high availability and reliability . + Lead root cause analysis and post-mortem documentation for ... major incidents. + Execute patch management, upgrades, and regular maintenance activities. + Develop and maintain backup, disaster recovery, and failover strategies and operations. + Participate in on-call rotation and after-hours support as required.… more
- General Motors (Austin, TX)
- …and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General motors, you'll join a dedicated team focused ... on enhancing the reliability , efficiency, and scalability of our distributed systems. We..., and reduce manual intervention. + Observability and Monitoring: Lead , Implement and improve monitoring and observability frameworks, enabling… more
- General Motors (Austin, TX)
- …and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General motors, you'll join a dedicated team focused ... on enhancing the reliability , efficiency, and scalability of our distributed systems. We...and reduce manual intervention. + **Observability and Monitoring** : Lead , Implement and improve monitoring and observability frameworks, enabling… more
- Halliburton (Odessa, TX)
- …analytical, problem-solving, communication, and documentation skills. + Ability to lead cross-functional efforts and drive reliability improvements across ... Reliability Engineer (Principal - Advisor) Artificial Lift Date:...- people who want to innovate, achieve, grow and lead . We attract and retain the best talent by… more
- WestRock Company (Evadale, TX)
- Position: Senior Electrical Reliability Engineer Job Code: MEREP3 + Sr. Eng, Reliability Eng Location: Evadale, TX The Opportunity: As the Sr Engineer, Elec Rel, ... Mill, your primary responsibility is investigating and resolving electrical reliability issues within the facility. Maintain close working interaction with the… more