- Caris Life Sciences (Irving, TX)
- …and user support. **Job Responsibilities** + Installing and configuring Linux operating systems on HPC clusters, including network settings, storage ... Caris is where your impact begins.** **Position Summary** An HPC (High Performance Computing) Engineer is responsible...of computing resources. + Implementing security measures to protect HPC systems and data from unauthorized access.… more
- The University Of Texas At Dallas (Dallas, TX)
- …Job Description: Reporting to the Director of HPC Operations. This is a systems engineer with a background in a High Performance Computing environment and ... HPC ) resources and related research services. The engineer will demonstrate a customer service mindset and interact...support efforts, products and technologies. + Current knowledge of HPC best practice and systems deployment and… more
- Meta (Austin, TX)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Active ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: … more
- Meta (Austin, TX)
- **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting large scale AI Training and ... to Meta Silicon hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer , NPI AI Responsibilities: 1. Lead the bring-up, validation,… more
- Meta (Austin, TX)
- …Meta's custom AI hardware 2. Collect requirements and develop specifications for Rackscale AI/ HPC systems . 3. Develop and maintain code, to collect, analyze, and ... **Summary:** The Accelerator Reference Design Team is looking for a System Engineer to design, implement, and maintain hardware designs for custom AI hardware… more
- Meta (Austin, TX)
- …Meta's global data center networks. Our work covers the entire network lifecycle, including hardware development, capacity planning, distributed and centralized ... control systems , modeling/provisioning/automation, monitoring/troubleshooting/analytics, and simulation/design/failure analysis.We are actively seeking Software… more