-
Entry Level AI Performance Analyst
- IBM (Rochester, MN)
-
Introduction
Your Impact
As part of the team that designs machines that run many of the most demanding workloads used in Fortune-500 companies, you’ll play a key role in ensuring that our clients continue to be delighted by the performance of our systems both within their datacenters and in IBM’s public cloud. You’ll focus on workloads that leverage artificial intelligence to solve real business problems, such as in the Retrieval Augmented Generation paradigm. You’ll help define and run workloads using cutting-edge technology, and help identify and resolve performance challenges at all levels of the stack, right from the hypervisor through the operating system right to the application layer. In particular, you’ll be analyzing how the latest large language models, inferencing frameworks and model servers perform on our CPUs as well as on dedicated AI accelerators.
Your role and responsibilities
We’re seeking an enthusiastic computer engineer or software developer to help us ensure that customers are delighted by the performance of AI-infused workloads running on our systems.
Description
In this role, you’ll be running workloads that simulate how clients apply artificial intelligence technologies to solve their problems. You’ll develop experimental plans to evaluate the performance of components like vector databases, vector encoders and of course various large language models, and how they run in diverse model serving frameworks and execution environments. You’ll study performance on CPUs as well as on off-chip accelerators. You’ll help us define targets for performance, and, when they’re not met, you’ll dive deeply in to the stack and partner with developers and engineers from the hypervisor right through to model designers to identify and test solutions.
Required technical and professional expertise
* Minimum BS or MS degree in Computer Engineering, electrical engineering, computer science or a related technical discipline or equivalent experience.
* Demonstrated understanding of modern artificial intelligence technologies such as large language models and vector encoders
* Demonstrated understanding of micro-architecture design, memory layout, multi-threading, I/O buses
* Experience with deploying, tuning and profiling applications running in Kubernetes environments
* Experience deploying applications on at least one public cloud
* Extensive experience with Python
* Knowledge of database design and some exposure to SQL is desirable
* Ability to work in a team and network with people outside of the team and effectively communicate in written and verbal presentations is essential.
Preferred technical and professional experience
* Agile/ Scrum methodology experience
* Experience with Ansible or other automation framework
* Experience with C/C++
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
-
Recent Searches
- Entry Research Assistant (Ohio)
- Automation Architect Lead FAA (United States)
- LTS Instructional Teaching Assistant (United States)
- Fire Installation Tech (Washington)
Recent Jobs
-
Entry Level AI Performance Analyst
- IBM (Rochester, MN)
-
Senior Director, Research Finance
- Tufts Medicine (Boston, MA)
-
Assembler - 2nd Shift (Dept 02)
- Eaton Corporation (South Milwaukee, WI)
-
Senior Principal Reliability Engineer
- RTX Corporation (Tucson, AZ)