- 
        Senior Engineer-AI Inference
- Bank of America (Addison, TX)
- 
             Senior Engineer-AI Inference Addison, Texas;Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware To proceed with your application, you must be at least 18 years of age. Acknowledge Refer a friend To proceed with your application, you must be at least 18 years of age. Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior-Engineer-AI-Inference\_25029879) Job Description: At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day. Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates’ physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve. Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations. At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us! Position Summary: Join a groundbreaking team at Bank of America, at the forefront of innovation in AI. We are building the next generation of Gen AI platform, empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a passion for pushing the boundaries of what’s possible with AI. This position is focused on design, build, and serve the Gen AI inferencing capabilities. This job is responsible for defining and leading the engineering approach for complex features to deliver significant business outcomes. Key responsibilities of the job include delivering complex features and technology, enabling development efficiencies, providing technical thought leadership based on conducting multiple software implementations, and applying both depth and breadth in a number of technical competencies. Additionally, this job is accountable for end-to-end solution design and delivery. Responsibilities: + Ensures that the design and engineering approach for complex features are consistent with the larger portfolio solution + Define the technology tool stack for the solution and evaluate and adapt new testing tool/framework/practices for team(s) + Enables team(s)/applications with Continuous Integration/Continuous Development (CI/CD) capabilities and engages with other technical stakeholders pertaining to efficient functioning of CI-CD pipeline + Guides and influences team(s) on design and best practices for high code performance –e.g. pairing, code reviews + Provides end-to-end delivery of complex features, including automation, for either a single team or multiple teams, at the program level + Conducts research, design prototyping and other exploration activities such as evaluating new toolsets and components for release management, CI/CD, and features + Works with stakeholders to establish high-level solution needs and with architects for technical requirements + Collaborate with product teams, data analysts and data scientists to design and build solutions. + Design and execute the implementation plans to both move forward strategically, while at the same time ensuring the current technology stack is supporting current needs. + Manage multiple priorities, and simultaneously engage with multiple teams worldwide. + Be vocal and actively participate in all session with business stakeholders and agile teams. + Manage next generation of architectural decision for advanced analytics platform, create strategy, roadmaps, present to tech and non-tech leaders. + Coach and mentor team members. Required qualifications: + Minimum 8 years of relevant experience required. + Experience in Model Ops and design, software development with proven effectiveness in delivering technology in fast-paced, demanding, industry driven environment for AI/ML, and advanced analytics. + Hands on experience in both Python development on Linux. Strong understanding of modern open-source data science platform architecture for storage & compute separation, interactive development workbenches, containers, and toolsets such as Jupyter, VSCode etc. + Experience of data sources and Vector Store platforms such as Redis, Solar, Postgres DB, FAISS, Teradata, Oracle, SQL Server, Hadoop etc. + Experienced in using design patterns and following best software engineering practices. + An understanding of fundamental algorithms and ability to optimize existing code. + Proficient written and verbal communication skills to support and shape the platform and clearly articulate technical designs and concepts; and to communicate effectively with all levels within the organization. + Experience with deploying models using vLLM/Triton Inference Server + Performance Tuning those models and deployment to provide higher throughput. + Experience with various inference metrics, and related monitoring and observability. + Experience with serving multiple tenants/clients with model endpoints with secure boundaries. + Experience with Atheization & Authorization, Policy as Code, Systems Integration, and Model Routing + Model Evaluation frameworks to evaluate different models and their tradeoffs between efficiency and metrics. + Experience building RAG for various knowledge bases, and document types. + Model Monitoring – Ability to collect metrics to measure things like Model Drift, KPIs. + Self-starter with the ability to challenge conventions, excellent communication skills. + Strong analytical skills which enable ability to problem solve, apply reason, take initiative, use judgment, and perform concurrent tasks. + Follows Test Driven Development practices including continual integration and clean code principles. Desired Qualifications: + Experience developing Gen AI training and Inferencing platform with open-source model, Gen AI Model servicing capabilities, designing RAG frameworks, MCP modules for enterprise data systems. Skills: + Automation + Influence + Result Orientation + Stakeholder Management + Technical Strategy Development + Application Development + Architecture + Business Acumen + Risk Management + Solution Design + Agile Practices + Analytical Thinking + Collaboration + Data Management + Solution Delivery Process Shift: 1st shift (United States of America) Hours Per Week: 40 Bank of America and its affiliates consider for employment and hire qualified candidates without regard to race, religious creed, religion, color, sex, sexual orientation, genetic information, gender, gender identity, gender expression, age, national origin, ancestry, citizenship, protected veteran or disability status or any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other bases such as medical condition, marital status or any other factor that is irrelevant to the performance of our teammates. To view the "Know your Rights" poster, CLICK HERE (https://www.eeoc.gov/sites/default/files/2023-06/22-088\_EEOC\_KnowYourRights6.12.pdf) . View the LA County Fair Chance Ordinance (https://dcba.lacounty.gov/wp-content/uploads/2024/08/FCOE-Official-Notice-Eng-Final-8.30.2024.pdf) . Bank of America aims to create a workplace free from the dangers and resulting consequences of illegal and illicit drug use and alcohol abuse. Our Drug-Free Workplace and Alcohol Policy (“Policy”) establishes requirements to prevent the presence or use of illegal or illicit drugs or unauthorized alcohol on Bank of America premises and to provide a safe work environment. Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations. Should you be offered a role with Bank of America, your hiring manager will provide you with information on the in-office expectations associated with your role. These expectations are subject to change at any time and at the sole discretion of the Company. To the extent you have a disability or sincerely held religious belief for which you believe you need a reasonable accommodation from this requirement, you must seek an accommodation through the Bank’s required accommodation request process before your first day of work. This communication provides information about certain Bank of America benefits. Receipt of this document does not automatically entitle you to benefits offered by Bank of America. Every effort has been made to ensure the accuracy of this communication. However, if there are discrepancies between this communication and the official plan documents, the plan documents will always govern. Bank of America retains the discretion to interpret the terms or language used in any of its communications according to the provisions contained in the plan documents. Bank of America also reserves the right to amend or terminate any benefit plan in its sole discretion at any time for any reason. 
 
 
- 
        
Recent Jobs
- 
                
                    Senior Engineer-AI Inference
                
                - Bank of America (Addison, TX)
- 
                
                    Network Engineering Manager
                
                - Meta (Menlo Park, CA)
- 
                
                    Entry-Level Civil Engineer (Water/Wastewater) - Networking Event with AECOM - Los Angeles, California
                
                - AECOM (Los Angeles, CA)