- 
        Sr. AWS Cloud & DevOps Architect - Remote
- McAfee, Inc. (San Jose, CA)
- 
             _Job Title:_ Sr. AWS Cloud & DevOps Architect - Remote _Role Overview:_ We seek a highly skilled and experienced Sr. AWS Cloud & DevOps Architect to join our dynamic team. You will design, implement, and manage scalable, secure, and cost-effective cloud solutions using Amazon Web Services (AWS) in this role. You will lead the Cloud & DevOps practices, ensuring seamless integration, delivery, and deployment processes across various environments. The role will also forge the strategy for AIOps through AI/ML and NoOps, delivering strategic innovation to improve availability, stability, security and resiliency. The Architect will be the primary contact for other Technology & Eng teams within the organization for all matters related to building and optimizing Cloud Infrastructure and Services. You must partner with other leaders across the business to identify opportunities and risks to develop & deliver solutions that support business strategies. This is a remote position in the United States. We will only consider candidates currently in the United States and are not offering relocation assistance at this time. About the Role: + Design and architect complex AWS cloud solutions that are secure, scalable, and cost-effective. + Lead the design and implementation of AWS infrastructure and automation solutions. + Collaborate with cross-functional teams to define infrastructure requirements and ensure alignment with business goals. + Forge the strategy for AIOps by integrating AI/ML and NoOps principles to drive intelligent automation across cloud operations. + Leverage machine learning models to predict incidents, automate root cause analysis, and proactively remediate issues. + Design and implement self-healing systems that improve availability, stability, and resiliency of cloud infrastructure. + Collaborate with data science teams to integrate ML pipelines into DevOps workflows for continuous learning and optimization. + Utilize generative AI services such as AWS Bedrock, Amazon Nova (nova-micro, nova-lite), and foundation models like Anthropic Claude (claude-3-haiku, claude-sonnet-4) to enhance operational intelligence and automation. + Lead the development and implementation of CI/CD pipelines for continuous integration, testing, and deployment. + Oversee & also be hands-on in writing IAC and other automation using tools like Terraform, CloudFormation, Ansible, Python, Bash, or PowerShell + Implement automated monitoring, logging, and alerting solutions to maintain system health and security + Drive SRE practices by implementing strategies that improve reliability, availability, and scalability of cloud infrastructure. + Develop and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to ensure service reliability. + Lead incident response efforts, ensuring quick resolution and thorough post-mortems to prevent future issues. + Implement chaos engineering practices to proactively identify and resolve potential points of failure. + Implement and enforce security best practices across AWS environments, including identity and access management, encryption, and network security. + Ensure compliance and regulatory requirements, such as GDPR, HIPAA,PCI and SOC 2. + Conduct & participatein regular security audits and vulnerability assessments. + Monitor performance and recommend improvements to optimize resource usage and costs. + Troubleshoot and resolve infrastructure-related issues promptly. + Evaluate and implement new tools and technologies to improve the efficiency and reliability of cloud operations. + Develop and implement strategies for cost forecasting, monitoring, and optimization of AWS resources. + Continuously analyze and optimize AWS spending, leveraging reserved instances, spot instances, and other cost-saving opportunities. + Provide insights and recommendations on cost-saving measures without compromising performance or security. + Generate regular reports on cloud spending and budget forecasts, and present them to stakeholders. + Work closely with development, operations, and product teams to ensure seamless integration of cloud and DevOps practices. + Mentor and guide junior engineers in cloud architecture, DevOps, and SRE best practices. + Act as a subject matter expert on AWS cloud solutions, DevOps, and SRE practices within the organization. + Maintain comprehensive documentation of cloud architecture, configuration, and processes. Provide regular reports on infrastructure performance, costs, and security to management. _About you:_ + Bachelor’s degree in Computer Science, Information Technology, or a related field. Advanced degree preferred. + 7+ years of experience in cloud architecture and DevOps, with at least 5 years focused on AWS. + Strong expertise in AWS services, including EC2, S3, RDS, Lambda, VPC, IAM, CloudWatch, and more. + Proficiency in infrastructure as code tools such as Terraform, CloudFormation, and Ansible. + Experience with CI/CD tools like Jenkins, GitLab, or AWS CodePipeline. + Solid understanding of networking concepts, DNS, load balancing, and VPNs. + Experience with containerization technologies such as Docker and Kubernetes. + In-depth knowledge of security best practices, including IAM, encryption, and security group management. + Strong scripting skills in Python, Bash, or PowerShell. + Experience in working with SQL, no-SQL and big data frameworks, MS-SQL, Casandra, MongoDB, Apache spark, Hadoop etc., middleware like Kafka, Flume, MQTT, Redis etc. + Experience with deployment strategies such as blue/green, canary, etc + Experienced in implementing security best practices, vulnerability management and patching strategies + Excellent problem-solving skills and the ability to troubleshoot complex issues. + Proven ability to lead and mentor technical teams. + Strong communication skills, with the ability to explain complex technical concepts to non-technical stakeholders. + AWS Certified Solutions Architect – Professional or AWS Certified DevOps Engineer – Professional is highly desirable. + Familiarity with other cloud platforms (e.g., Azure, Google Cloud) is a plus. + Leadership and mentorship capabilities, with experience guiding and developing technical teams. + Ability to work independently and manage multiple priorities in a fast-paced environment. + Need extensive experience on CNCF projects + We prefer candidates with Hands-on knowledge with Kubernetes/EKS (cluster upgrades/resource quota/sidecar injector/cluster-autoscaler/topology-hints/node groups/HPA/RBAC/secret management drivers/storage drivers/DaemonSets/CoreDNS/security best practices) + Experience implementing Calico/others CNI, network policies and best practices + We prefer you to have experience implementing service mesh(Istio), virtual services, telemetry, policy enforcements and security best practices, kiali/jaeger/opensearch/prometheus/grafana for observability + We prefer you to have experience with event-driven(preferably kafka) autoscaling with Keda + We prefer you to have experience with Kong(various plugins) and other API gateways, reverse proxies, nginx skills, familiarity with managed and unmanaged load-balancers (layer 4/7), throttling and controls + We prefer you to have experience with monitoring and logging tools like Prometheus, Grafana, or ELK Stack. + We prefer you to have experience in serverless architecture and microservices design patterns. + Preferred experience with AWS cost management tools like AWS Cost Explorer, Trusted Advisor, or third-party tools for cost analysis and optimization. \#LI-Remote _Company Overview_ McAfee is a leader in personal security for consumers. Focused on protecting people, not just devices, McAfee consumer solutions adapt to users’ needs in an always online world, empowering them to live securely through integrated, intuitive solutions that protects their families and communities with the right security at the right moment. _Company Benefits and Perks:_ We work hard to embrace diversity and inclusion and encourage everyone at McAfee to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees. + Bonus Program + 401k Retirement Plan + Medical, Dental, Vision, Basic Life, Short Term Disability and Long-Term Disability Coverage + Paid Parental Leave + Support for Community Involvement + 14 Paid Company Holidays + Unlimited Paid Time Off for Exempt Employees + 96 Hours of Sick Time and 120 Hours of Vacation for Non-Exempt Employees Accrued Each Year We're serious about our commitment to diversity which is why McAfee prohibits discrimination based on race, color, religion, gender, national origin, age, disability, veteran status, marital status, pregnancy, gender expression or identity, sexual orientation or any other legally protected status. The starting pay range for this position is $135,910.00-$223,285.00. McAfee takes into consideration an individual’s skillset, experience and location in making final salary determinations. For further details, please discuss with the Talent Acquisition Partner. Please click here (https://www.mcafee.com/content/dam/consumer/en-us/docs/legal/mcafe-job-applicant-ccpa-notice.pdf) to view and download the Job Applicant Privacy Notice, which applies to all McAfee job applicants who are residents of the state of California. 
 
 
- 
        
Recent Jobs
- 
                
                    Sr. AWS Cloud & DevOps Architect - Remote
                
                - McAfee, Inc. (San Jose, CA)
- 
                
                    Subcontracts Project Manager
                
                - Textron (New Orleans, LA)