- Amazon (Arlington, VA)
- …fundamentals, experience building large-scale distributed systems and machine learning infrastructure. You take initiative to improve operational excellence. You're ... science or equivalent - 1+ years of building large-scale machine-learning infrastructure for online recommendation, ads ranking, personalization or search experience… more
- Oracle (Cheyenne, WY)
- **Job Description** OCI (Oracle Cloud) AI Infrastructure Innovation team is pioneering the creation of next-generation AI/HPC networking for GPU superclusters at ... to validate throughput, latency, and tail behavior. + Collaborate with GPU platform, storage, database, and control-plane teams to deliver end-to-end solutions and… more
- Amazon (Arlington, VA)
- …fundamentals, experience building large-scale distributed systems and machine learning infrastructure. You're a self-starter who thrives in fast-paced, collaborative ... environments, with strong verbal and written communication skills. Most importantly, you're passionate about solving complex problems on behalf of customers. In this role, you will: - Design, develop, test, deploy, deliver, and maintain large-scale, highly… more
- JPMorgan Chase (Jersey City, NJ)
- …in SRE principles, reliability, scalability and performance of application and infrastructure. + Have hands-on experience with cloud platforms (AWS, GCP, Azure) ... and IaC tools (Terraform, Ansible). + Extensive experience implementing advanced observability using tools like OpenTelemetry, Dynatrace, Grafana, and/or cloud-native services. + Experience in architecting distributed systems and cloud-native architecture in… more
- Cognizant (Bannockburn, IL)
- …applications to edge devices. + DevSecOps: Proficiency with CI/CD and infrastructure-as-code tools (e.g., Harness, GitHub, Terraform, SonarCloud). **Design Skills-** + ... Proven experience designing scalable, resilient, and secure applications. + Strong ability to lead design sessions and collaborate across engineering, product, and business teams. + Experience managing offshore build teams and driving project execution from… more
- OneMain Financial (Fort Mill, SC)
- …such as AWS Systems Manager (SSM), Azure Update Management, or infrastructure-as-code (IaC) automation tools (Terraform, Ansible). + Oversee patch and configuration ... compliance across various operating systems. + Ensure compatibility and performance validation before and after patch cycles. + Support and advise on cloud configuration hardening initiatives in parallel to patching activities. + Participate in technology… more
- Marriott (Bethesda, MD)
- …regarding degraded or missed service levels Coordinates with Operations and Infrastructure teams for deployment and production support activities **IT Governance** ... Follows all defined IR standards and processes (i.e., IT Governance, SM&G, Architecture, etc.), and provides input for improvements to the appropriate process owners as needed Maintains a proper balance between business and operational risk Follows the defined… more
- NVIDIA (Santa Clara, CA)
- …be doing: + Drive next generation fleet management solutions for scaling AI infrastructure using GPUs and Grace solution from Nvidia. Work with customers, product ... management and other architects to narrow down on requirements for implementation to ensure speed of light product development. + Bring up clarity on architecture for fleet health monitoring and fault-remediation solution at scale. Work with customers and… more
- Amazon (Cupertino, CA)
- …development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS, including support for customers ... Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating - that's why… more
- Vanguard (Malvern, PA)
- …organizational objectives. + Partner with CSOC and other stakeholders to advise on platform usage, threat detection, and incident response. + Act as the subject ... years of experience with CrowdStrike modules Deep expertise in CrowdStrike platform design, deployment, and operations Proven success in leading cross-functional… more
Recent Jobs
- Sr Software Engineer: The Walt Disney Company (Bristol, CT)
- Education Coordinator: HANAC, Inc. (Queens, NY)
- Senior Embedded Software Engineer: Carnegie Mellon University (Arlington, VA)
- Labor Efficiency Analysis Intern: Pacific Seafood (Kodiak, AK)