- DoorDash (San Francisco, CA)
- …experience managing large fleets of database clusters, including building tools, monitoring , capacity planning, backups and disaster recovery strategies. + You are ... proficient in system-level fundamentals including operating systems, hardware, and networking. + You understand the tradeoffs of distributed system consistency, failure modes, and partition tolerance. + You have contributed to or are interested in contributing… more
- NVIDIA (Santa Clara, CA)
- …software related to managing fleets of GPU nodes. + Implementing monitoring and health management capabilities that enable industry leading reliability, ... availability, and scalability of GPU assets. You will be harnessing multiple data streams, ranging from GPU hardware diagnostics to cluster and network telemetry. + Working with teams across NVIDIA to ensure production AI clusters run reliability and… more
- Walmart (Red Bluff, CA)
- …needs; determining and carrying out necessary processes and practices; monitoring progress and results; recognizing and capitalizing on improvement opportunities; ... and adapting to competing demands, organizational changes, and new responsibilities. Coordinates, completes, and oversees job-related activities and assignments by developing and maintaining relationships with key stakeholders; supporting plans and initiatives… more
- pony.ai (Fremont, CA)
- …is accurate, synchronized, and reliable, including calibration, error detection, and health monitoring . + Integrate sensor data into the perception stack and build ... efficient data flows that power real-time algorithms. + Preprocess multi-sensor inputs to improve perception performance, such as time synchronization and ground detection. + Contribute to the overall perception pipeline, from raw sensor integration to… more
- Walmart (Sunnyvale, CA)
- …management system. You'll independently handle high impact, critical software/systems monitoring issues, troubleshoot business and production issues. As a member ... of this fast-moving and highly entrepreneurial team, you'll be able to say that you work for the worlds largest retailer and contribute to innovation and development to best-in-class methodologies that impacted perception and drastically changed business as we… more
- Abbott (Alameda, CA)
- …products based on the FreeStyle Libre platform, the world's #1 continuous glucose monitoring (CGM) solution. This position will be based out of Abbott Diabetes Care ... Division in Alameda, CA. The position will be responsible for leading and strategizing CAPA projects through its entire life cycle, and mentor quality engineers. The position will help being new insights to effectively strategize and implement continuous… more
- Cardinal Health (Sacramento, CA)
- …Fiori, CRM, SCM, Portal, PO,BI, Netweaver, SLT, SideCar) + Daily administration, monitoring , troubleshooting and tuning of a complex SAP production environment. + ... Analyzes production and test system Problems, determines causes and takes timely corrective action. + Works with the Unix/Linux/Windows Administrator on capacity planning and performance tuning of servers hosted on GCP cloud. GCP/AWS cloud experience is… more
- Noblis (San Diego, CA)
- …implement robust storage solutions, backup and recovery systems, and network monitoring capabilities + Create comprehensive test plans and conduct thorough testing ... of software solutions + Automate repetitive tasks through efficient scripting to improve productivity and reduce human error + Collaborate with cross-functional teams to integrate software solutions within the broader system architecture + Document processes,… more
- Microsoft Corporation (Mountain View, CA)
- …and performance of products while also driving consistency in monitoring and operations at scale. **Qualifications** **Required Qualifications:** + Enrolled ... in a full time bachelor's or master's program in Computer Science, Engineering, or related field during the academic term immediately before the internship. + Must have at least 1 semester/term remaining following the completion of the internship. + One year… more
- Abbott (Livermore, CA)
- …mobile and web platforms. * Drive operational excellence through automation, monitoring , and continuous improvement of engineering processes. * Collaborate closely ... with Directors and VPs to influence technical strategy and make high-impact architectural decisions. * Mentor and grow software engineers and junior database engineers, fostering a culture of ownership, innovation, and technical rigor. **Required… more