- NVIDIA (Santa Clara, CA)
- …systems like Ray, Spark Rapids + Familiarity with metrics collection, health monitoring, and observability tools + Building, operating and maintaining full stack ... software deployments coupled with excellent software programming skills + Master's or Bachelor's degree in Computer Science or Electrical Engineering or CE or equivalent experience. + A minimum of 5 years' experience with a background in software engineering and…
- Coinbase (Sacramento, CA)
- …Obsess over data quality and reliability: Create and enforce rigorous testing, monitoring, and validation frameworks to ensure data is accurate, consistent, and ... trusted at all times. * Develop deep domain expertise: Ensure your team deeply understands finance, CX, HR, compliance and product data, building targeted data marts and tools that solve real business problems. * Leverage AI and LLMs: Investigate how LLMs…
- TEKsystems (Beverly Hills, CA)
- …performance across all locations. Hands-on experience with IT infrastructure monitoring and logging solutions, ensuring comprehensive tracking, analysis, and ... reporting of system performance, security events, and operational health. Exceptional interpersonal skills with the ability to build positive relationships across teams. Strong communication abilities, both verbal and written, to effectively convey technical…
- Coinbase (Sacramento, CA)
- …cause analysis, and blameless retrospectives * Define metrics and bolster monitoring/observability across corporate IAM systems * Participate in regular on-call ... rotation to ensure 24x7 uptime for critical systems * What We Look For In You: * 5+ years of experience building, iterating upon, and maintaining corporate IAM systems * 5+ years of experience with operational procedures and application development * Deep…
- Coinbase (Sacramento, CA)
- …of handling high throughput and low latency * Experience with observability and monitoring systems such as Kibana, Datadog, etc. * Familiarity with working in rapid ... growth environments * Experience in Ruby, Go, and Terraform * Experience with AWS, GCP, Azure, or other cloud environment * Experience designing and building reliable systems * Experience working in a highly regulated environment * Experience writing…
- Walmart (San Bruno, CA)
- …for scalable partner engagement - including API usage tracking, performance monitoring, and analytics infrastructure. + Engage with internal cross-functional teams ... to ensure alignment on product changes, integration processes, and technical developments with clear and consistent communication + Lead thought leadership initiatives, surfacing new strategic opportunities and promoting innovative thinking within the team…
- Amazon (Cupertino, CA)
- …data center. After launch you will oversee the fleet of servers you develop, monitoring their quality and how they are meeting the customer requirements. This is a ... fast-paced, intellectually challenging position, and you'll work with thought leaders in multiple technology areas. You'll have high standards for yourself and everyone you work with, and you'll be constantly looking for ways to improve your products'…
- Google (Sunnyvale, CA)
- …+ Build security tools and processes for critical infrastructure protection, monitoring, and remediation. + Transform Access Security Engineering by leveraging ... recent advances in AI. Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age,…
- Google (Sunnyvale, CA)
- …launch reviews. Maintain services once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through ... mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. + Practice sustainable incident response and blameless postmortems. Google is proud to be an equal opportunity workplace and is an affirmative action…
- Palo Alto Networks (Santa Clara, CA)
- …in infrastructure as code (e.g., Terraform, Ansible), CI/CD pipelines, and monitoring/observability tools. + Expertise in SQL programming and database management ... systems (e.g., BigQuery). + Hands-on experience with ETL tools and technologies (e.g., Apache Spark, Apache Airflow). + Experience with cloud platforms such as Google Cloud Platform (GCP), and experience with relevant services (e.g., GCP Dataflow, GCP DataProc,…