Technical Program Manager III, GPU Infrastructure Reliability, Google Cloud

Google

Google is a leading technology company that focuses on creating opportunities through innovative products. As a Technical Program Manager, you will lead complex projects in ML workload monitoring and diagnostics, collaborating with cross-functional teams to ensure successful project outcomes and drive the development of scalable solutions.

Responsibilities

  • Collaborate with cross-functional teams to define project scope, goals, and deliverables. Develop detailed project plans, identify dependencies, and manage timelines
  • Communicate with stakeholders across engineering, product, and research to ensure alignment and drive progress
  • Identify and mitigate risks to project success and delivery. Drive projects to completion, ensuring high-quality results
  • Understand the technical aspects of ML workload monitoring and diagnostics, including distributed systems, performance optimization, and ML model convergence
  • Work with engineers, researchers, and product managers to translate business requirements into technical solutions

Skills

  • Bachelor's degree in a technical field, or equivalent practical experience
  • 5 years of experience in program management
  • Experience with infrastructure reliability
  • Experience with GPUs or GPU Systems
  • 5 years of experience managing cross-functional or cross-team projects
  • 5 years of experience in technical program management, with a focus on software engineering and ML infrastructure projects
  • Knowledge of software development, distributed systems, and ML infrastructure or GPU systems
  • Ability to think critically and solve problems
  • Excellent project management skills, and experience with project planning, execution, and risk management
  • Excellent communication and collaboration skills, with the ability to build relationships and influence across all levels of the organization

Benefits

  • Health, dental, vision, life, and disability insurance
  • Retirement Benefits: 401(k) with company match
  • Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
  • Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
  • Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
  • Baby Bonding Leave: 18 weeks
  • Holidays: 13 paid days per year

Company Overview

  • Google specializes in internet-related services and products, including search, advertising, and software. It is a subsidiary of Alphabet, was founded in 1998, and is headquartered in Mountain View, California, USA, with more than 10,000 employees. Its website is https://www.google.com.

Company H1B Sponsorship

  • Google has a track record of offering H1B sponsorships, with 8,763 in 2025, 8,872 in 2024, 9,682 in 2023, 11,626 in 2022, 9,109 in 2021, and 9,785 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Job Type

Job Type: Full Time
Location: Sunnyvale, CA
