Skip to main content
Posted 05 June, 2026
DeepMind

Senior Staff Research Scientist, Gemini Safety Post-Training, DeepMind

Mountain View CA USA Full Time

Minimum qualifications:

  • PhD in Computer Science, a related field, or equivalent practical experience.
  • 6 years of experience in Machine Learning Algorithms and Language Modeling.
  • One or more scientific publications in the ML/AI conferences or journals (e.g., NeurIPS, ICML, ICLR, CVPR).

Preferred qualifications:

  • 6 years of experience in ML research, with 3 years of experience shipping Reinforcement Learning-based (or equivalent) post-training pipelines.
  • 5 years of experience leading the cross-functional teams in complex, matrixed environments and ability to influence stakeholders, resolve incentives, and provide strategic technical judgment.
  • Ability to deploy the performance improvements in production foundation models.

About the job

As models become more agentic, executing long-horizon tasks, using tools, writing and running code, operating across multi-step workflows, the challenge of making them safe fundamentally changes. Surface-level safety methods (output filtering, refusal tuning, policy guardrails) were designed for single-turn interactions. They are not enough for agents that plan, act, and adapt over extended horizons.

We are looking for a Senior Staff Research Scientist to rethink safety post-training for this new reality. You will bring frontier post-training expertise, to develop training methods that make Gemini models deeply safe and aligned, especially in agentic settings. This role sits in Gemini Safety and partners closely with the Artificial General Intelligence (AGI) Safety team and the Gemini post-training organization.

Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.

We are pushing the boundaries across multiple domains. Our global teams offer learning opportunities and varied career pathways
for those driven to achieve exceptional results through collective effort.

Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $262000 - $365000 (USD) + 25% bonus target + bonus + equity + benefits

Learn more about benefits at Google.
Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $262000 - $365000 (USD) + 25% bonus target + bonus + equity + benefits

Learn more about benefits at Google.

Responsibilities

  • Rethink how safety is trained into models, especially for agentic, long-horizon behavior.
  • Design and ship post-training recipes (Reinforcement Learning (RL), Supervised Fine-Tuning (SFT), and beyond) that install safety and alignment properties into Gemini models. You own the path from research to production.
  • Build the metrics and evaluations that tell us whether training is actually making models safer in deployment, not just on benchmarks.
  • Work directly with the post-training pipeline and infrastructure. Partner with the AGI Safety team to bring alignment research into practical training. Translate between research and production.
  • Shape the road map for where safety post-training goes next. Build and grow the team to execute on it.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.