Research Engineer / Research Scientist, Finetuning

Anthropic

  • USA
  • Permanent
  • Full-time
  • 20 days ago
You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an engineer. As a Research Scientist or Research Engineer on the Finetuning team, you'll contribute to research on improving language models through techniques like constitutional AI. You will have the opportunity to do creative, cutting-edge research on frontier models, and to see your work result in concrete improvements in performance and safety.We generally expect research scientists to be able to iterate on their own experiments. We also provide opportunities for engineers to pursue their own research projects. Therefore this role can be more research oriented or more engineering oriented, depending on the experience and interests of the candidate.Note: Currently, the team has a preference for candidates who are able to be based in the Bay Area. However, we remain open to any candidate who can meet the organization's 25% in-person policy.Representative projects:
  • Help develop novel finetuning techniques to improve language model behavior and make models more helpful, honest, and harmless
  • Test out techniques like constitutional AI at scale and measure their impacts on model behavior
  • Build tooling and infrastructure to enable efficient fine-tuning experiments on large language models
  • Develop novel prompts and prompting strategies to improve and test model behaviors
  • Run experiments that feed into key AI research and safety efforts at Anthropic
You may be a good fit if you:
  • Have significant Python, machine learning, research engineering, or research experience
  • Prefer fast-moving collaborative projects with concrete goals that involve improving model behaviors
  • Are results-oriented, with a bias towards flexibility and impact
  • Pick up slack, even if it goes outside your job description
  • Care about the impact of AI and of your work
Strong candidates may also:
  • Haver prior experience with large language model finetuning techniques such as RLHF
  • Have experience with complex shared codebases and RL infrastructure
  • Have experience authoring research papers in machine learning, NLP, or AI alignment or similar industry experience
Deadline to apply: None. Applications will be reviewed on a rolling basis.

Anthropic