Research Engineer / Research Scientist, Finetuning
Anthropic
- USA
- Permanent
- Full-time
- Help develop novel finetuning techniques to improve language model behavior and make models more helpful, honest, and harmless
- Test out techniques like constitutional AI at scale and measure their impacts on model behavior
- Build tooling and infrastructure to enable efficient finetuning experiments on large language models
- Develop novel prompts and prompting strategies to improve and test model behaviors
- Run experiments that feed into key AI research and safety efforts at Anthropic
- Have significant Python, machine learning, research engineering, or research experience
- Prefer fast-moving collaborative projects with concrete goals that involve improving model behaviors
- Are results-oriented, with a bias towards flexibility and impact
- Pick up slack, even if it goes outside your job description
- Care about the impact of AI and of your work
- Have prior experience with large language model finetuning techniques such as RLHF
- Have experience with complex shared codebases and RL infrastructure
- Have experience authoring research papers in machine learning, NLP, or AI alignment, or have similar industry experience