Reinforcement Learning Engineer

AI & Machine Learning Temps plein Intermédiaire London, England, United Kingdom
Negotiable

Description du poste

Develop reinforcement learning systems for training and aligning AI models. Work on RLHF and advanced RL techniques.

Exigences

- MSc/PhD in ML, Robotics, or related
- 3+ years RL experience
- Strong knowledge of policy optimization
- Experience with RLHF, PPO, DPO
- Proficient in Python and RL frameworks
- Research publications a plus

Responsabilités

- Implement RL training pipelines
- Develop reward modeling systems
- Optimize RL algorithms
- Collaborate with safety team
- Research new RL techniques
- Document and share learnings

Avantages

- Salary £80,000 - £125,000
- Research-focused role
- Conference attendance
- Stock options
- Premium benefits
- Gym membership

Aperçu du poste

Type d'emploi Temps plein
Niveau d'expérience Intermédiaire
Lieu London, England, United Kingdom
Postes vacants 2

Postuler maintenant