Reinforcement Learning Engineer

AI & Machine Learning Tiempo completo Nivel medio London, England, United Kingdom
Negotiable

Descripción del puesto

Develop reinforcement learning systems for training and aligning AI models. Work on RLHF and advanced RL techniques.

Requisitos

- MSc/PhD in ML, Robotics, or related
- 3+ years RL experience
- Strong knowledge of policy optimization
- Experience with RLHF, PPO, DPO
- Proficient in Python and RL frameworks
- Research publications a plus

Responsabilidades

- Implement RL training pipelines
- Develop reward modeling systems
- Optimize RL algorithms
- Collaborate with safety team
- Research new RL techniques
- Document and share learnings

Beneficios

- Salary £80,000 - £125,000
- Research-focused role
- Conference attendance
- Stock options
- Premium benefits
- Gym membership

Resumen del puesto

Tipo de empleo Tiempo completo
Nivel de experiencia Nivel medio
Ubicación London, England, United Kingdom
Vacantes 2

Aplicar ahora