Member of Technical Staff | Large Language Models | Reinforcement Learning | Post-Training | Pre-Training | Long-Context Reasoning | London, Full-Time
Company Overview
We are a frontier AI research and product company building the next generation of autonomous and interpretable AI systems.
Founded by an experienced team of researchers and operators from leading AI organisations, we are rapidly scaling our research, engineering, and product efforts. Our goal is to build one of the highest-performing and most talent-dense AI teams in Europe.
Based in London, we are looking for ambitious and highly capable individuals who share our vision of a future where humans interact continuously, safely, and productively with autonomous AI agents. We value self-starters who can take ownership, move quickly, and thrive in a fast-paced, high-growth environment.
The Role
Members of Technical Staff operate as high-agency generalists. You will be expected to own projects end-to-end while also contributing across multiple initiatives as needed. Because we work at the frontier of several technical domains, the ability to learn quickly and adapt on the job is essential.
We are currently a small but rapidly growing team. Early hires will play a key role in shaping both the technical direction and company culture. We are hiring across a range of seniority levels, from experienced technical experts to highly driven early-career candidates. The organisation operates with a flat structure and provides significant autonomy and ownership.
What You Might Work On
We are hiring both deep technical specialists and exceptionally fast-learning generalists. Areas of work may include:
Memory, Retrieval, and Long-Context Reasoning
- Learned retrieval systems
- Fast adaptation and meta-learning
- KV-cache reuse and compression
- Context distillation
- Unified retrieval and reasoning systems
Agentic Systems, Reinforcement Learning, and Self-Improvement
- Tool use and multi-step reasoning
- Preference learning and reward modelling
- LLM-as-judge systems
- Process reward models
- Synthetic trajectory generation
- Offline-to-online reinforcement learning
- RL in unverifiable domains
World Models, Simulators, and Optimisation Loops
- Learned simulators of real deployment environments
- Agents that propose experiments and optimise prompts, policies, and data pipelines
- Agentic approaches to data science and experimentation
Infrastructure, Training, and Observability
- Distributed training systems
- Efficient inference and quantisation
- Observability, auditing, and rollback tooling for safe deployment
About You
- You have experience at a leading research lab, high-growth startup, or similarly ambitious and fast-paced environment.
- You move quickly and have a track record of producing outsized results in short periods of time.
- You think from first principles about what the company needs, not just the task immediately in front of you.
- You are ambitious, competitive, and motivated by solving difficult problems.
- You are a self-starter who operates effectively with limited supervision or incomplete specifications.
Practicalities
Location: London-based, with a strong preference for in-person collaboration and culture-building.
Compensation: Competitive salary and equity package.
Member of Technical Staff | Large Language Models | Reinforcement Learning | Post-Training | Pre-Training | Long-Context Reasoning | London, Full-Time