How our sorting works

The order in which job vacancies are displayed is determined by a composite score based on the following factors:

  • Keyword Relevance: How well your search terms match the vacancy details. We prioritize matches found in the job title, followed by job requirements, location names, and educational levels. Matches within general employer information or the organization's name carry a lower weight.
  • Commercial Prioritization (Premium Jobs): Vacancies paid for by employers ('Premium' or 'Sponsored') receive a ranking boost and will appear higher in the search results.
  • Recency (Date Relevance): Newer vacancies are prioritized. The relevance score of a vacancy is reduced by half once the posting is older than 30 days.
  • Proximity (Distance Relevance): Vacancies located closer to your search location are ranked higher. For vacancies located more than 30 km from the search center, the relevance score is halved.
The final ranking is established by multiplying all these individual factors to calculate the total relevance score.
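
The multiplicative scoring described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Jobbird's actual implementation: the field weights, the premium boost value, the substring-based keyword matching, and all names (`FIELD_WEIGHTS`, `PREMIUM_BOOST`, `Vacancy`, `total_score`) are hypothetical. The description gives only the ordering of the fields, the two halving rules (older than 30 days, farther than 30 km), and the fact that the factors are multiplied.

```python
from dataclasses import dataclass

# Hypothetical weights: the description only states the priority order
# (title > requirements > location > education > employer information).
FIELD_WEIGHTS = {
    "title": 4.0,
    "requirements": 3.0,
    "location": 2.0,
    "education": 1.5,
    "employer": 1.0,
}

# Assumed value: the description only says premium vacancies get a boost.
PREMIUM_BOOST = 1.5


@dataclass
class Vacancy:
    fields: dict        # field name -> text of that field
    premium: bool       # paid ('Premium' or 'Sponsored') vacancy
    age_days: float     # days since the vacancy was posted
    distance_km: float  # distance from the search location


def keyword_relevance(vacancy, terms):
    """Sum the field weight for every search term found in a field."""
    score = 0.0
    for field, weight in FIELD_WEIGHTS.items():
        text = vacancy.fields.get(field, "").lower()
        score += weight * sum(1 for term in terms if term.lower() in text)
    return score


def total_score(vacancy, terms):
    """Multiply the four factors into a single relevance score."""
    keyword = keyword_relevance(vacancy, terms)
    premium = PREMIUM_BOOST if vacancy.premium else 1.0
    recency = 0.5 if vacancy.age_days > 30 else 1.0         # halved after 30 days
    proximity = 0.5 if vacancy.distance_km > 30 else 1.0    # halved beyond 30 km
    return keyword * premium * recency * proximity
```

Because the factors are multiplied rather than added, a vacancy that is both older than 30 days and farther than 30 km away keeps only a quarter of its keyword score, and a vacancy with no keyword match scores zero regardless of any premium boost.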

    Member of Technical Staff | Large Language Models | Reinforcement Learning | Post-Training | Pre-Training | Long-Context Reasoning | London, Full-Time

    Enigma London
    new
    Status Open

    What we ask

    Education

    No minimum education required

    What we offer

    Salary

    Job description

    Member of Technical Staff | Large Language Models | Reinforcement Learning | Post-Training | Pre-Training | Long-Context Reasoning | London, Full-Time


    Company Overview

    We are a frontier AI research and product company building the next generation of autonomous and interpretable AI systems.


    Founded by an experienced team of researchers and operators from leading AI organisations, we are rapidly scaling our research, engineering, and product efforts. Our goal is to build one of the highest-performing and most talent-dense AI teams in Europe.


    Based in London, we are looking for ambitious and highly capable individuals who share our vision of a future where humans interact continuously, safely, and productively with autonomous AI agents. We value self-starters who can take ownership, move quickly, and thrive in a fast-paced, high-growth environment.


    The Role

    Members of Technical Staff operate as high-agency generalists. You will be expected to own projects end-to-end while also contributing across multiple initiatives as needed. Because we work at the frontier of several technical domains, the ability to learn quickly and adapt on the job is essential.


    We are currently a small but rapidly growing team. Early hires will play a key role in shaping both the technical direction and company culture. We are hiring across a range of seniority levels, from experienced technical experts to highly driven early-career candidates. The organisation operates with a flat structure and provides significant autonomy and ownership.


    What You Might Work On

    We are hiring both deep technical specialists and exceptionally fast-learning generalists. Areas of work may include:


    Memory, Retrieval, and Long-Context Reasoning

    • Learned retrieval systems
    • Fast adaptation and meta-learning
    • KV-cache reuse and compression
    • Context distillation
    • Unified retrieval and reasoning systems


    Agentic Systems, Reinforcement Learning, and Self-Improvement

    • Tool use and multi-step reasoning
    • Preference learning and reward modelling
    • LLM-as-judge systems
    • Process reward models
    • Synthetic trajectory generation
    • Offline-to-online reinforcement learning
    • RL in unverifiable domains


    World Models, Simulators, and Optimisation Loops

    • Learned simulators of real deployment environments
    • Agents that propose experiments and optimise prompts, policies, and data pipelines
    • Agentic approaches to data science and experimentation


    Infrastructure, Training, and Observability

    • Distributed training systems
    • Efficient inference and quantisation
    • Observability, auditing, and rollback tooling for safe deployment


    About You

    • You have experience at a leading research lab, high-growth startup, or similarly ambitious and fast-paced environment.
    • You move quickly and have a track record of producing outsized results in short periods of time.
    • You think from first principles about what the company needs, not just the task immediately in front of you.
    • You are ambitious, competitive, and motivated by solving difficult problems.
    • You are a self-starter who operates effectively with limited supervision or incomplete specifications.


    Practicalities

    Location: London-based, with a strong preference for in-person collaboration and culture-building.

    Compensation: Competitive salary and equity package.



    About the employer

    Enigma
    London, England

    © 2026 Jobbird