{"id":1242,"date":"2026-03-29T12:30:56","date_gmt":"2026-03-29T12:30:56","guid":{"rendered":"https:\/\/coderseditor.com\/itjobs\/?post_type=job_listing&#038;p=1242"},"modified":"2026-03-29T12:31:07","modified_gmt":"2026-03-29T12:31:07","slug":"backend-software-engineer-research-team","status":"publish","type":"job_listing","link":"https:\/\/coderseditor.com\/itjobs\/job\/backend-software-engineer-research-team\/","title":{"rendered":"Backend Software Engineer (Research team)"},"content":{"rendered":"<div class=\"section page-centered\" data-qa=\"job-description\">\n<div>ABOUT THE OPPORTUNITY<\/div>\n<div><\/div>\n<div>We\u2019re looking for <strong>Backend Software Engineers <\/strong>who are excited to build tools for frontier AGI safety research, e.g. building and maintaining evals libraries and tools for monitoring and controlling our own LLM traffic.<\/div>\n<div><\/div>\n<div>REPRESENTATIVE PROJECTS<\/div>\n<div><\/div>\n<div>Here is a list of example projects which you might build and ship in your first 6 months.<\/div>\n<div><\/div>\n<div>&#8211; Internal tooling for efficiently running and analyzing evaluations. For example, a tool that quickly investigates thousands of agentic eval runs in parallel and surfaces interesting information automatically<\/div>\n<div>&#8211; Automated evaluation pipelines to minimize the time from getting access to a new model for pre-deployment testing to analyzing the most important results and sharing them<\/div>\n<div>&#8211; Orchestration tools that allow researchers to run thousands of agentic evaluations in parallel on remote machines with high security and reliability<\/div>\n<div>&#8211; LLM proxy service that enables us to monitor all of our coding agent traffic in real time and identify undesired behavior automatically (in the spirit of <a class=\"postings-link\" href=\"https:\/\/arxiv.org\/abs\/2312.06942\" rel=\"noopener noreferrer\">Control<\/a>)<\/div>\n<div>&#8211; LLM agents and MCP tools to automate internal software engineering and research tasks, with sandboxes to prevent <a class=\"postings-link\" href=\"https:\/\/news.ycombinator.com\/item?id=44646151\" rel=\"noopener noreferrer\">major failures<\/a><\/div>\n<div>&#8211; CI pipeline optimisations to reduce execution time and eliminate flaky tests<\/div>\n<div>&#8211; Telemetry API and instrumentation of our existing tools, allowing us to monitor usage and improve reliability<\/div>\n<div>&#8211; Data warehousing pipeline and service to store thousands of eval transcripts which researchers can study and build datasets from<\/div>\n<div>&#8211; Upstream improvements to the Inspect framework and ecosystem, e.g. support for evaluating modern agentic scaffolds.<\/div>\n<\/div>\n<div class=\"section page-centered\">\n<div>\n<h3>KEY RESPONSIBILITIES<\/h3>\n<div class=\"posting-requirements plain-list\" data-qa=\"posting-requirements\">\n<div>\n<ul>\n<li>Rapidly prototype and iterate on internal tools and libraries for building and running frontier language model evaluations<\/li>\n<li>Lead the development of major features from ideation to implementation<\/li>\n<li>Collaboratively define and shape the software roadmap and priorities<\/li>\n<li>Establish and advocate for good software design practices, codebase health, and coding agent practices<\/li>\n<li>Work closely with researchers to understand what challenges they face<\/li>\n<li>Assist researchers with implementation and debugging of research code<\/li>\n<li>Communicate clearly about technical decisions and tradeoffs<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"section page-centered\">\n<div>\n<h3>KEY REQUIREMENTS<\/h3>\n<div class=\"posting-requirements plain-list\" data-qa=\"posting-requirements\">\n<div>\n<ul>\n<li>You must have experience writing production-quality python code<\/li>\n<li>We value candidates from diverse backgrounds and recognise that candidates may demonstrate their skills in different ways.<\/li>\n<\/ul>\n<div><\/div>\n<div>For example, we might be impressed if you have:<\/div>\n<ul>\n<li>Led the development of a successful software tool or product over an extended period (e.g. 1 year or more)<\/li>\n<li>Started and built the tech stack for a company, e.g in a start-up<\/li>\n<li>Worked your way up in a large organisation, repeatedly gaining more responsibility and influencing a large part of the codebase<\/li>\n<li>Authored and\/or maintained a popular open-source tool or library<\/li>\n<li>Placed in a prestigious programming competition (IOI, ICPC, etc.)<\/li>\n<li>5+ years of professional software engineering experience<\/li>\n<\/ul>\n<div><\/div>\n<div>The following would be a bonus:<\/div>\n<ul>\n<li>Experience working with LLM agents or LLM evaluations<\/li>\n<li>Infosecurity \/ cybersecurity experience<\/li>\n<li>Experience working with AWS<\/li>\n<li>Interest in AI Safety<\/li>\n<\/ul>\n<div><\/div>\n<div>We want to emphasize that people <strong>who feel they don\u2019t fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply.<\/strong> We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"section page-centered\">\n<div>\n<h3>LOGISTICS<\/h3>\n<div class=\"posting-requirements plain-list\" data-qa=\"posting-requirements\">\n<div>\n<ul>\n<li>Time Allocation: Full-time<\/li>\n<li>Location: This is an in-person role working out of our London or San Francisco office.<\/li>\n<li><strong>Visa sponsorship:<\/strong> We sponsor visas in both the UK and US. Sponsorship isn&#8217;t guaranteed for every role or candidate, but if we make you an offer, we&#8217;ll work with you to find the right visa route.\n<div data-test-render-count=\"1\"><\/div>\n<div aria-hidden=\"true\"><\/div>\n<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"section page-centered\">\n<div>\n<h3>BENEFITS<\/h3>\n<div class=\"posting-requirements plain-list\" data-qa=\"posting-requirements\">\n<div>\n<ul>\n<li><strong>This role offers market competitive salary, equity, and competitive benefits.<\/strong><\/li>\n<li>Salary: 100k &#8211; 200k GBP (~135k &#8211; 270k USD)<\/li>\n<li>Flexible work hours and schedule<\/li>\n<li>Unlimited vacation<\/li>\n<li>Unlimited sick leave<\/li>\n<li>Up to 6 months of paid parental leave<\/li>\n<li>Comprehensive health, dental and vision insurance<\/li>\n<li>Retirement savings with competitive employer matching (e.g. 401(k) for US employees)<\/li>\n<li>Lunch, dinner, and snacks are provided for all employees on workdays<\/li>\n<li>Paid work trips, including staff retreats, business trips, and relevant conferences<\/li>\n<li>A yearly $1,000 (USD) professional development budge<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"author":1,"featured_media":0,"template":"","meta":{"_acf_changed":false,"om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_promoted":"","_job_location":"London","_application":"https:\/\/jobs.lever.co\/apolloresearch\/604b1964-c746-4b6a-bb7b-0ef9b421950a","_company_name":"Apollo Research","_company_website":"","_company_tagline":"","_company_twitter":"","_company_video":"","_filled":0,"_featured":0,"_remote_position":0,"_job_salary":"","_job_salary_currency":"","_job_salary_unit":""},"job-types":[38],"class_list":{"0":"post-1242","1":"job_listing","2":"type-job_listing","3":"status-publish","6":"job-type-experienced"},"acf":[],"aioseo_notices":[],"jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/coderseditor.com\/itjobs\/wp-json\/wp\/v2\/job-listings\/1242","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/coderseditor.com\/itjobs\/wp-json\/wp\/v2\/job-listings"}],"about":[{"href":"https:\/\/coderseditor.com\/itjobs\/wp-json\/wp\/v2\/types\/job_listing"}],"author":[{"embeddable":true,"href":"https:\/\/coderseditor.com\/itjobs\/wp-json\/wp\/v2\/users\/1"}],"wp:attachment":[{"href":"https:\/\/coderseditor.com\/itjobs\/wp-json\/wp\/v2\/media?parent=1242"}],"wp:term":[{"taxonomy":"job_listing_type","embeddable":true,"href":"https:\/\/coderseditor.com\/itjobs\/wp-json\/wp\/v2\/job-types?post=1242"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}