Back to careers
Research Engineer (Neo Research, Contract)
Implement and run safety evaluations on frontier models, with a focus on loss-of-control and harmful manipulation. (1-3 months contract)
- Start date
- As soon as possible
- Reporting to
- Clement Neo - Research Lead, Founder
Neo Research is an independent frontier safety research and evaluations organisation, based in Singapore. We focus on loss-of-control and harmful manipulation risks in frontier AI models.
What you'll do
- Run safety evaluations (e.g. CyBench, SHADE-Arena) against target models.
- Perform open-ended research on models and/or eliciting dangerous behaviours (e.g. scheming, sandbagging, self-replication).
- Draft sections of safety reports documenting methodology, elicitation, and findings.
What we are looking for
- Hands-on experience running or creating LLM safety evaluations.
- Comfortable with Python and agentic evaluation frameworks (Inspect or similar).
- Rigorous about elicitation methodology: scaffolding, tool access, sampling, and knowing where evals fail.
- Clear technical writing.
Good to have
- Dangerous capability evaluations experience.
- Familiarity with frontier lab safety reports.
- Reading and writing in Mandarin.
Concrete details
- 3–6 hour paid work test to start.
- 1–3 months contract, with potential for full-time transition.
- Remote-friendly.
How to Apply
Use the application link for the full role requirements and submission instructions.
Apply