Singapore Technical Alignment Research Accelerator (STARA)

The first Singapore-based AI Safety bootcamp to accelerate careers in research engineering

Dates: Tentatively June 2–30
Venue: Singapore AI Safety Hub (22 Cross Street)

About STARA

STARA (Singapore Technical Alignment Research Accelerator) is a Singapore-based AI Safety bootcamp modelled on the global ARENA program, a hands-on, fast-paced accelerator whose participants have gone on to OpenAI, Anthropic, and top safety research fellowships like MATS.

Curriculum

Week 1

Fundamentals

In this week, you’ll learn coding best practices, become familiar with the PyTorch library, and build and train your own neural networks (CNNs and ResNets).

Week 2

Transformers and Mechanistic Interpretability

The transformer is an important neural network architecture used for language modelling, and it has made headlines with the introduction of models like ChatGPT.

In this week, you’ll learn how transformers work, then build and train your own. You’ll also study the mechanistic interpretability of transformers, a field advanced by Anthropic’s Transformer Circuits sequence and by the work of Neel Nanda.

Week 3

LLM Evaluations

In this week, you’ll learn how to evaluate LLMs. We’ll take you through the process of building a multiple-choice benchmark from scratch and using this to evaluate current models.

We’ll then move on to study LM agents: how to build them, and how to evaluate them.

Week 4

Capstone Project

In this week, you will work on a capstone project.

You’ll have the chance to dig into a topic that you found interesting during the course, applying the skills and knowledge you’ve accumulated over the previous three weeks.

Express your interest

We are currently gauging interest and will run this program once we have enough participants. Please click the button below and fill out the form, and we will be in touch with you shortly!

Express interest here