Note on `autoresearch`

What is autoresearch?

autoresearch lets AI Agents autonomously conduct Machine Learning Research overnight on a single GPU

Note : It is a playful but serious exploration of where AI Assisted Research might be heading

The Core Idea

Instead of a manually tweaking Hyperparameters and Code, you give an AI Agent (like Claude or Codex) control of a small but real LLM Training setup

The Agent then :

  • Modifies the Training Code (train.py)
  • Runs a 5-minute Training Experiment(Interesting)
  • Checks if the Validation Loss Improved
  • Keeps or discards the change
  • Repeats Autonomously

So, it is essentially turning “what Hyperparameters/Architectures should I try?”

Requirements

  • Single NVIDIA GPU (has been tested on H100, but others should work)
  • Python 3.10+
  • uv package manager

Setup Steps

1. Clone the repo

git clone https://github.com/karpathy/autoresearch
cd autoresearch

2. Install uv (if you don’t have it)

curl -LsSf https://astral.sh/uv/install.sh | sh

3. Install dependencies

uv sync

4. Download Data and Train Tokenizer (one-time, ~2 min)

uv run prepare.py

5. Test with a manual training Run (~5 min)

uv run train.py

Thanks for Reading!