What is autoresearch?
autoresearch lets AI Agents autonomously conduct Machine Learning Research overnight on a single GPU
Note : It is a playful but serious exploration of where AI Assisted Research might be heading
The Core Idea
Instead of a manually tweaking Hyperparameters and Code, you give an AI Agent (like Claude or Codex) control of a small but real LLM Training setup
The Agent then :
- Modifies the Training Code (train.py)
- Runs a 5-minute Training Experiment(Interesting)
- Checks if the Validation Loss Improved
- Keeps or discards the change
- Repeats Autonomously
So, it is essentially turning “what Hyperparameters/Architectures should I try?”
Requirements
- Single NVIDIA GPU (has been tested on H100, but others should work)
- Python 3.10+
- uv package manager
Setup Steps
1. Clone the repo
git clone https://github.com/karpathy/autoresearch
cd autoresearch
2. Install uv (if you don’t have it)
curl -LsSf https://astral.sh/uv/install.sh | sh
3. Install dependencies
uv sync
4. Download Data and Train Tokenizer (one-time, ~2 min)
uv run prepare.py
5. Test with a manual training Run (~5 min)
uv run train.py
Thanks for Reading!