Commit

Merge pull request karpathy#463 from goswamig/test1
Fixing eval path in README
karpathy authored Jun 3, 2024
2 parents 7c7e627 + 3ab86ce commit 9755682
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions README.md
@@ -131,10 +131,10 @@ Finally, to train on a single GPU simply run the `python train.py` script. Have
OpenAI GPT-2 checkpoints allow us to get some baselines in place for openwebtext. We can get the numbers as follows:

```sh
-python train.py eval_gpt2
-python train.py eval_gpt2_medium
-python train.py eval_gpt2_large
-python train.py eval_gpt2_xl
+$ python train.py config/eval_gpt2.py
+$ python train.py config/eval_gpt2_medium.py
+$ python train.py config/eval_gpt2_large.py
+$ python train.py config/eval_gpt2_xl.py
```

and observe the following losses on train and val: