X-Git-Url: https://fleuret.org/cgi-bin/gitweb/gitweb.cgi?a=blobdiff_plain;f=README.txt;h=1bd713a42e3b7a6cc5d2f21cea750063da8a0974;hb=HEAD;hp=31710e2d0d8a60af81d33a4e1164519447481156;hpb=1b3b0e7b194a80af2edce9f80dc3e860bc543062;p=beaver.git diff --git a/README.txt b/README.txt index 31710e2..1bd713a 100644 --- a/README.txt +++ b/README.txt @@ -1,9 +1,17 @@ -To run the experiments: +To train the shortest-path solving GPT ./beaver.py -to train the shortest-path solving GPT, and +Same, lighter settings (~95% test success instead of ~99%): - ./beaver.py --overwrite --one_shot --nb_epochs=10 + ./beaver.py --nb_train_samples=25000 --nb_test_samples=10000 -to train the one-shot MLP read-out. +To train with a non-causal attention on the prompt + random +auto-regression order: + + ./beaver.py --nb_epochs=50 --learning_rate_schedule='25: 2e-4' --random_regression_order --noncausal_prompt + +to get the one-shot prediction from an existing checkpoint (trained +with --random_regression_order and --noncausal_prompt): + + ./beaver --oneshot