From 40f25d42df66a57cd606871997fbbf595867002e Mon Sep 17 00:00:00 2001 From: =?utf8?q?Fran=C3=A7ois=20Fleuret?= Date: Fri, 7 Apr 2023 19:12:20 +0200 Subject: [PATCH] Update --- README.txt | 17 +++++++++++++---- 1 file changed, 13 insertions(+), 4 deletions(-) diff --git a/README.txt b/README.txt index f25211f..1bd713a 100644 --- a/README.txt +++ b/README.txt @@ -1,8 +1,17 @@ -To train the shortest-path solving GPT, and train the one-shot MLP -read-out: +To train the shortest-path solving GPT - ./beaver.py --oneshot + ./beaver.py Same, lighter settings (~95% test success instead of ~99%): - ./beaver.py --nb_train_samples=25000 --nb_test_samples=10000 --oneshot + ./beaver.py --nb_train_samples=25000 --nb_test_samples=10000 + +To train with a non-causal attention on the prompt + random +auto-regression order: + + ./beaver.py --nb_epochs=50 --learning_rate_schedule='25: 2e-4' --random_regression_order --noncausal_prompt + +to get the one-shot prediction from an existing checkpoint (trained +with --random_regression_order and --noncausal_prompt): + + ./beaver --oneshot -- 2.39.5