Skip to content

Instantly share code, notes, and snippets.

@zheyuye
Created June 15, 2020 18:54
Show Gist options
  • Save zheyuye/3268de894db7a3ac06503ea3798bd359 to your computer and use it in GitHub Desktop.
Save zheyuye/3268de894db7a3ac06503ea3798bd359 to your computer and use it in GitHub Desktop.
#!/bin/bash
set -e
set -x
export TASK=SQUAD
export SQUAD_VERSION=2.0
export MODEL_NAME=large
export SQUAD_DATA=/home/ubuntu/SQuAD_data
export BS=2
export ACCUMULATE=4
GBS=$(($BS * $ACCUMULATE))
export LR=5e-5
export MSL=512
export LWD=0.9
export WD=0.0
export EP=2
export MGN=0.1
export SEED=28
export OUTPUT_DIR=electra_layerwise/${TASK}${SQUAD_VERSION}_${MODEL_NAME}_${GBS}_${LR}_${WD}_${EP}_${MGN}_${LWD}_${SEED}
pip3 install numpy
set +x
python3 -m run_squad \
--model_name=google_electra_${MODEL_NAME} \
--do_eval \
--do_train \
--data_dir=${SQUAD_DATA} \
--output_dir=${OUTPUT_DIR} \
--layerwise_decay=${LWD} \
--gpus=0,1,2,3 \
--num_accumulate=${ACCUMULATE} \
--version=${SQUAD_VERSION} \
--batch_size=${BS} \
--lr=${LR} \
--wd=${WD} \
--seed=${SEED} \
--max_seq_length=${MSL} \
--eval_batch_size=32 \
--epochs=${EP} \
--warmup_ratio=0.1 \
--overwrite_cache \
--max_grad_norm=${MGN} \
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment