/home/cfdaclip/.local/bin/t5_mesh_transformer \
  --tpu=node-2 \
  --gcp_project=convrerank \
  --tpu_zone=us-central1-f \
  --model_dir=gs://convrerank/checkpoints/monot5-base-canard4ir-t40 \
  --gin_param="init_checkpoint = 'gs://castorini/monot5/experiments/base/model.ckpt-1100000'" \
  --gin_file=dataset.gin \
  --gin_file=models/bi_v1.gin \
  --gin_file=gs://t5-data/pretrained_models/base/operative_config.gin \
  --gin_param="utils.tpu_mesh_shape.model_parallelism = 1" \
  --gin_param="utils.tpu_mesh_shape.tpu_topology = '2x2'" \
  --gin_param="utils.run.train_dataset_fn = @t5.models.mesh_transformer.tsv_dataset_fn" \
  --gin_param="tsv_dataset_fn.filename = 'gs://convrerank/dataset/canard4ir.train.convrerank.t40.txt'" \
  --gin_file=learning_rate_schedules/constant_0_001.gin \
  --gin_param="run.train_steps = 1150000" \
  --gin_param="run.save_checkpoints_steps = 10000" \
  --gin_param="utils.run.sequence_length = {'inputs': 512, 'targets': 4}" \
  --gin_param="utils.run.batch_size=('tokens_per_batch', 131072)"