You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
CodeGeeX/configs/codegeex_13b_parallel.sh

17 lines
686 B
Bash

# CodeGeeX-13B parallel configuration
# Parallel checkpoints are named under the format "mp_rank_0{i}_model_states.pt", where i is the rank, start from 0.
CHECKPOINT_PATH="<path where you put all parallel checkpoints (e.g., XXX/tp4/)>"
MODEL_ARGS="--num-layers 39 \
--hidden-size 5120 \
--num-attention-heads 40 \
--max-position-embeddings 2048 \
--attention-softmax-in-fp32 \
--load "$CHECKPOINT_PATH" \
--layernorm-epsilon 1e-5 \
--fp16 \
--ws-encoding-start-id 10 \
--ws-encoding-length 10 \
--make-vocab-size-divisible-by 52224 \
--seq-length 2048"