I am using fairseq (version: 1.0.0a0+14c5bd0) to fine-tune a model as per this link. However, there are lots of parameters used that I cannot find in the docs nor when I run fairseq-train --help
. Examples include:
--warmup-updates
--encoder-normalize-before
--label-smoothing
Are they replaced by some other params?