|
# (Vectorized) Lexically constrained decoding with dynamic beam allocation |
|
|
|
This page explains how to use lexically constrained decoding in Fairseq.
|
Fairseq implements the code described in the following papers: |
|
|
|
* [Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation](https://www.aclweb.org/anthology/N18-1119/) (Post & Vilar, 2018)
|
* [Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting](https://www.aclweb.org/anthology/N19-1090/) (Hu et al., 2019) |
|
|
|
## Quick start |
|
|
|
Constrained search is enabled by adding the command-line argument `--constraints` to `fairseq-interactive`. |
|
Constraints are appended to each line of input, separated by tabs. Each constraint (one or more tokens) |
|
is a separate field. |
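
Schematically, where `<TAB>` stands for a literal tab character:

```
source sentence<TAB>constraint 1<TAB>constraint 2<TAB>...
```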
|
|
|
The following command, using [Fairseq's WMT19 German--English model](https://github.com/pytorch/fairseq/blob/main/examples/wmt19/README.md), |
|
translates the sentence *Die maschinelle Übersetzung ist schwer zu kontrollieren.* with the constraints |
|
"hard" and "to influence". |
|
|
|
echo -e "Die maschinelle Übersetzung ist schwer zu kontrollieren.\thard\ttoinfluence" \ |
|
| normalize.py | tok.py \ |
|
| fairseq-interactive /path/to/model \ |
|
--path /path/to/model/model1.pt \ |
|
--bpe fastbpe \ |
|
--bpe-codes /path/to/model/bpecodes \ |
|
--constraints \ |
|
-s de -t en \ |
|
--beam 10 |
|
|
|
(`tok.py` and `normalize.py` can be found in the same directory as this README; they are just wrappers around Fairseq's WMT19 preprocessing.)
|
This will generate the following output: |
|
|
|
    [snip]
    S-0 Die masch@@ in@@ elle Über@@ setzung ist schwer zu kontrollieren .
    W-0 1.844 seconds
    C-0 hard
    C-0 to influence
    H-0 -1.5333266258239746 Mach@@ ine trans@@ lation is hard to influence .
    D-0 -1.5333266258239746 Machine translation is hard to influence .
    P-0 -0.5434 -0.1423 -0.1930 -0.1415 -0.2346 -1.8031 -0.1701 -11.7727 -0.1815 -0.1511
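
Here `S-0` is the BPE-encoded source and `W-0` the decoding time; the `C-0` lines echo the constraints for sentence 0; `H-0`/`D-0` are the tokenized and detokenized hypotheses with their model scores; and `P-0` lists the per-token log-probabilities. Note the very low score (-11.7727) at the position of *influence*: the constraint forces a token that the unconstrained model would be unlikely to pick.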
|
|
|
By default, constraints are generated in the order supplied, with any number (zero or more) of tokens generated |
|
between constraints. If you instead want to let the decoder choose the order in which the constraints
are satisfied, use `--constraints unordered`. Note that you may want a larger beam in this mode, since
the search has to keep hypotheses covering many more constraint orderings alive.
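
For example, here is the quick-start command rerun with unordered constraints (same model paths and preprocessing as above; the beam size of 20 is just an illustrative choice):

```bash
echo -e "Die maschinelle Übersetzung ist schwer zu kontrollieren.\thard\tto influence" \
| normalize.py | tok.py \
| fairseq-interactive /path/to/model \
  --path /path/to/model/model1.pt \
  --bpe fastbpe \
  --bpe-codes /path/to/model/bpecodes \
  --constraints unordered \
  -s de -t en \
  --beam 20
```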
|
|
|
## Implementation details |
|
|
|
The heart of the implementation is in `fairseq/search.py`, which adds a `LexicallyConstrainedBeamSearch` instance.
This instance of beam search tracks the progress of each hypothesis in the beam through the set of constraints
provided for its input sentence. It does this using one of two classes, both found in `fairseq/token_generation_constraints.py` (a simplified sketch of the ordered case follows the list):
|
|
|
* `OrderedConstraintState`: assumes the `C` input constraints will be generated in the provided order

* `UnorderedConstraintState`: tries to apply the `C` (phrasal) constraints in all `C!` orders
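
To make the ordered case concrete, here is a minimal, self-contained sketch of the idea; it is an illustration under simplified assumptions, not Fairseq's actual class. The state is a pointer into the list of constraint phrases: a generated token either advances the pointer, is ignored between phrases, or resets partial progress through the current phrase (gaps are allowed between constraints, but not inside a multi-token constraint).

```python
from typing import List, Tuple

# Hypothetical, simplified model of an ordered constraint state:
# constraints are a list of phrases, each phrase a list of token IDs,
# and the state is (index of current phrase, offset within that phrase).
State = Tuple[int, int]


def advance(state: State, token: int, constraints: List[List[int]]) -> State:
    """Advance the constraint state by one generated token.

    A token matching the next unmet constraint token moves the pointer
    forward; any other token is only allowed between phrases, and it
    resets partial progress through the current phrase.
    """
    phrase, offset = state
    if phrase == len(constraints):              # all constraints already met
        return state
    if token == constraints[phrase][offset]:
        offset += 1
        if offset == len(constraints[phrase]):  # this phrase is complete
            return (phrase + 1, 0)
        return (phrase, offset)
    return (phrase, 0)                          # mismatch: restart the phrase


def finished(state: State, constraints: List[List[int]]) -> bool:
    return state[0] == len(constraints)


# Toy run with constraints [["hard"], ["to", "influence"]] as made-up IDs:
constraints = [[7], [3, 9]]
state = (0, 0)
for tok in [5, 2, 7, 3, 9, 4]:  # "... hard to influence ."
    state = advance(state, tok, constraints)
print(finished(state, constraints))  # True
```

On top of these per-hypothesis states, `LexicallyConstrainedBeamSearch` performs the dynamic beam allocation of Post & Vilar (2018): each step's beam is divided into "banks" by the number of constraint tokens met so far, so that partially constrained hypotheses are not crowded out by unconstrained ones.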
|
|
|
## Differences from Sockeye |
|
|
|
There are a number of [differences from Sockeye's implementation](https://awslabs.github.io/sockeye/inference.html#lexical-constraints). |
|
|
|
* Generating constraints in the order supplied (the default option here) is not available in Sockeye. |
|
* Due to an improved beam allocation method, there is no need to prune the beam. |
|
* Again due to better allocation, beam sizes as low as 10 or even 5 are often sufficient. |
|
* [The vector extensions described in Hu et al.](https://github.com/edwardjhu/sockeye/tree/trie_constraints) (NAACL 2019) were never merged |
|
into the main Sockeye branch. |
|
|
|
## Citation |
|
|
|
The first paper describing lexical constraints for seq2seq decoding is:
|
|
|
```bibtex |
|
@inproceedings{hokamp-liu-2017-lexically, |
|
title = "Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search", |
|
author = "Hokamp, Chris and |
|
Liu, Qun", |
|
booktitle = "Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)", |
|
month = jul, |
|
year = "2017", |
|
address = "Vancouver, Canada", |
|
publisher = "Association for Computational Linguistics", |
|
url = "https://www.aclweb.org/anthology/P17-1141", |
|
doi = "10.18653/v1/P17-1141", |
|
pages = "1535--1546", |
|
} |
|
``` |
|
|
|
The Fairseq implementation uses the extensions described in
|
|
|
```bibtex |
|
@inproceedings{post-vilar-2018-fast, |
|
title = "Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation", |
|
author = "Post, Matt and |
|
Vilar, David", |
|
booktitle = "Proceedings of the 2018 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)", |
|
month = jun, |
|
year = "2018", |
|
address = "New Orleans, Louisiana", |
|
publisher = "Association for Computational Linguistics", |
|
url = "https://www.aclweb.org/anthology/N18-1119", |
|
doi = "10.18653/v1/N18-1119", |
|
pages = "1314--1324", |
|
} |
|
``` |
|
|
|
and |
|
|
|
```bibtex |
|
@inproceedings{hu-etal-2019-improved, |
|
title = "Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting", |
|
author = "Hu, J. Edward and |
|
Khayrallah, Huda and |
|
Culkin, Ryan and |
|
Xia, Patrick and |
|
Chen, Tongfei and |
|
Post, Matt and |
|
Van Durme, Benjamin", |
|
booktitle = "Proceedings of the 2019 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)", |
|
month = jun, |
|
year = "2019", |
|
address = "Minneapolis, Minnesota", |
|
publisher = "Association for Computational Linguistics", |
|
url = "https://www.aclweb.org/anthology/N19-1090", |
|
doi = "10.18653/v1/N19-1090", |
|
pages = "839--850", |
|
} |
|
``` |
|
|