This tool is part of the Apertium machine translation architecture: http://apertium.org.
apertium-preprocess-corpus-lextor data_dir translation_dir input_file output_file
apertium-preprocess-corpus-lextor preprocesses the training corpus used to train the lexical selector.
This tool currently has no options.
These are the arguments used with this tool:
data_dir the path to the linguistic data to use.
translation_dir the translation direction to use.
input_file contains a large corpus in raw format.
output_file the file to which the preprocessed corpus is written.
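The four positional arguments above are given in the order shown in the synopsis. The following is a minimal dry-run sketch that only assembles and prints the command line, so it can be checked before running the tool itself. The helper name lextor_cmd, the data path, the direction value "es-ca" and the file names are all hypothetical and chosen for illustration; substitute the values from your own installation.

```shell
#!/bin/sh
# Dry-run sketch: prints the command line, does not execute the tool.
# All values below are assumptions for illustration only.
DATA_DIR="/usr/share/apertium/apertium-es-ca"  # hypothetical linguistic data path
DIRECTION="es-ca"                              # hypothetical translation direction
INPUT="corpus.txt"                             # hypothetical raw corpus
OUTPUT="corpus.preprocessed"                   # hypothetical output file

# lextor_cmd (hypothetical helper): checks that exactly four arguments
# are given and prints them after the tool name, in synopsis order.
lextor_cmd() {
    if [ "$#" -ne 4 ]; then
        echo "usage: lextor_cmd data_dir translation_dir input_file output_file" >&2
        return 1
    fi
    printf '%s %s %s %s %s\n' apertium-preprocess-corpus-lextor "$1" "$2" "$3" "$4"
}

lextor_cmd "$DATA_DIR" "$DIRECTION" "$INPUT" "$OUTPUT"
```

Once the printed line looks right, the same arguments can be passed to apertium-preprocess-corpus-lextor directly. The sketch prints arguments unquoted, so paths containing spaces would need quoting when the command is actually run.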
Lots of...lurking in the dark and waiting for you!
(c) 2005,2006 Universitat d'Alacant / Universidad de Alicante. All rights reserved.