Provides a command line interface for interacting with probabilistic dictionaries (PDict objects). It supports the following activities:
- Create a new dictionary from one or more token-delimited input files.
- Merge two existing dictionaries into a new dictionary.
- Subtract an existing dictionary (prune) from a dictionary.
- Normalize probabilities for all entries.
- Write the dictionary to a token-delimited output file.
- Interactively look up individual entries, which are displayed (unfiltered) to the console device.
These activities are supported in interactive mode and in batch mode via command line parameters.
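As a rough illustration of what these activities mean, the sketch below models a probabilistic dictionary as a plain Python mapping. It is not the PDict implementation: the three-field input layout (source, target, probability), the per-source normalization, the summing of shared entries on merge, and all function names are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for a PDict:
# source term -> {target term: probability}.

def create(lines, delimiter):
    """Build a dictionary from 'source<delim>target<delim>prob' lines (assumed layout)."""
    d = defaultdict(dict)
    for line in lines:
        source, target, prob = line.strip().split(delimiter)
        d[source][target] = float(prob)
    return dict(d)

def normalize(d):
    """Rescale each source term's entries so they sum to 1 (assumed scheme)."""
    out = {}
    for source, targets in d.items():
        total = sum(targets.values())
        # Leave all-zero rows untouched to avoid dividing by zero.
        out[source] = {t: p / total for t, p in targets.items()} if total else dict(targets)
    return out

def merge(first, second):
    """Return a new dictionary; values of shared (source, target) pairs are added (assumption)."""
    merged = {s: dict(ts) for s, ts in first.items()}
    for source, targets in second.items():
        slot = merged.setdefault(source, {})
        for target, prob in targets.items():
            slot[target] = slot.get(target, 0.0) + prob
    return merged

def subtract(current, other):
    """Remove from current, in place, every (source, target) pair present in other."""
    for source, targets in other.items():
        if source in current:
            for target in targets:
                current[source].pop(target, None)
            if not current[source]:
                del current[source]
    return current
```

Note that merge returns a new dictionary while subtract modifies its first argument, mirroring the "new dictionary" and "destructively modifies" wording used for the corresponding commands.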
For interactive mode the commands are:
- help -- command list.
- quit -- exit the program.
- open <dictName> -- open an existing dictionary.
- close -- close the open dictionary.
- stats -- show dictionary vocabulary statistics.
- dump <outputFile> <delimiter> -- dump the dictionary to outputFile, using delimiter as the separator.
- look <term> -- look up term in the dictionary.
- normalize -- normalize probabilities for all entries.
- create <dictName> <inputFile> <delimiter> -- create a new dictionary from an input file.
- merge <firstDict> <secondDict> <newName> -- merge firstDict and secondDict into a new dictionary named newName.
- subtract <dictName> -- remove all entries in dictName from the current dictionary. Destructively modifies the current dictionary.
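The command list above can be summarized as a table of command names and argument counts. The sketch below checks one line of interactive input against that table. The function name and the whitespace splitting are assumptions for the example (so a whitespace delimiter such as a tab would not survive it); the program's actual parser may behave differently.

```python
# Argument counts per interactive command, taken from the list above.
COMMANDS = {
    "help": 0, "quit": 0, "open": 1, "close": 0, "stats": 0,
    "dump": 2, "look": 1, "normalize": 0,
    "create": 3, "merge": 3, "subtract": 1,
}

def parse_command(line):
    """Split one input line into (command, args); raise ValueError if malformed."""
    parts = line.split()
    if not parts:
        raise ValueError("empty command")
    name, args = parts[0], parts[1:]
    if name not in COMMANDS:
        raise ValueError(f"unknown command: {name}")
    if len(args) != COMMANDS[name]:
        raise ValueError(
            f"{name} expects {COMMANDS[name]} argument(s), got {len(args)}"
        )
    return name, args
```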
For batch mode the command line parameters are:
- -help -- usage message.
- -dump <inputDict> <outputFile> <delimiter> -- dump the dictionary inputDict to outputFile, using delimiter as the separator.
- -look <inputDict> <term> -- look up term in the dictionary.
- -normalize <inputDict> -- normalize probabilities of all entries.
- -create <dictName> <inputFile> <delimiter> -- create a new dictionary from an input file.
- -merge <firstDict> <secondDict> <newName> -- merge firstDict and secondDict into a new dictionary named newName.
- -subtract <firstDict> <secondDict> -- remove all entries in secondDict from firstDict. Destructively modifies firstDict.
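For scripting batch runs, a small helper can assemble and check the argument list before the program is launched. This is only a sketch: the helper name is invented for the example, and the executable name is passed in by the caller because it is not stated here. The option names and operand counts are the ones listed above.

```python
# Operand counts per batch option, taken from the list above.
BATCH_OPTIONS = {
    "-help": 0, "-dump": 3, "-look": 2, "-normalize": 1,
    "-create": 3, "-merge": 3, "-subtract": 2,
}

def batch_argv(program, option, *operands):
    """Return the argv list for one batch run of `program` (caller-supplied name)."""
    if option not in BATCH_OPTIONS:
        raise ValueError(f"unknown option: {option}")
    expected = BATCH_OPTIONS[option]
    if len(operands) != expected:
        raise ValueError(
            f"{option} expects {expected} operand(s), got {len(operands)}"
        )
    return [program, option, *operands]
```

The resulting list can be handed to `subprocess.run(argv, check=True)`. Passing the operands as a list rather than a shell string keeps a delimiter such as `;` from being interpreted by the shell.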
The following functionality will be implemented in the future.
- Add a Stopper for the source and target languages.
- Add access to a stemmer for the source and target languages.
- Add parameter file processing (for specifying the stemmer, etc.).
- Add a parameter to specify starting in the counts domain.
Generated on Wed Nov 3 13:00:03 2004 for Lemur Toolkit by Doxygen 1.2.18