A highly selective MT bibliography

Of direct relevance to MT

A. Berger, P. Brown, S. Della Pietra, V Della Pietra, J. Lafferty, H. Printz, and L. Ures (1994). The Candide system for machine translation. In Proceedings of the ARPA Conference on Human Language Technology.

A. Berger, S. Della Pietra, and V. Della Pietra (1996). A maximum entropy approach to natural language processing. Computational Linguistics, 22(1), 39--71.

P. Brown, J. Cocke, S. Della Pietra, V. Della Pietra, F. Jelinek, J. Lafferty, R. Mercer, and P. Roosin (1990). A statistical approach to machine translation. Computational Linguistics, 16, 79--85.

P. Brown, S. Della Pietra, V. Della Pietra, and R. Mercer (1991). A statistical approach to sense disambiguation in machine translation. In Proceedings of the DARPA Workshop on Speech and Natural Language, 146--151.

P. Brown, S. Della Pietra, V. Della Pietra, and R. Mercer (1991). The mathematics of statistical machine translation: parameter estimation. Computational Linguistics, 19(2), 263--311.

B. Merialdo. Tagging text with a probabilistic model (1990). In Proceedings of the IBM Natural Language ITL, 161--172.

Background Reading

L. Bahl, P. Brown, P. de Souza, and R. Mercer (1989). A tree-based statistical language model for natural language speech recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 37(7).

L. Bahl, F. Jelinek, and R. Mercer (1983). A maximum likelihood approach to continuous speech recognition. IEEE Trans. on Pattern Analysis and Machine Intelligence, PAMI-5(2), 179--190.

S. Della Pietra, V. Della Pietra, J. Gillett, J. Lafferty, H. Printz, and L. Ures (1994). Inference and estimation of a long-range trigram model. In Proeedings of the Second International Symposium on Grammatical Inference.

A. Dempster, N. Laird, and D. Rubin (1977). Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, 39(B), 1--38.

W. Feller (1957). An introduction to probability theory and its applications, volume 1. John Wiley & Sons.

F. Jelinek and R. Mercer (1980). Interpolated estimation of markov source parameters from sparse data. In Proceedings, Workshop on Pattern Recognition in Practice.



Adam Berger