No single book covers all the material in this class. The first half is covered reasonably well by:
Foundations of Statistical Natural Language Processing, Manning and Schütze, MIT Press, 1999.
It is available for sale in the University Book Store, and can also be accessed on-line (from within CMU only) at http://cognet.mit.edu/library/books/view?isbn=0262133601.
Other useful books are:
        - Statistical Methods for Speech Recognition, F. Jelinek, MIT Press, 1997.
        - Elements of Information Theory, Cover and Thomas, Wiley & Sons, 1991.
        - Text Compression, Bell, Cleary, and Witten, Prentice Hall, 1990.
        - Probability and Statistics, M. DeGroot, second edition, Addison Wesley. 
        - All of Statistics: A Concise Course in Statistical Inference, Larry A. Wasserman, first edition, Springer, 2004.
Last modified: Aug 27, 2017