Information Theory
This enthusiastic introduction to the fundamentals of information theory builds from classical Shannon theory through to modern applications in statistical learning, equipping students with a uniquely well-rounded and rigorous foundation for further study. Introduces core topics such as data compression, channel coding, and rate-distortion theory using a unique finite block-length approach. With over 210 end-of-part exercises and numerous examples, students are introduced to contemporary applications in statistics, machine learning and modern communication theory. This textbook presents information-theoretic methods with applications in statistical learning and computer science, such as f-divergences, PAC Bayes and variational principle, Kolmogorov's metric entropy, strong data processing inequalities, and entropic upper bounds for statistical estimation. Accompanied by a solutions manual for instructors, and additional standalone chapters on more specialized topics in information theory, this is the ideal introductory textbook for senior undergraduate and graduate students in electrical engineering, statistics, and computer science.
- Provides a systematic treatment of information-theoretic techniques in statistical learning and high-dimensional statistics
- Develops information theory for both continuous and discrete variables providing examples relevant to statistical and machine learning applications
- Focuses on finite block length (non-asymptotic) results, equipping students with information theory knowledge required for modern applications such as 6G and future network design
- Advanced material suitable for skipping on first reading is clearly indicated, enabling a fast introduction to fundamental concepts which can be enhanced with additional material on re-reading
Reviews & endorsements
'Polyanskiy and Wu's book treats information theory and various subjects of statistics in a unique ensemble, a striking novelty in the literature. It develops in depth the connections between the two fields, which helps to presenting the theory in a more complete, elegant and transparent way. An exciting and inspiring read for graduate students and researchers.' Alexandre Tsybakov, CREST-ENSAE, Paris
'Since the publication of Claude E. Shannon's A Mathematical Theory of Communication in 1948, information theory has expanded beyond its original focus on reliable transmission and storage of information to applications in statistics, machine learning, computer science, and beyond. This textbook, written by two leading researchers at the intersection of these fields, offers a modern synthesis of both the classical subject matter and these recent developments. It is bound to become a classic reference.' Maxim Raginsky, University of Illinois, Urbana-Champaign
'The central role of information theory in data science and machine learning is highlighted in this book, and will be of interest to all researchers in these areas. The authors are two of the leading young information theorists currently active. Their deep understanding of the area is evident in the technical depth of the treatment, which also covers many communication theory-oriented aspects of information theory.' Venkat Anantharam, University of California, Berkeley
'Written in a mathematically rigorous yet accessible style, this book offers information-theoretic tools that are indispensable for high-dimensional statistics. It also presents the classic topic of coding theorems in the modern one-shot (finite block-length) approach. To put it briefly, this is the information theory textbook of the new era.' Shun Watanabe, Tokyo University of Agriculture and Technology
Product details
January 2025Hardback
9781108832908
748 pages
261 × 182 × 43 mm
1.78kg
Available