Getting the download link through email is temporarily not available. Please check back later.
To get the free app, enter your mobile phone number.
The Voice in the Machine: Building Computers That Understand Speech (MIT Press) Hardcover – March 23, 2012
ITPro.TV Video Training
Take advantage of IT courses online anywhere, anytime with ITPro.TV. Learn more.
Frequently Bought Together
Customers Who Bought This Item Also Bought
With the explosive growth in speech applications on Android, iPhone and other devices, The Voice in the Machine is a timely read. It relates the 50+ year quest to develop voice recognition and synthesis, explains how the technologies work, and contains enough anecdotes to make it fun.(Alfred Z. Spector, Vice President of Research, Google, Inc.)
There are many books on speech technology, but this is the first to explain the technology against a backdrop of the broader forces that have shaped the field. This will become a must-read text for those interested in what speech technology is and how it has developed.(Robert Dale, Centre for Language Technology, Macquarie University)
Roberto Pieraccini's fascinating book takes us on a tour of human speech, modern techniques for speech understanding and generation, and the problems of deploying it in real industrial applications. By using examples, he conveys the essence of modern statistical speech processing without resorting to mathematics. This book is both entertaining and educational, and highly recommended.(Steve Young, Professor of Information Engineering, University of Cambridge)
This is a fascinating tour of the development of modern speech technologies and applications…A wonderful historical account of the growth of speech technology.(Choice)
About the Author
Roberto Pieraccini, Director of ICSI, the International Computer Science Institute in Berkeley, California, has been active for more than thirty years in speech research and technology.
Top Customer Reviews
Mostly about developments in the speech recognition field (for completeness, Pieraccini has one chapter on Text-to-Speech), it's a very well-written, comprehensive survey of the history and current developments in speech technology.
It covers everything from the earliest attempts, through all the government-sponsored ARPA speech recognition challenges, to recent commercial deployments. The book would well serve as a reference for a college course or just for leisure reading: it's the best example I've ever seen of a book that explains concepts behind complex math, intuitively, without using a single equation. Roberto's writing style could almost be called poetic. It definitely conveys the passion behind the science. You must get this!
The author starts out by describing in convincing detail why human speech is so complex and difficult to understand, and to recreate in a lab or a commercial setting. He then goes on to describe early attempts inspired by AI, eventually arriving at statistical approaches that are the basis of most modern speech processing systems.
I like the book in its broad coverage, and while I do realize that the book is not aimed at techies, I'd have appreciated a little more coverage of HMMs and EM.
At a handful of places, there are some editing oversights that are simply disappointing for a book from a writer of this caliber (Ch. 5: "...De Mori, who pursued a brilliant carrier first at McGill..." -- career, not carrier).
Nonetheless, the book is a good read for someone interested in this technology.
Be aware that the target audience is NEITHER
- people who already understand computer speech technology (unless perhaps they want to learn some history) OR
- the intellectually lazy. This is a difficult subject, and to get the most out of it, you will occasionally have to close the book and think about what you have just read.
But assuming you are in this target audience (you're an engineer in another field, a physicist, an astronomer, basically someone curious about the world around you) and want to learn the basic history, ideas, successes, and failures of computer speech understanding, I have never come across a book close to as good as this.
I only wish there were a comparable book in similar fields like computer vision, or computer translation.
Most Recent Customer Reviews
A very good high-level overview. The earlier parts of the book provide more implementation details than the later parts, which tend to gloss over the details in favor of recounting... Read morePublished 14 months ago by Brandon Fosdick