"For statisticians with a little background in biology, this book delivers a very readable presentation on the analysis of DNA sequences to determine whether a motif is of statistical significance due to its overabundance (or underabundance) in terms of frequencies or location. This book is concise but sufficiently detailed. Biologists without a background in mathematical statistics may find the learning curve a little steep but tractable. The authors' continuous use of practical examples will be greatly appreciated by biologists and statisticians interested in learning about DNA sequences and motifs."
J. Wade Davis, University of Missouri-Columbia, Journal of the American Statistician
"... A welcome introduction to word analysis..."
Daniel M. Burns, Jr., Mathematical Reviews
One of the most important problems in computational biology is whether a particular gene sequence (mathematically, a 'word') is simply random or is of biological significance, for example, because of its frequency or location. This book introduces the mathematical and statistical ideas used in solving this so-called exceptional word problem.