Top critical review
on October 26, 2015
First half of the book I noticed that there are a few code smells, for instance most of the objects, like "parsers" could have been refactored as static method groups instead of objects. I don't know if there is code later on in the book which uses these same implementations, only giving them purpose to be implemented in that manner, but I am beginning to doubt that's the case. I think this code smell is called "the stateless object" but am unsure. Also, there are other frameworks out there for parsing HTTP, such as the HTML agility pack. This pack might not have been around when this book was written, I would have to look up the dates. I will continue reading this book, it is still a fairly good introduction to web spiders, but everybody should be aware of the code smell.