- Paperback: 336 pages
- Publisher: Packt Publishing (August 19, 2009)
- Language: English
- ISBN-10: 1847195881
- ISBN-13: 978-1847195883
- Product Dimensions: 7.5 x 0.8 x 9.2 inches
- Shipping Weight: 1.3 pounds (View shipping rates and policies)
- Average Customer Review: 12 customer reviews
- Amazon Best Sellers Rank: #2,393,663 in Books (See Top 100 in Books)
Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
To get the free app, enter your mobile phone number.
Other Sellers on Amazon
Solr 1.4 Enterprise Search Server Paperback – August 19, 2009
|New from||Used from|
This month's Book With Buzz: "The Lying Game" by Ruth Ware
From the instant New York Times bestselling author of blockbuster thrillers "In a Dark, Dark Wood" and "The Woman in Cabin 10" comes Ruth Ware’s chilling new novel, "The Lying Game." See more
What other items do customers buy after viewing this item?
About the Author
Born to code, David Smiley is a senior software engineer, book author, conference speaker, and instructor. He has 12 years of experience in the defense industry at MITRE, specializing in Java and Web technologies. David is the principal author of "Solr 1.4 Enterprise Search Server", the first book on Solr, published by PACKT in 2009. He also developed and taught a two-day course on Solr for MITRE. David plays a lead technical role in a large-scale Solr project in which he has implemented geospatial search based on geohash prefixes, wildcard ngram query parsing, searching multiple multi-valued fields at coordinated positions, part-of-speech search using Lucene payloads, and other things. David consults as a Solr expert on numerous projects for MITRE and its government sponsors. He has contributed code to Lucene and Solr and is active in the open-source community. Prior to his Solr work, David first used Lucene back in 2000, as well as Hibernate-Search and Compass since then. He also used the competing Endeca commercial product, too, but hopes to never use it again. Fascinated by the 'craft' of software development, Eric Pugh has been heavily involved in the open source world as a developer, committer, and user for the past five years. He is an emeritus member of the Apache Software Foundation and lately has been mulling over how we solve the problem of finding answers in datasets when we don't know the questions ahead of time to ask. In biotech, financial services, and defense IT, he has helped European and American companies develop coherent strategies for embracing open source search software. As a speaker, he has advocated the advantages of Agile practices with a focus on testing in search engine implementation. Eric became involved with Solr when he submitted the patch SOLR-284 for Parsing Rich Document types such as PDF and MS Office formats that became the single most popular patch as measured by votes! The patch was subsequently cleaned up and enhanced by three other individuals, demonstrating the power of the open source model to build great code collaboratively. SOLR-284 was eventually refactored into Solr Cell as part of Solr version 1.4. He blogs at http://www.opensourceconnections.com/.
Top customer reviews
I'm giving this book 4 stars only because I think the authors could have done a better job explaining the process and best practices for deploying solr to production.
If you are looking to build a fast and accurate search engine for large amounts of data, Solr is the way to go and this book will help you.
The Ant script blew up. It turns out that you also need to set up and run the instance of Solr first. And to do that, you have to run a multi-core instance of Solr. Don't know what that is? Go ahead and read the book first, and then you'll know, at which point you can go back to just trying to set up the stuff you need to follow along with the book. The java command they specify to run a multi-core Solr instance is about a mile long. They provide some notes about each option, but what good are notes there? If you don't already know a lot about the options used to start Java, you won't know what it's doing, and if anything goes wrong, you're stuck.
The note on all of this also says that if you have memory problems, you can refer to chapter 9, the very last chapter in the book, for help.
So to get started with this book you need to read and understand it first.
There's another point where the author goes off on a tangent about remote streaming, starts to explain a topic, then says, "I won't go any further into that. If you already know about this, you're good. If you don't, you can go learn about it." That's perfect: get me started then drop the subject and shift gears.
Plus too much of the text is unclear and ambiguous, because the authors use poor and misleading syntax. There was no professional editing of the copy. There were several passages I had to read repeatedly and ponder, before I understood, and it was not because the concept was difficult. It was because a simple idea was clouded by confusing syntax that is reminiscent of spoken speech.
There is much to learn here but there are sizable roadblocks placed in your way. The data set is large and complex, so it serves to illustrate many concepts, but it's way too much work to get it set up. They should have gone with a single data file, which you could then download directly and feed to Solr with the java post.jar file. Demonstrating multi-core could have been done with a couple other files, if you wanted to dig into it. As it is, you jump through all of the hoops, from the start, or you have nothing.
I've finally decided that I don't want to play along with their examples; they have me convinced that every time I jump through a hoop, I will discover another, and then another. So the book's usefulness to me has now dropped considerably.
If there were any other books available, I'd probably recommend not buying this book. If you need to get started with Solr, this is your only option except for a brand new "cookbook" from the same publisher that isn't even available yet.
This book started out a bit slow, but was pretty well organized. It showed numerous ways of bringing data into Solr, and numerous ways of getting data out. The publisher lets you download the source code and data on their site, and you can stick that into Solr.
I also like the fact that the book is pretty recent. I could not find anything in the book which had become "deprecated". All in all, the book helped me go from knowing nothing about Solr, to going live in 2-3 weeks which is pretty darn good.