Perl & LWP and over one million other books are available for Amazon Kindle. Learn more



or
Sign in to turn on 1-Click ordering
More Buying Choices
Have one to sell? Sell yours here
Start reading Perl & LWP on your Kindle in under a minute.

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.
Sorry, this item is not available in
Image not available for
Color:
Image not available

To view this video download Flash Player

 

Perl & LWP [Paperback]

Sean M. Burke
4.6 out of 5 stars  See all reviews (12 customer reviews)

List Price: $39.99
Price: $26.52 & FREE Shipping. Details
You Save: $13.47 (34%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In Stock.
Ships from and sold by Amazon.com. Gift-wrap available.
Want it Friday, May 24? Choose One-Day Shipping at checkout. Details
Free Two-Day Shipping for College Students with Amazon Student

Formats

Amazon Price New from Used from
Kindle Edition $17.27  
Paperback $26.52  
Amazon.com Textbooks Store
Shop the Amazon.com Textbooks Store and save up to 70% on textbook rentals, 90% on used textbooks and 60% on eTextbooks.

Book Description

June 27, 2002 0596001789 978-0596001780 1

Perl soared to popularity as a language for creating and managing web content, but with LWP (Library for WWW in Perl), Perl is equally adept at consuming information on the Web. LWP is a suite of modules for fetching and processing web pages.

The Web is a vast data source that contains everything from stock prices to movie credits, and with LWP all that data is just a few lines of code away. Anything you do on the Web, whether it's buying or selling, reading or writing, uploading or downloading, news to e-commerce, can be controlled with Perl and LWP. You can automate Web-based purchase orders as easily as you can set up a program to download MP3 files from a web site.

Perl & LWP covers:

  • Understanding LWP and its design
  • Fetching and analyzing URLs
  • Extracting information from HTML using regular expressions and tokens
  • Working with the structure of HTML documents using trees
  • Setting and inspecting HTTP headers and response codes
  • Managing cookies
  • Accessing information that requires authentication
  • Extracting links
  • Cooperating with proxy caches
  • Writing web spiders (also known as robots) in a safe fashion
Perl & LWP includes many step-by-step examples that show how to apply the various techniques. Programs to extract information from the web sites of BBC News, Altavista, ABEBooks.com, and the Weather Underground, to name just a few, are explained in detail, so that you understand how and why they work. Perl programmers who want to automate and mine the web can pick up this book and be immediately productive. Written by a contributor to LWP, and with a foreword by one of LWP's creators, Perl & LWP is the authoritative guide to this powerful and popular toolkit.

Frequently Bought Together

Perl & LWP + Spidering Hacks
Price for both: $46.49

One of these items ships sooner than the other.

Buy the selected items together
  • Spidering Hacks $19.97


Editorial Reviews

Amazon.com Review

Perl & LWP sets out to unwrap the Library for the Web in Perl (LWP), which is a collection of modules that make it easier to access and pick apart Web pages (and FTP-accessible files, and outgoing e-mail messages) from within your Perl programs. The book succeeds wonderfully, not only in conveying the technical aspects of LWP programming, but in making clear the fun of doing work that's very well suited to Perl. Sean Burke assumes that his readers know something about Perl, albeit not much, and a similar amount about HTML. He does a great job of explaining how LWP functions fit into Perl programs, and how you can use them to make reference to Internet resources far more easily than before.

Burke's narrative takes the form of a guided tour in which he introduces his readers to aspects of the LWP modules one by one. His tone is generally straightforward (sharp commentary alternates with brief code listings, with occasional passages of reference material), but there's sometimes an undercurrent of exuberance that makes the reader want to get going with his or her own programming right away. Overall, the emphasis is on teaching both LWP and Perl itself to the extent necessary to do LWP work. Because of the concise and nicely indexed code modules, though, you'll find this book useful as a reference after you're under way with LWP. --David Wall

Topics covered: How to program with LWP and Perl itself. All of LWP's strong points--including HTML parsing (with tokens and trees as well as with regular expressions), HTML generation and modification, manipulation of HTML forms, and the operation of spiders--are covered. This book has more of a tutorial tone than any similar reference material on the Internet.

About the Author

Sean Burke is an active member in the Perl community and one of CPAN's most prolific module authors. He has been a columnist for The Perl Journal since 1998, and is an authority on markup languages. Trained as a linguist, he also develops tools for software internationalization and Native language preservation.

Product Details

  • Paperback: 262 pages
  • Publisher: O'Reilly Media; 1 edition (June 27, 2002)
  • Language: English
  • ISBN-10: 0596001789
  • ISBN-13: 978-0596001780
  • Product Dimensions: 7 x 0.5 x 9.2 inches
  • Shipping Weight: 1.1 pounds (View shipping rates and policies)
  • Average Customer Review: 4.6 out of 5 stars  See all reviews (12 customer reviews)
  • Amazon Best Sellers Rank: #181,478 in Books (See Top 100 in Books)

More About the Author

Sean Michael Burke is the author of "Perl & LWP" (softcover version, and free online version) and "RTF Cookbook", and was a columnist for "The Perl Journal" for several years. His columns often focused on processing (human) languages, and most are collected in "The Best of the Perl Journal: Games, Diversions, and Perl Culture" and free online.

He has a master's degree in linguistics from Northwestern University, with the thesis "The Design of Online Lexicons", which appears as a chapter in "A Practical Guide to Lexicography". He has written several dozen Perl Archive (CPAN) modules including Maketext, the basis of Perl software localization; HTML-Tree, Perl's notably robust HTML parser; and the specification and modern implementation for Pod, Perl's documentation system.

Burke currently works for several projects for the preservation of endangered North American languages. He wrote "RTF Pocket Guide" after having learned RTF (a text format like .doc) as part of those projects, for the step where lexicon databases need to be turned into conventionally formatted dictionaries for publication. Examples now in print: "Dictionary of Alaskan Haida" and Dictionary of Jicarilla Apache: Abáachi Mizaa Iłkee' Siijai".

He makes programmers think he's basically a linguist and linguists think he's basically a programmer. He is a sparklingly charismatic yet always modest person who really does enjoy writing about himself in the third person like this. He currently lives on a small island off the coast of Canada.

(This Sean Burke is not the (Irish scholar) Sean Burke who wrote "Death and Return of the Author", "The Ethics of Writing", and "Deadwater"; nor is he the (Canadian athlete) Sean Burke who was a goalee in the National Hockey League; but he assumes they're nice people anyway.)

Customer Reviews

4.6 out of 5 stars
(12)
4.6 out of 5 stars
Most Helpful Customer Reviews
18 of 18 people found the following review helpful
Format:Paperback
I was definitely interested when I first heard that O'Reilly were publishing a book on LWP. LWP is a definitive collection of perl modules covering everything you could think of doing with URIs, HTML, and HTTP. While 'web services' are the buzzword friendly technology of the day, sometimes you need to roll your sleeves up and get a bit dirty scraping screens and hacking at HTML. For such a deep subject, this book weighs in at a slim 242 pages. This is a very good thing. I'm far too busy to read these massive shelf-destroying tomes that seem to be churned out recently.

It covers everything you need to know with concise examples, which is what makes this book really shine. You start with the basics using LWP::Simple through to more advanced topics using LWP::UserAgent, HTTP::Cookies, and WWW::RobotRules. Sean shows finger saving tips and shortcuts that take you more than a couple notches above what you can learn from the lwpcook manpage, with enough depth to satisfy somebody who is an experienced LWP hacker.

This book is a great reference, just flick through and you'll find a relevant chapter with an example to save the day. Chapters include filling in forms and extracting data from HTML using regular expressions, then more advanced topics using HTML::TokeParser, and then my preferred tool, the author's own HTML::TreeBuilder. The book ends with a chapter on spidering, with excellent coverage of design and warnings to get your started on your web trawling.

Comment | 
Was this review helpful to you?
15 of 15 people found the following review helpful
Format:Paperback
If you aren't yet comfortable using object-oriented Perl modules, the multitude of examples will at least allow you see how it's done even if you're a bit fuzzy on what's happening 'underneath' when you call object methods. If you're comfortable learning how to do something without knowing exactly why it works, then the author's clear step-by-step explantions and numerous progressively more powerful examples should make this book accessible even to relatively innexperienced Perl programmers.

More experienced programmers will understand better why things work, but any Perl programmer will set this book down feeling empowered to turn the web into their own valet. No longer do you need to check multiple sites looking for interesting information. Instead, you can readily author code to do that for you and alert you when items of interest are found. You can use these tools to free up personal time, to harvest information to inform business decisions, to automate tedious web application testing, and a zillion other things.

The author's clear exploration of the relevant Perl modules leaves the reader with a good depth of understanding of what these modules do, when you might want to use which module, and how to use them for real world tasks. Before reading the book, I knew of these modules, but they were a rather intimidating pile. I'd used a few of them on occasion for rather limited projects, but was reluctant to invest the time required to read all of the documentation from the whole collection. Mountains of method-level documentation do not a tutorial make. This book takes all of that information, selects the most important parts, and ensures that those parts are covered in progressively more powerful and/or flexible examples.

If you know Perl and you're sick of 'working the web' to get information and you want the web to work for you instead, then you need this book. I had a personal project that was on the back burner for a couple of years because it just sounded too hard. The weekend after I finished this book, I wrote what I had previously thought to be the hard part of that project and it was both easy and fun. This book makes hard things not just possible, but actually easy.

-matt

Comment | 
Was this review helpful to you?
14 of 14 people found the following review helpful
5.0 out of 5 stars Very Informative and useful August 7, 2002
Format:Paperback
As a web programmer, I had dealt with several such projects dealing with web automation and writing simple crawlers even before I read "Perl & LWP". The book was the first book I've read on the subject, and I'm by no means disappointed. The book is very well organized, very informative and nails the subject in the head. I am pleased.

I noticed some inaccuracies in the discussions, some chopped off paragraphs and sentences. But this doesn't affect the usability of the book much. Author Sean Burke does a great job in walking one through the most of the aspects of web automation and data extraction in the web using Perl and LWP (libwww in Perl ).

The codes the book gives are very well organized, well written and easily debugable. The steps are pretty consistent across all the examples:
a) Inspect the HTML source code of the page;
b) Determine the tokens and patterns of interest;
c) Write the first code;
d) Fine tune the code;

As usual, I'll be commenting on individual chapters to give you an idea of the
coverage of the book in more details...

Comment | 
Was this review helpful to you?
Most Recent Customer Reviews
4.0 out of 5 stars Is it horse or goat?
I can't tell if that is a goat or a horse, or 2 goats or 2 horses...maybe one or each...whatever. I did enjoy something about this book, or I would not be all the way down... Read more
Published 8 months ago by Otter
5.0 out of 5 stars Good but...
I needed to use LWP to interact with various web APIs; 'Perl & LWP' turned out to be exactly what I was looking for. Read more
Published on February 24, 2011 by James
5.0 out of 5 stars Honest Assessment of Burke's Perl & LWP
This is not your typical clunker with endless pages of filler material. It gets right to the point. If you want to learn about using Perl to interact with the internet, this would... Read more
Published on July 13, 2007 by Pall C
5.0 out of 5 stars A Wonderful Book
I bought this book to get information automatically on japanese stocks(for example, charts, price, volume, PER, PBR, ROE, ROA, News, messages on Yahoo! Read more
Published on August 18, 2006 by Japanese Reviewer
5.0 out of 5 stars Great book!
If you are unfamiliar with LWP and web scraping, or HTML parsing using tokens and trees, I strongly recommend this book. Read more
Published on March 15, 2003 by Matthew D. Huwiler
1.0 out of 5 stars Terrible, bug-infested book...
I really don't know how the previous 5 reviews gave this book 5 stars. I was really excited about this book when I first read the reviews, and now here I am only a few chapters in... Read more
Published on November 5, 2002 by Kevin
5.0 out of 5 stars Exploit the web with power and ease
Disclaimer: The author is an online-type-friend and I used to work with the author of the foreword. I even got my copy for free. Read more
Published on September 1, 2002 by Neil Kandalgaonkar
5.0 out of 5 stars Fabulous book!
This book is a comprehensive and authoritative guide to web automation. It reads as both a gentle tutorial and a well organized reference. Read more
Published on August 30, 2002 by K. Boggs
5.0 out of 5 stars A must-read for exploiting the web in a GOOD way.
A great book for anyone who wishes to automate daily tasks on the web. Sean does an outstanding job of showing how Perl can be used to extract and manipulate not just data but... Read more
Published on July 11, 2002 by Kevin Healy
Search Customer Reviews
Only search this product's reviews

What Other Items Do Customers Buy After Viewing This Item?


Forums

There are no discussions about this product yet.
Be the first to discuss this product with the community.
Start a new discussion
Topic:
First post:
Prompts for sign-in
 



So You'd Like to...


Create a guide


Look for Similar Items by Category