or
Sign in to turn on 1-Click ordering
Sell Us Your Item
For a $10.30 Gift Card
Trade in
Kindle Edition
Read instantly on your iPad, PC, Mac, Android tablet or Kindle Fire
Buy Price: $49.56
Rent From: $17.47
 
 
 
More Buying Choices
Have one to sell? Sell yours here
Sorry, this item is not available in
Image not available for
Color:
Image not available

To view this video download Flash Player

 

The Practitioner's Guide to Data Quality Improvement (The MK/OMG Press) [Paperback]

David Loshin
4.8 out of 5 stars  See all reviews (12 customer reviews)

List Price: $63.95
Price: $53.19 & FREE Shipping. Details
You Save: $10.76 (17%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
Usually ships within 1 to 4 weeks.
Ships from and sold by Amazon.com. Gift-wrap available.
Free Two-Day Shipping for College Students with Amazon Student

Formats

Amazon Price New from Used from
 
Kindle Edition
Rent from
$49.56
$17.47
 
Paperback $53.19  
Rent Your Textbooks
Save up to 70% when you rent your textbooks on Amazon. Keep your textbook rentals for a semester and rental return shipping is free.

Book Description

October 29, 2010 0123737176 978-0123737175 1

Business problems are directly related to missed data quality expectations. Flawed information production processes introduce risks preventing the successful achievement of critical business objectives. However, these flaws are mitigated through data quality management and control: controlling the quality of the information production process from beginning to end to ensure that any imperfections are identified early, prioritized, and remediated before material impacts can be incurred. The Practitioner's Guide to Data Quality Improvement shares the fundamentals for understanding the impacts of poor data quality, and guides practitioners and managers alike in socializing, gaining sponsorship for, planning, and establishing a data quality program. This book shares templates and processes for business impact analysis, defining data quality metrics, inspection and monitoring, remediation, and using data quality tools. Never shying away from the difficult topics or subjects, this is the seminal book that offers advice on how to actually get the job done.

  • Offers a comprehensive look at data quality for business and IT, encompassing people, process, and technology.
  • Shows how to institute and run a data quality program, from first thoughts and justifications to maintenance and ongoing metrics.
  • Includes an in-depth look at the use of data quality tools, including business case templates, and tools for analysis, reporting, and strategic planning.

Frequently Bought Together

The Practitioner's Guide to Data Quality Improvement (The MK/OMG Press) + Executing Data Quality Projects: Ten Steps to Quality Data and Trusted Information (TM) + Data Quality Assessment
Price for all three: $156.85

Some of these items ship sooner than the others.

Buy the selected items together


Editorial Reviews

Review

"There is NOTHING like this out there that I am aware of, and certainly nothing from anyone with same stature as David Loshin."--David Plotkin, Wells Fargo Bank

"The book provides a comprehensive look at data quality from both a business and IT perspective. It does not just cover technology issues, but discusses people, process, and technology. And that is important, because this is the mix that is needed in order to initiate any type of quality improvement regimen."--Data Technology Today Blog

From the Back Cover

Business problems are directly related to missed data quality expectations. Flawed information production processes introduce risks preventing the successful achievement of critical business objectives. However, these flaws are mitigated through data quality management and control: controlling the quality of the information production process from beginning to end to ensure that any imperfections are identified early, prioritized, and remediated before material impacts can be incurred. The Practitioner's Guide to Data Quality Improvement shares the fundamentals for understanding the impacts of poor data quality, and guides practitioners and managers alike in socializing, gaining sponsorship for, planning, and establishing a data quality program. This book shares templates and processes for business impact analysis, defining data quality metrics, inspection and monitoring, remediation, and using data quality tools. Never shying away from the difficult topics or subjects, this is the seminal book that offers advice on how to actually get the job done.


Product Details

  • Paperback: 432 pages
  • Publisher: Morgan Kaufmann; 1 edition (October 29, 2010)
  • Language: English
  • ISBN-10: 0123737176
  • ISBN-13: 978-0123737175
  • Product Dimensions: 7.5 x 0.9 x 9.2 inches
  • Shipping Weight: 1.6 pounds (View shipping rates and policies)
  • Average Customer Review: 4.8 out of 5 stars  See all reviews (12 customer reviews)
  • Amazon Best Sellers Rank: #81,154 in Books (See Top 100 in Books)

More About the Author

David Loshin, president of Knowledge Integrity, Inc, (www.knowledge-integrity.com), is a recognized thought leader and expert consultant in the areas of data quality, master data management, and business intelligence. David is a prolific author regarding BI best practices, via the expert channel at www.b-eye-network.com and numerous books and papers on BI and data quality. His book, "Business Intelligence: The Savvy Manager's Guide" (June 2003) has been hailed as a resource allowing readers to "gain an understanding of business intelligence, business management disciplines, data warehousing, and how all of the pieces work together." His book, "Master Data Management," has been endorsed by data management industry leaders, and his valuable MDM insights can be reviewed at www.mdmbook.com.

David can be reached at loshin@knowledge-integrity.com.

Customer Reviews

4.8 out of 5 stars
(12)
4.8 out of 5 stars
3 star
0
2 star
0
1 star
0
Most Helpful Customer Reviews
3 of 3 people found the following review helpful
5.0 out of 5 stars Improve Your Data Quality By Reading This Book November 16, 2010
Format:Paperback
David Loshin's new book, The Practitioner's Guide to Data Quality Improvement, is well-organized, helpful, and on topic. One of my pet peeves is the poor state of data quality rampant just about everywhere these days... and Loshin's text offers expert guidance on how organizations can remedy that situation.

The book provides a comprehensive look at data quality from both a business and IT perspective. It does not just cover technology issues, but discusses people, process, and technology. And that is important, because this is the mix that is needed in order to initiate any type of quality improvement regimen.

In the book, Loshin shows how to institute and run a data quality program, from start to finish. And this is all helpful information. But I think my favorite chapter of the book is the one on Data Quality Service Level Agreements. This is so because data quality is not a project that can be started and completed. It needs to become an on-going component of our everyday procedures. And only through adopting a service level agreement mentality when it comes to data quality can we ever hope to make data quality monitoring and improvement an accepted, regular component of what we do.
Comment | 
Was this review helpful to you?
2 of 2 people found the following review helpful
Format:Paperback|Amazon Vine™ Review (What's this?)
Some of the other reviews that have been posted here provide some interesting observations from perspectives that are not always centered on data architecture or general enterprise architecture, and the hope of this reviewer is that he will be able to offer feedback to others on this text based on his consulting experience in these areas. In his preface, David Loshin comments that "this book is intended to provide the fundamentals for developing the enterprise data quality program, and is intended to guide both the manager and the practitioner in establishing operational data quality control throughout an organization, with particular focus on the ability to build a business case for instituting a data quality program", "the assessment of levels of data quality maturity", "the guidelines and techniques for evaluating data quality and identifying metrics related to the achievement of business objectives", "the techniques for measuring, reporting, and taking action based on these metrics", and "the policies and processes used in exploiting data quality tools and technologies for data quality improvement".

With these goals in mind, this reviewer thinks Loshin succeeded in this effort. Taking into account the fact that data quality is an enormous practice area, and success requires understanding of both data and the business to succeed, this introductory text walks the reader step-by-step through a considerable number of topics over which many authors would likely stumble. Some of the explanations that Loshin provides, such as the one in the chapter entitled "Developing a Business Case and a Data Quality Road Map" on how data flaws can incur business impacts, are extremely well done, especially when married with effective diagrams. And in his chapter entitled "Metrics and Performance Improvement", the author provides an explanation on drilling through key performance indicators that this reviewer has not seen elsewhere until this effort, and the presentation is exceedingly well done. Other areas of this text that this reviewer especially appreciates are the chapters entitled "Data Requirements Analysis", "Metadata and Data Standards", and "Inspection, Monitoring, Auditing, and Tracking".

This reviewer however would like to make potential readers of this book aware that most of what Loshin provides here is high level walkthroughs and examples of pertinent elements within data quality, rather than practical advice on how to approach much of the lower level work that should be expected to take place on a day-to-day basis. For example, in the chapter entitled "Entity Identity Resolution", the author provides a section on matching algorithms that briefly discusses parsing and standardization, abbreviation expansion, edit distance, phonetic comparison, and n-gramming, which consumes just a few short paragraphs. The author does not explain that there are many more matching algorithms currently in use in industry, that in most cases matching exercises need to take into account multiple rather than single algorithms in isolation, that in the world today internationalization takes an ever more important role when performing matching, and that there is a wide variety of commercial tooling available that needs to be assessed based on the needs of the organization.

However, armed with this knowledge the reader is sure to make use of this work by utilizing it while planning and strategizing data quality, as well as making use of it as introductory material to understanding what it might take to pursue efforts that require a higher level of data quality maturity such as master data management (MDM), in which case this reviewer recommends "Enterprise Master Data Management: An SOA Approach to Managing Core Information" by Allen Dreibelbis, Eberhard Hechler, Ivan Milman, Martin Oberhofer, Paul van Run, and Dan Wolfson (see my review). In the opinion of this reviewer, what Loshin provides here is best suited for managers looking to piece together all of the steps associated with data quality pursuits as well as get a better handle on how each of the steps are interrelated and whether each is a requirement or just an option, possibly looking to solve some aspects of data quality in an evolutionary, piecemeal fashion rather than revolutionary endeavor.
Was this review helpful to you?
2 of 2 people found the following review helpful
5.0 out of 5 stars Judge this book by its cover January 17, 2011
Format:Paperback|Amazon Vine™ Review (What's this?)
Mr. Loshin has produced a hands-on, practical work on data quality improvement and management. If you need some theory but a nuts and bolts focus with a framework laid out for you, this is the book for you. You can do a thorough assessment of the quality of your data across several dimensions, and develop a roadmap for making a program of specific improvements.

Transactional data is distinguished here from informational data, especially from what quality means. They differ in nature and in function. Informational data is often mishandled by trying to apply the same standards and principles as you would for transactional data. Because transactional data is primary and in full view of your business operations, it can overshadow informational data. While quality is necessary and vital for transactional data, it is not sufficient for an optimally profitable process. You may be in a line of business where the race is won on the competitive advantage gotten from complex, accurate and flexible informational data. Quality flaws here are not always obvious up front.

I use this book to:

- develop action plans at all levels from assessments to strategy to hard dollar reporting.
- teach myself, my staff and my colleagues (each needing different education)

On the technical level, this book is certainly not the last word; but it is detailed enough and thorough enough to get real work done while organizing your research and development efforts to plan an enterprise level program of data quality improvement both transactional and informational. I have been able to define a stream of benefits at each stage of the program.

The vulnerabilities I can address using this book are related to regulatory non-compliance, audit failures for data integrity in reporting and contractual liabilities. On the other hand, I use the data profiling techniques to improve revenue streams as allowed by contracts and related instruments. In particular, contracts that provide for incentives (and penalties), premiums, bonuses and performance schedules. Data profiling, parsing and standardization are the big efforts to achieving profit results especially when you are dealing with heterogeneous data sets, for example transactional portfolios from different customers or partners where you need to establish audit worthy claims to monies due you. We leave a lot of money on the table that we either overlook or miscalculate or do not claim in accordance with agreed terms and conditions. All interested parties need to have confidence in the reliability and accuracy of data.

Informationally, profit and performance align by themselves to create optimal processes and systems. Data quality tools are used to engineer that alignment. Of course, the really nasty side of improving data quality is the organizational. The graves are full of visionaries, pioneers, prophets or whatever buzz word you like or despise. Here I mean the doers, and not the happy babble types that float in and out of our rather more difficult working lives. So Mr. Loshin here has a good list of topics in the organizational area, so that you can see and understand what are the obstacles and any possible paths for progress. Maturity, readiness, will, governance, policy and leadership are all pre-requisites for making quality improvements that can be measured in hard dollar pay-offs. He has working templates and descriptions for charting all the components of organizational maturity.. He has all the worksheets and exhibits you will need to determine and to document your assessments and recommendations. I followed his roadmap without making a big political (confrontational) deal about getting agreement, one player at a time until the executive leadership decided what geniuses they were for coming up with it.

I would have been satisfied enough with just the first five chapters, just to get me launched. But the real meat is in the metrics. You will see the standard old warhorse of statistical process control. I have really come to like the granularity of daily performance at the shop floor level because you can find the weak and the broken events as they happen, especially as you start to deploy process changes. People really respond when you are able to show them within a day of launch if they are still doing x or not yet doing y. By the way, all the mathematics here are early high school level, so you can expect to roll them out to all your analysts, leads and coordinators with just a little presentation, review and coaching.

In highly federated organizations, your first steps will present themselves out of common ground, dependencies and other basic interfaces. Communities of interest and best practices follow especially if you can establish a single showcase.

Mr. Loshin lays out the dimensions of data quality, each requiring a method of measurement. He says that each line of business can pick these up on a dashboard that can eventually roll up to an enterprise view. Furthermore, he wants these dimensions to create a hierarchy of categories that lay out according to a pyramidal framework where the broad base is constructed from simple measurements and many rules, upward toward the enterprise view of few rules and complex metrics. This undertaking is ambitious, far beyond the statistical process control and continuous process improvement of the industrial beginnings.

His dimensions are categorized between the intrinsic and the contextual. Both are problematic. Contextual dimensions are by definition formed in relation to one another, tested for consistency and coherence. There is no way of knowing if you have done a complete job at any point in time. Anyway all these systems and process exist in a dynamic world anyway, so you are never truly done, but you get the point. But the idea is that if they make relational sense, you are likely on the right track.

We are still far from some grand unification theory of business process and data quality. Indeed, just when you get to the critical detail of this schema, you hit a patch like this:

Yet although information policies (such as those governing security or privacy) are a major source of data quality assertions, they imply the need for data governance, which is covered in chapter 7. - [i.e. the previous chapter]

Huh? If you read it really fast you can pretend that it means something. But just try to parse it. In a tough and complicated exposition where you most need clarity, he goes all muddy. This is no time for guessing, which is precisely what you are left to do. In a moment of a Godel-like nightmare, Mr. Loshin makes completeness and coherence two of the contextual dimensions. He does say consistency rather than coherence, but to little comfort. Talk about asking for trouble, especially under the scrutiny of those who have not yet bought into this program...

Yet his failure to attain the elegance of mathematical theory or the rigor of applied science does not invalidate his approach. Seeing is believing, which is his ultimate aim. The intrinsic dimensions are more straightforward and serve a bit as guideposts in otherwise uncharted territory. Again it is his tables that provide you with handholds to his framework. Chances are you will not be able to farm out a lot of this work to your people, especially at the beginning. You must make the high level customization to your organization, your business context. Then others can carry it forward into their own work and departments. A core team of architects, analysts, process mappers, project coordinators, data stewards and report producers will require a material operating expense just to get started, about a six month period, and about eighteen to be really firing on all cylinders. You will be integrated into the familiar world of budgeting and forecasting, capacity planning, performance measurement, and project/data governance.

His treatment of a Data Requirements Analysis Process shows you how to transform the way you manage data and processes. The BIG however, is that it presumes a level of project and development life cycle management far advanced from what is usually existing in operational business units.

A further complication is that informational data by nature is a highly derived and constructed. Because strategic foresight and organizational cooperation (let alone external cooperation) are rare in our customary approach to work, whether business or government, these derivations and constructions are so much more divergent at their origins. You can build cooperation into your methodologies and governance in order to make data standards a priority. The best thing you can start with is a proper assessment of the state of standardization (and data quality, for that matter) and decide on ways to move toward your desired point of arrival. A common fallacy, stemming from shortsightedness, is to choose highly proprietary solutions, confusing obscurity with innovation and competitive advantage. When leadership imagines that it is way out ahead of the pack, it is actually only temporarily so, because you often motivate the seemingly disadvantaged to cooperate against the outlier. Think Beta Max. By contrast, when contemplating the critical employment of meta data, Mr. Loshin does so via ISO 11179.

I cannot resist pointing out that, as he is illustrating the problem of data quality, he gives as an example all the different ways of representing California. He includes "06" as the number in alphabetical order. The table he references meanwhile clearly shows California in position five. I almost think he is doing this on purpose to prove his point in a self-validating example. That or he just wants to se if I am paying attention. Read more ›
Was this review helpful to you?
Most Recent Customer Reviews
5.0 out of 5 stars The Textbook for Data Quality Programs
This book has everything you need to know about putting data quality standards into practice. As other reviewers have noted, it's organized in textbook fashion which actually... Read more
Published 1 month ago by Jami Garrison
5.0 out of 5 stars I learned so much!
As our clients call for more data quality services to complement other data hygiene services we provide for them I turned to David's book to become better educated in this DQ... Read more
Published 11 months ago by M. Shaw
4.0 out of 5 stars Oriented for Academia
Every organization struggles with data quality. This is a very thorough reference book that attacks the data governance model with gusto. Read more
Published 19 months ago by GX
5.0 out of 5 stars Dense, but worthwhile
Wow, is this book detailed. The first several chapter are hard to get through because he's laying the foundation for the details later in the book. Read more
Published 21 months ago by Brian A. Roush
5.0 out of 5 stars Mark Allen, co-author "Master Data Management in Practice: Achieving...
Having been a regular practitioner and advocate of data quality improvement in a large corporate environment, making headway in this challenging area requires all the support,... Read more
Published on February 10, 2011 by Mark S. Allen
4.0 out of 5 stars Almost Indispensable
Professionals with a modicum of experience and sense run like the wind away from any job with the title "Data Quality" in it----and with very good reason: it is a sure... Read more
Published on February 5, 2011 by Jeffrey A. Veyera
5.0 out of 5 stars For Those Responsible for Data Governance and Those Promoting Data...
This book will certainly be helpful if your company is responsible for data quality service level agreements. Read more
Published on December 1, 2010 by Citizen John
5.0 out of 5 stars Another book that hits the mark from David
David Loshin is one of the most recognized experts in the areas of data management. David's many books and papers on data management are required reading for those in the field. Read more
Published on November 30, 2010 by Data Man
5.0 out of 5 stars A Great DQ Book
The book is a very good source for any professional interested in data quality. The book covers a broad variety of data quality aspects: organizational, business and technical... Read more
Published on November 24, 2010 by Larry Dubov
Search Customer Reviews
Only search this product's reviews

What Other Items Do Customers Buy After Viewing This Item?


Forums

There are no discussions about this product yet.
Be the first to discuss this product with the community.
Start a new discussion
Topic:
First post:
Prompts for sign-in
 



So You'd Like to...


Create a guide


Look for Similar Items by Category