Fundamentally, computers just deal with numbers. They store letters and other characters by assigning a number for each one. There are hundreds of different encoding systems for mapping characters to numbers, but Unicode promises a single mapping. Unicode enables a single software product or website to be targeted across multiple platforms, languages and countries without re-engineering. It's no wonder that industry giants like Apple, Hewlett-Packard, IBM andMicrosoft have all adopted Unicode.
Containing everything you need to understand Unicode, this comprehensive reference from O'Reilly takes you on a detailed guide through the complex character world. For starters, it explains how to identify and classify characters - whether they're common, uncommon, or exotic. It then shows you how to type them, utilize their properties, and process character data in a robust manner.
The book is broken up into three distinct parts. The first few chapters provide you with a tutorial presentation of Unicode and character data. It gives you a firm grasp of the terminology you need to reference various components, including character sets, fonts and encodings, glyphs and character repertoires.
The middle section offers more detailed information about using Unicode and other character codes. It explains the principles and methods of defining character codes, describes some of the widely used codes, and presents code conversion techniques. It also discusses properties of characters, collation and sorting, line breaking rules and Unicode encodings. The final four chapters cover more advanced material, such as programming to support Unicode.
You simply can't afford to be without the nuggets of valuable information detailed in Unicode Explained.
{"itemData":[{"priceBreaksMAP":null,"buyingPrice":47.99,"ASIN":"059610121X","isPreorder":0},{"priceBreaksMAP":null,"buyingPrice":49.45,"ASIN":"0596102429","isPreorder":0},{"priceBreaksMAP":null,"buyingPrice":48.8,"ASIN":"0596514476","isPreorder":0}],"shippingId":"059610121X::1Aa32%2FlfFEtC8FXgohYkCMm8bcF%2Fu7kCLNN0sen2nWWaRXD8uDsQtv7aHHLniQUdOf1m24EHL2qUT%2FudzTn%2BzSQamKDcp76F1qfckyGRpq8F0WOGJU2FwQ%3D%3D,0596102429::VRtV2iAZt8Qq%2ByhewvQChYb%2BYxcrePpGY%2FB5qq91DDAs2FzqI2G%2BwKguddK%2B%2BMehf4Uv8BKLQnJmUQLwcopo6lD3vzvcWE7oqKAippx9SOA%3D,0596514476::gdacIbsMGddTt3a5AZBOONW32pgkItjZRT7oqfC3R%2BGg2pRaUtSHR8kpz%2By2c05AItDs78ftNwqlXd5fcmfZFnC4ps7YgPOqe9uB6oBV5nA%3D","sprites":{"addToWishlist":["wl_one","wl_two","wl_three"],"addToCart":["s_addToCart","s_addBothToCart","s_add3ToCart"],"preorder":["s_preorderThis","s_preorderBoth","s_preorderAll3"]},"currenyCode":"USD","shippingDetails":{"xz":"same","yz":"same","xy":"same","xyz":"same"},"tags":["x","y","z"],"strings":{"addToWishlist":["add to wishlist","Add both to Wish List","Add all three to Wish List"],"addToCart":["Add to Cart","Add both to Cart","Add all three to Cart"],"showDetailsDefault":"Show availability and shipping details","shippingError":"An error occurred, please try again","hideDetailsDefault":"Hide availability and shipping details","priceLabel":["Price:","Price for both:","Price for all three:"],"preorder":["Pre-order this item","Pre-order both items","Pre-order all three items"]}}
Jukka Korpela is a consultant who specializes in character codes, localization, orthography, usability, and accessibility. After graduating from Helsinki University of Technology, he taught these subjects in the university's Computer Science department and worked on localization and accessibility issues at TIEKE before becoming a full-time author and consultant. His previous books on CSS and XHTML were published in Finland by Docendo press.
I'm an IT generalist and specialist with strong interest in humanities--and humans. This explains why I have focused on character codes, localization, and accessibility.
After M.Sc. degree in mathematics, I worked with IT services, user support, software development, and teaching at Helsinki University of Technology over 25 years and also started writing books. Later, I have worked as an author and consultant, in addition to employment periods at the Finnish Information Society Development Center (IT standardization) and at the Blue1 airline company (multilingual online sales and web site). My books in Finnish deal with programming languages (Fortran, Pascal, C), Unix, web publishing, HTML, CSS, data security, office document design and style, and web typography.
My motto is "Docendo disco, scribendo cogito"--by teaching I learn, by writing I think. I love to get deeper insight and wider understanding by formulating things for others, in a manner that is both understandable and informative, correct, and useful--a real challenge.
I'm an advocate of standardization, but with realism learned the hard way. I always try to promote relevant standards and consensus-based recommendations, but I also try to point out their shortcomings and problems in acceptance and in implementations.
You might find me on various discussion forums on the Internet, but I'm partly an old-timer who loved the good old Usenet, where I frequented in many web-related groups. I'm adapting to the new modes of communication, which are partly much more effective (like moderated e-mail lists and StackOverflow).
¶ I had another Unicode book on my desk for a long time. Hardbound, thick, impressive. Never found a way to derive useful information from it however. This book is different.
¶ I had high expectations for this book because the author, Jukka Korpela, is one of those erudite and patient people who work hard to raise the signal to noise ratio in Internet newsgroups and other forums. I certainly have quite a few posts from "Yucca" in my working archive of Web tips.
¶ Working with Web pages and applications, one can run into practical problems with text display. For Americans especially, often using default software configurations, some of the problems of displaying content in other languages can seem intractable. They are not of course -- but a bit of help from workers in the rest of the world can be a real lift. After all, they deal with these issues in a practical way more often.
¶ I had a nasty run-in (also known as "learning experience") with browser display issues when my "CSS Cheatsheet" rose in popularity in Google and other search engines. I decided to create a page quoting comments from linking sites in their native languages. Everything was fine until I got to Russian. I felt as if I were up against a conspiracy of browsers, tools, operating systems and even particular custom configurations!
¶ If you are like me and your focus is practical, I recommend:
The first two chapters in Part 1: Characters as Data; Writing Characters
All the advanced topics in Part 3: these 5 chapters covered character issues involved with programming and developing in the Internet environment.
¶ Overall, this book is well-organized and quite readable, with lots of relevant illustrations. Important material is repeated and summarized for greater clarity. The author also used lots of examples from Windows programs that are familiar to many of us. This is a real plus.
If you're like me, you probably think of Unicode as "expanded ASCII" and that's about it. But there is infinitely more to the subject than I thought, and Unicode Explained by Jukka K. Korpela is an exhaustive reference to all that is Unicode. And in this increasingly global computing environment, you will need to know this information...
Contents:
Part 1 - Working with Characters: Characters as Data; Writing Characters; Character Sets and Encoding
Part 2 - A Systematic Look at Unicode: The Structure of Unicode; Properties of Characters; Unicode Encodings
Part 3 - Advanced Unicode Topics: Characters and Languages; Character Usage; The Character Level and Above; Characters in Internet Protocols; Characters in Programming
Appendix - Tables for Writing Characters; Index
In concept, Unicode is real simple. An expanded character set using 16 bit encoding, and you can accommodate far more languages and symbols than straight ASCII. But the implementation is far more complex than that. Korpela starts with the basics of characters... what they are, what they mean, and the nuances involved. From there, you learn about how applications have to interpret the different encoding standards and handle things like case, sort orders, line breaks, etc. When I saw the size of the book (600+ pages), I wondered if the material was just a lot of reference tables that could be found online. Gladly, it's not... This is an exploration of everything that is Unicode, and you'd have to wade through a lot of web pages to even begin to glean the level and value of information that you'll find here.
If you have anything to do with programming or designing global software, this book purchase is a no-brainer. And even if you're not doing anything in that area right now, this is one of those reference titles that is worth having on your bookshelf and available for the first time you *do* need it. It won't take long to pay for itself...
This book is excellent. The author's writing style is easy to read and he pretty much explains everything about Unicode. It's perfect if you're working with multi-lingual Web sites or email, or just if you want to start using Unicode for all of your Web site development (something everyone should do).
The only thing disappointing about this book is that all of his examples and screen shots are for and from Windows. A reader could come away with the feeling that Mac OS X and Linux don't have as much support for Unicode as Windows which, of course, is not the case at all. The least he could have done is to mention and give screenshots of Linux's "Character Map" app and Mac OS X's built-in "Character Palette", both of which are pretty much just like the Windows "Character Map" app.
I'm surprised O'Reilly allowed a book about such a platform-neutral subject to be so Windows-centric. Hopefully they can hire someone to add Linux and Mac OS X examples into the second edition.