The Unicode Standard, Version 2.0


Book Description

Version 1.1 aligns the Unicode standard with ISO/IEC 10646-1, and includes changes and additions that have been made in the process of this alignment. This work incorporates these changes and the Version 1.0 material. The accompanying CD-ROM provides the electronic files to be used by implementers.




The Unicode Standard, Version 3.0


Book Description

On Unicode characters




The Unicode Standard, Version 4.0


Book Description

• Most detailed, comprehensive guide to the Unicode programming standard.
• Created and authorized by the Unicode Consortium: the world's leading hardware and software vendors.
• Accompanying CD-ROM contains the entire Unicode Character Database, plus other materials.




Unicode Tutorials - Herong's Tutorial Examples


Book Description

This Unicode tutorial book is a collection of notes and sample code written by the author while he was learning Unicode himself. Topics include Character Sets and Encodings; GB2312/GB18030 Character Set and Encodings; JIS X0208 Character Set and Encodings; Unicode Character Set; Basic Multilingual Plane (BMP); Unicode Transformation Formats (UTF); Surrogates and Supplementary Characters; Unicode Character Blocks; Python Support of Unicode Characters; Java Character Set and Encoding; Java Encoding Maps, Counts and Conversion. Updated in 2024 (Version v5.32) with minor changes. For the latest updates and free sample chapters, visit https://www.herongyang.com/Unicode.




Unicode Demystified


Book Description

Unicode is a critical enabling technology for developers who want to internationalize applications for global environments. But, until now, developers have had to turn to standards documents for crucial information on utilizing Unicode. In Unicode Demystified, one of IBM's leading software internationalization experts covers every key aspect of Unicode development, offering practical examples and detailed guidance for integrating Unicode 3.0 into virtually any application or environment. Writing from a developer's point of view, Rich Gillam presents a systematic introduction to Unicode's goals, evolution, and key elements. Gillam illuminates the Unicode standards documents with insightful discussions of character properties, the Unicode character database, storage formats, character sequences, Unicode normalization, character encoding conversion, and more. He presents practical techniques for text processing, locating text boundaries, searching, sorting, rendering text, accepting user input, and other key development tasks. Along the way, he offers specific guidance on integrating Unicode with other technologies, including Java, JavaScript, XML, and the Web. For every developer building internationalized applications, internationalizing existing applications, or interfacing with systems that already utilize Unicode.




The Unicode Standard 5.0


Book Description

"Hard copy versions of the Unicode Standard have been among the most crucial and most heavily used reference books in my personal library for years." --Donald E. Knuth, The Art of Computer Programming "For more than a decade, Unicode has been a foundation for many Microsoft products and technologies; Unicode Standard Version 5.0 will help us deliver important new benefits to users." --Bill Gates, chairman, Microsoft Corporation "The path W3C follows to making text on the Web truly global is Unicode." --Sir Tim Berners-Lee, kbe, Web inventor and director of the World Wide Consortium (W3C) "Without Unicode, Java wouldn't be Java, and the Internet would have a harder time connecting the people of the world." --James Gosling, Inventor of Java, Sun Microsystems, Inc. These and other software luminaries recognize that Unicode has become an indispensable tool for supporting an increasingly global marketplace (see inside for more acclaim). A comprehensive system of standards for representing alphabets throughout the world, Unicode is the basis for modern programming-- Windows, XML, Python, PERL, Mac OS, Linux--and every major search engine and browser in operation today. New to Unicode Version 5.0 A stable foundation for Unicode Security Mechanisms Property data for the Unicode Collation Algorithm and Common Locale Data Repository Improvements to the Unicode Encoding Model for UTF-8 Rigorous stability of case folding and identifiers for improved interoperability and backward compatibility--enabling additional new ways to optimize code A systematic framework for improved text processing for greater reliability--covering combining characters, Unicode strings, line breaking, and segmentation This new edition of Unicode's official reference manual has been substantially updated to document the latest revisions to the Unicode Standard, with hundreds of pages of new information. 
It includes major revisions to text, figures, tables, definitions, and conformance clauses, and provides clear and practical answers to common questions. For the first time, the book contains the Unicode Standard Annexes, which specify vital processes such as text normalization and identifier parsing. These improvements are so important that Version 5.0 is the basis for Microsoft's Vista generation of operating systems, and is included in upgrade plans for Google, Yahoo!, and ICU, to name but a few. This is the one book all developers using Unicode must have.




The Java Language Specification


Book Description

For nearly five years, one book has served as the definitive reference to Java for all serious developers: The Java Language Specification, by James Gosling, Bill Joy, and Guy Steele. Now, these world-renowned Java authorities (along with new co-author Gilad Bracha) have delivered a monumental update. This completely revised Second Edition covers the Java 2 Platform Standard Edition Version 1.3 with unprecedented depth and precision, offering the invaluable insights of Java's creators to every developer. There is no better source for learning everything about the syntax and semantics of the Java programming language. Developers will turn to this book again and again.




CJKV Information Processing


Book Description

The ultimate English-language source of information on processing text in Chinese, Japanese, Korean, and Vietnamese. In this update, Lunde reexamines the challenges of working with these languages, showing developers in a wide range of fields the latest tools for sharing information that can reach East Asia directly.




The C++ Programming Language


Book Description

The new C++11 standard allows programmers to express ideas more clearly, simply, and directly, and to write faster, more efficient code. Bjarne Stroustrup, the designer and original implementer of C++, has reorganized, extended, and completely rewritten his definitive reference and tutorial for programmers who want to use C++ most effectively. The C++ Programming Language, Fourth Edition, delivers meticulous, richly explained, and integrated coverage of the entire language: its facilities, abstraction mechanisms, standard libraries, and key design techniques. Throughout, Stroustrup presents concise, “pure C++11” examples, which have been carefully crafted to clarify both usage and program design. To promote deeper understanding, the author provides extensive cross-references, both within the book and to the ISO standard.

New C++11 coverage includes
• Support for concurrency
• Regular expressions, resource management pointers, random numbers, and improved containers
• General and uniform initialization, simplified for-statements, move semantics, and Unicode support
• Lambdas, general constant expressions, control over class defaults, variadic templates, template aliases, and user-defined literals
• Compatibility issues

Topics addressed in this comprehensive book include
• Basic facilities: type, object, scope, storage, computation fundamentals, and more
• Modularity, as supported by namespaces, source files, and exception handling
• C++ abstraction, including classes, class hierarchies, and templates in support of a synthesis of traditional programming, object-oriented programming, and generic programming
• Standard Library: containers, algorithms, iterators, utilities, strings, stream I/O, locales, numerics, and more
• The C++ basic memory model, in depth

This fourth edition makes C++11 thoroughly accessible to programmers moving from C++98 or other languages, while introducing insights and techniques that even cutting-edge C++11 programmers will find indispensable.

This book features an enhanced, layflat binding, which allows the book to stay open more easily when placed on a flat surface. This special binding method, noticeable by a small space inside the spine, also increases durability.




Unicode


Book Description

"Unicode is a character encoding standard that includes all o f the major scripts of the world in a simple and consistent manner ... [It] is the product of the Unicode Consortium ... Version 3.0 is current at the time of this writing"--Preface.