LYCOS RETRIEVER Beta Retriever Home  |  What is Lycos Retriever?   
Unicode: Unicode Character Database
built 628 days ago
Short description: The Unicode Character Database [UnicodeData] lists all characters that have been deprecated in [Unicode]. This list may grow (slowly) over time. Deprecated characters remain valid characters forever, but their use is strongly discouraged. Deprecation of characters is applied only in exceptional circumstances. It is never the result of historical changes of a writing system: characters no longer in current, modern use are retained in Unicode, as they are needed for the representation of historical documents.
Source:
The Unicode consortium is dedicated to maintaining a character set that allows computers to deal with the vast array of human writing systems. When you think of computers that manage such a large and complex data set, you think databases, and this is precisely what the consortium provides for computer access to versions of the Unicode standard. The Unicode Character Database comprises files that present detailed information for each character and class of character. The strong tie between XML and Unicode means this database is very valuable to XML developers and authors. In this article Uche Ogbuji introduces the Unicode Character Database and shows how XML developers can put it to use.
Source:
The following table gives an overview of the various compatibility characters, organized by "compatibility tag". The first column, Tag value, contains the value of the "compatibility tag" from the Unicode Character Database [UnicodeData]. Although these tags use "<" and ">", they do not appear as such in markup and should not be confused with XML tags. Code range indicates a further break down by code points. Action summarizes the recommended action to be taken whenever markup is first applied to non-XML text. Each entry indicates whether the characters can be substituted using the compatibility equivalent according to Normalization Form KC of [UAX 15], can be replaced by equivalent markup where available, or should be retained. For some cases, instead of or in addition to markup, style information [CSS] is needed.
Source:
Animated GIF of various characters The pages in the following list can be used to display the ranges of characters defined in the Unicode 3.0 Character Database, within the limitations imposed by your Web browser and the proportional font that you are using. There is ... a page with a sample of Unicode characters from each range.
Source:
SEARCH
MORE ABOUT