Quote:
Originally Posted by Gambit
thats cool....
wait, how did you get that japanees(excuse my spelling) wirting in a post?
|
Welcome to the wonderful world of
Unicode. ASCII encodes the basic English character set as numbers, with 'high-bit ASCII' changing with country and computer type -- the Windows 'code page' arrangement for getting Scandinavian characters like 'ð' and 'ø'. Unicode, however, creates a coding system to allow characters from
any language to be used -- Cyrillic characters like 'ш', Japanese
hiragana characters like 'が', Chinese ideographs like '丈', and others. Note that, while the
character may be understood by your computer, you may not be able to
see it -- you still need the appropriate font installed on your computer for it to draw the character on your screen -- I see the aleph 'א' on my screen, but you won't unless you have a font with Hebrew support, and I don't see either the Devnagari canda e 'ऍ' character or the cuneiform sign abtu times gan2 tenu '𒀘' (and yes, that
is the name of the sign; blame the linguists).
The easiest way to get Unicode into your post is to copy and paste it from another browser window -- copy-and-paste from a word-processor may or may not work, particularly with Microsoft's 'do it our way or don't play' attitude; you may find that the internal coding for, say, the Arabic font on your computer doesn't match the Unicode standard, so the document in the word-processor relies on that particular font to look right, and doesn't have glyphs in the right code positions.
The other way is to look the characters up in the Unicode character tables and encode them directly into your document. The way you do this is to use the same method HTML uses for other special characters; an ampersand '&', an octothorpe or pound sign '#', the coding number, and a semicolon. So for the Cyrillic 'ш' you would write 'ш', since the Unicode index for that character is 1096.