Unicode

Introduction

ASCII
Back in the old days of punched cards a 7-bit character coding system was used. 7 bits gave 2⁷; = 128 possible combinations, enough for 26×2 letters, 10 numbers, about 15 punctuation characters, and 20 or so symbols. Finally, 33 of the codes were used as control characters, e.g. line feed, tab, bell etc. The del code was taken as the last character, number 127, that is 111111 in binary. This meant all 7 spaces representing the character on the card were punched out, thus allowing any mistakes to be deleted.
Extended ASCII
European languages require letters with diacritical marks (accents), for example: in France à, á, â, ç (grave, acute, circumflex, cedilla); in Spain ñ (tilde); in Germany ä, å (diaeresis, ring). Other characters are required, such as æ, ¥, ß, £, ©, etc., so the 7-bit system was extended to 8 bits, with these new letters taking values from 128 to 255. This is a clean result, as each character is now represented by one byte. This is the system that is in general use in the western world today.
However, the 256 characters were really not quite enough. So the first 128 letters are usually the same, but the last 128 depend on what language you are using. So the Latin-1 set is for West Europe, Latin-2 for Central and East Europe, Latin-3 is additional (e.g. Catalan, Turkish) and Latin-4 for other additional (e.g. Estonian, Lappish). Other systems for Russia etc. exist. An altogether different set also in common use is the symbol set, basically for use in mathematics, containing Greek letters and mathematical operators.
Unicode Standard
The above system becomes a problem if you wish to exchange documents with people who use different character sets. For example, if you are using a Latin-1 font and your friend has used a Latin-2 font, then a () in your friends document will appear as a () for you. A second problem are the thousands of characters from China, Japan and Korea (CJK), for which other systems exist.
Unicode provides a consistent way of encoding multilingual plain text and brings order to the chaotic state of affairs outlined above. The Unicode Standard provides the capacity to uniquely encode all of the characters used for the written languages of the world. It uses a 16 bit (2 byte) encoding allowing for over 65,000 characters.
Each character is assigned a unique name that specifies it and no other. For example, U+0041 is assigned the character name "LATIN CAPITAL LETTER A". The standard defines rules for the working of composite characters (characters generated by combining others, e.g. �). Many such characters exit in their own right (as for �).

UCS - Universal (Multiple-Octet Coded) Character Set
More accurately called UCS-4, this is a massive character set taking 31 bits to specify a character. Note that Octet is just another name for a byte. UCS-4 allows for 2^31 = 2,147,483,648 encoding points. The Unicode system, which can be reffered to as UCS-2, corresponds exactly to the first 65,536 entries of UCS-4.
Usefull Links
Unicode

Unicode Map

00 10 20 30 40 50 60 70 80 90 A0 B0 C0 D0 E0 F0

01 11 21 31 41 51 61 71 81 91 A1 B1 C1 D1 E1 F1

02 12 22 32 42 52 62 72 82 92 A2 B2 C2 D2 E2 F2

03 13 23 33 43 53 63 73 83 93 A3 B3 C3 D3 E3 F3

04 14 24 34 44 54 64 74 84 94 A4 B4 C4 D4 E4 F4

05 15 25 35 45 55 65 75 85 95 A5 B5 C5 D5 E5 F5

06 16 26 36 46 56 66 76 86 96 A6 B6 C6 D6 E6 F6

07 17 27 37 47 57 67 77 87 97 A7 B7 C7 D7 E7 F7

08 18 28 38 48 58 68 78 88 98 A8 B8 C8 D8 E8 F8

09 19 29 39 49 59 69 79 89 99 A9 B9 C9 D9 E9 F9

0A 1A 2A 3A 4A 5A 6A 7A 8A 9A AA BA CA DA EA FA

0B 1B 2B 3B 4B 5B 6B 7B 8B 9B AB BB CB DB EB FB

0C 1C 2C 3C 4C 5C 6C 7C 8C 9C AC BC CC DC EC FC

0D 1D 2D 3D 4D 5D 6D 7D 8D 9D AD BD CD DD ED FD

0E 1E 2E 3E 4E 5E 6E 7E 8E 9E AE BE CE DE EE FE

0F 1F 2F 3F 4F 5F 6F 7F 8F 9F AF BF CF DF EF FF

E&OE!