Unicode modifier characters are often considered part of a word, but when dealing with the apostrophe itself as a character it is not recommended to delimit punctuation rather than delimiting it. Thus, for example, the apostrophe letters can be used for spelling of Arabic glottal stop symbols (hamza), Cyrillic "soft mark", or c'h like Brittany. Independent 3 letters However, some people think that this letter should be used for nicknames in English instead of U + 0027 or U + 2019. ICANN also believes that U + 02BC is the correct function of the IDN Ukrainian nickname. This character is the same as U + 2019 in the Unicode code diagram. The distinction between apostrophes of letters and apostrophes of punctuation marks can not be used in practice, so we warn the Unicode standard that you should not assume that text is encoded.
The character set already defines the exact number representing the rule of each character. We have different definitions of these rules. Very popular are Unicode and ASCII. JavaScript works very nicely with the Unicode character set. In fact, the browser's Unicode indicates that 76 should represent L. To understand this, as mentioned earlier, the binary representation of the number 12 is 1100. 8 indicates that the computer must add a bit to the left of the 12 actual binary representation of the number to make it one byte. 12 must be stored as 00001100. Is it meaningful?
The Unicode character set can be encoded as a byte for storage or transfer in various ways called "encoding". It is well known that Unicode itself defines the encoding that covers the entire track, including UTF - 8 and UTF - 16. There are many other text encodings before Unicode, such as ASCII and ISO / IEC 8859, but in most cases these character tracks are a subset of the Unicode character set. In XML, you can use any encoding defined in Unicode and other encodings whose characters are displayed in Unicode. XML also provides a mechanism by which the XML processor can reliably determine which encoding is being used without prior knowledge. Each XML parser does not necessarily recognize encodings other than UTF - 8 and UTF - 16.