2183 lines
51 KiB
HTML
2183 lines
51 KiB
HTML
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
|
|
<html>
|
|
<head>
|
|
<meta name="generator" content="HTML Tidy, see www.w3.org">
|
|
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
|
|
<link type="text/css" rel="stylesheet" href="style.css"><!-- Generated by The Open Group's rhtm tool v1.2.1 -->
|
|
<!-- Copyright (c) 2001 The Open Group, All Rights Reserved -->
|
|
<title>Character Set</title>
|
|
</head>
|
|
<body bgcolor="white">
|
|
<script type="text/javascript" language="JavaScript" src="../jscript/codes.js">
|
|
</script>
|
|
|
|
<basefont size="3"> <!--header start-->
|
|
<center><font size="2">The Open Group Base Specifications Issue 6<br>
|
|
IEEE Std 1003.1-2001<br>
|
|
Copyright © 2001 The IEEE and The Open Group, All Rights reserved.</font></center>
|
|
|
|
<!--header end-->
|
|
<hr size="2" noshade>
|
|
<h2><a name="tag_06"></a>Character Set</h2>
|
|
|
|
<h3><a name="tag_06_01"></a>Portable Character Set</h3>
|
|
|
|
<p>Conforming implementations shall support one or more coded character sets. Each supported locale shall include the <i>portable
|
|
character set</i>, which is the set of symbolic names for characters in <a href="#tagtcjh_3">Portable Character Set</a> . This is
|
|
used to describe characters within the text of IEEE Std 1003.1-2001. The first eight entries in <a href=
|
|
"#tagtcjh_3">Portable Character Set</a> are defined in the ISO/IEC 6429:1992 standard and the rest of the characters are
|
|
defined in the ISO/IEC 10646-1:2000 standard.</p>
|
|
|
|
<center><b><a name="tagtcjh_3"></a> Table: Portable Character Set</b></center>
|
|
|
|
<center>
|
|
<table border="1" cellpadding="3" align="center">
|
|
<tr valign="top">
|
|
<th align="center">
|
|
<p class="tent"><b>Symbolic Name</b></p>
|
|
</th>
|
|
<th align="center">
|
|
<p class="tent"><b>Glyph</b></p>
|
|
</th>
|
|
<th align="center">
|
|
<p class="tent"><b>UCS</b></p>
|
|
</th>
|
|
<th align="center">
|
|
<p class="tent"><b>Description</b></p>
|
|
</th>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><NUL></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0000></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">NULL (NUL)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><alert></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0007></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">BELL (BEL)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><backspace></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0008></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">BACKSPACE (BS)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><tab></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0009></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">CHARACTER TABULATION (HT)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><carriage-return></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U000D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">CARRIAGE RETURN (CR)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><newline></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U000A></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LINE FEED (LF)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><vertical-tab></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U000B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LINE TABULATION (VT)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><form-feed></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U000C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">FORM FEED (FF)</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><space></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0020></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">SPACE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><exclamation-mark></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">!</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0021></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">EXCLAMATION MARK</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><quotation-mark></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">"</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0022></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">QUOTATION MARK</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><number-sign></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">#</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0023></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">NUMBER SIGN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><dollar-sign></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">$</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0024></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DOLLAR SIGN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><percent-sign></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">%</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0025></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">PERCENT SIGN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><ampersand></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">&</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0026></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">AMPERSAND</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><apostrophe></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">'</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0027></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">APOSTROPHE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><left-parenthesis></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">(</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0028></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LEFT PARENTHESIS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><right-parenthesis></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">)</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0029></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">RIGHT PARENTHESIS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><asterisk></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">*</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002A></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">ASTERISK</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><plus-sign></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">+</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">PLUS SIGN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><comma></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">,</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">COMMA</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><hyphen-minus></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">-</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">HYPHEN-MINUS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><hyphen></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">-</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">HYPHEN-MINUS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><full-stop></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">.</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">FULL STOP</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><period></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">.</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">FULL STOP</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><slash></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">/</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002F></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">SOLIDUS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><solidus></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">/</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U002F></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">SOLIDUS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><zero></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">0</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0030></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT ZERO</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><one></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">1</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0031></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT ONE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><two></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">2</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0032></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT TWO</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><three></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">3</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0033></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT THREE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><four></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">4</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0034></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT FOUR</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><five></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">5</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0035></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT FIVE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><six></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">6</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0036></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT SIX</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><seven></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">7</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0037></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT SEVEN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><eight></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">8</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0038></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT EIGHT</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><nine></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">9</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0039></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">DIGIT NINE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><colon></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">:</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U003A></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">COLON</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><semicolon></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">;</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U003B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">SEMICOLON</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><less-than-sign></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"><</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U003C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LESS-THAN SIGN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><equals-sign></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">=</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U003D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">EQUALS SIGN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><greater-than-sign></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U003E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">GREATER-THAN SIGN</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><question-mark></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">?</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U003F></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">QUESTION MARK</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><commercial-at></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"> </p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0040></p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><A></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">A</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0041></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER A</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><B></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">B</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0042></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER B</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><C></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">C</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0043></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER C</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><D></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">D</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0044></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER D</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><E></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">E</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0045></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER E</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><F></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">F</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0046></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER F</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><G></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">G</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0047></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER G</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><H></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">H</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0048></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER H</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><I></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">I</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0049></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER I</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><J></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">J</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U004A></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER J</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><K></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">K</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U004B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER K</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><L></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">L</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U004C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER L</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><M></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">M</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U004D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER M</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><N></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">N</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U004E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER N</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><O></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">O</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U004F></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER O</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><P></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">P</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0050></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER P</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><Q></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">Q</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0051></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER Q</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><R></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">R</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0052></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER R</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><S></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">S</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0053></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER S</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><T></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">T</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0054></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER T</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><U></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">U</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0055></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER U</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><V></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">V</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0056></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER V</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><W></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">W</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0057></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER W</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><X></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">X</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0058></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER X</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><Y></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">Y</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0059></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER Y</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><Z></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">Z</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005A></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN CAPITAL LETTER Z</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><left-square-bracket></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">[</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LEFT SQUARE BRACKET</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><backslash></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">\</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">REVERSE SOLIDUS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><reverse-solidus></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">\</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">REVERSE SOLIDUS</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><right-square-bracket></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">]</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">RIGHT SQUARE BRACKET</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><circumflex-accent></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">^</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">CIRCUMFLEX ACCENT</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><circumflex></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">^</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">CIRCUMFLEX ACCENT</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><low-line></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">_</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005F></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LOW LINE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><underscore></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">_</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U005F></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LOW LINE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><grave-accent></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">`</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0060></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">GRAVE ACCENT</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><a></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">a</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0061></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER A</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><b></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">b</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0062></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER B</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><c></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">c</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0063></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER C</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><d></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">d</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0064></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER D</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><e></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">e</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0065></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER E</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><f></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">f</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0066></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER F</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><g></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">g</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0067></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER G</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><h></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">h</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0068></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER H</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><i></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">i</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0069></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER I</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><j></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">j</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U006A></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER J</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><k></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">k</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U006B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER K</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><l></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">l</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U006C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER L</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><m></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">m</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U006D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER M</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><n></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">n</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U006E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER N</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><o></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">o</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U006F></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER O</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><p></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">p</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0070></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER P</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><q></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">q</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0071></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER Q</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><r></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">r</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0072></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER R</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><s></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">s</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0073></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER S</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><t></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">t</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0074></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER T</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><u></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">u</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0075></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER U</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><v></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">v</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0076></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER V</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><w></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">w</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0077></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER W</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><x></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">x</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0078></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER X</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><y></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">y</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U0079></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER Y</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><z></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">z</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U007A></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LATIN SMALL LETTER Z</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><left-brace></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">{</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U007B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LEFT CURLY BRACKET</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><left-curly-bracket></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">{</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U007B></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">LEFT CURLY BRACKET</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><vertical-line></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">|</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U007C></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">VERTICAL LINE</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><right-brace></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">}</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U007D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">RIGHT CURLY BRACKET</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><right-curly-bracket></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">}</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U007D></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">RIGHT CURLY BRACKET</p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><tilde></p>
|
|
</td>
|
|
<td align="center">
|
|
<p class="tent">˜</p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><U007E></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent">TILDE</p>
|
|
</td>
|
|
</tr>
|
|
</table>
|
|
</center>
|
|
|
|
<p>IEEE Std 1003.1-2001 uses character names other than the above, but only in an informative way; for example, in
|
|
examples to illustrate the use of characters beyond the portable character set with the facilities of
|
|
IEEE Std 1003.1-2001.</p>
|
|
|
|
<p><a href="#tagtcjh_3">Portable Character Set</a> defines the characters in the portable character set and the corresponding
|
|
symbolic character names used to identify each character in a character set description file. The table contains more than one
|
|
symbolic character name for characters whose traditional name differs from the chosen name. Characters defined in <a href=
|
|
"#tagtcjh_4">Control Character Set</a> may also be used in character set description files.</p>
|
|
|
|
<p>IEEE Std 1003.1-2001 places only the following requirements on the encoded values of the characters in the portable
|
|
character set:</p>
|
|
|
|
<ul>
|
|
<li>
|
|
<p>If the encoded values associated with each member of the portable character set are not invariant across all locales supported
|
|
by the implementation, if an application accesses any pair of locales where the character encodings differ, or accesses data from
|
|
an application running in a locale which has different encodings from the application's current locale, the results are
|
|
unspecified.</p>
|
|
</li>
|
|
|
|
<li>
|
|
<p>The encoded values associated with the digits 0 to 9 shall be such that the value of each character after 0 shall be one greater
|
|
than the value of the previous character.</p>
|
|
</li>
|
|
|
|
<li>
|
|
<p>A null character, NUL, which has all bits set to zero, shall be in the set of characters.</p>
|
|
</li>
|
|
|
|
<li>
|
|
<p>The encoded values associated with the members of the portable character set are each represented in a single byte. Moreover, if
|
|
the value is stored in an object of C-language type <b>char</b>, it is guaranteed to be positive (except the NUL, which is always
|
|
zero).</p>
|
|
</li>
|
|
</ul>
|
|
|
|
<p>Conforming implementations shall support certain character and character set attributes, as defined in <a href=
|
|
"xbd_chap07.html#tag_07_02"><i>POSIX Locale</i></a> .</p>
|
|
|
|
<h3><a name="tag_06_02"></a>Character Encoding</h3>
|
|
|
|
<p>The POSIX locale contains the characters in <a href="#tagtcjh_3">Portable Character Set</a> , which have the properties listed
|
|
in <a href="xbd_chap07.html#tag_07_03_01"><i>LC_CTYPE</i></a> . In other locales, the presence, meaning, and representation of any
|
|
additional characters are locale-specific.</p>
|
|
|
|
<p>In locales other than the POSIX locale, a character may have a state-dependent encoding. There are two types of these
|
|
encodings:</p>
|
|
|
|
<ul>
|
|
<li>
|
|
<p>A single-shift encoding (where each character not in the initial shift state is preceded by a shift code) can be defined if each
|
|
shift-code and character sequence is considered a multi-byte character. This is done using the concatenated-constant format in a
|
|
character set description file, as described in <a href="#tag_06_04">Character Set Description File</a> . If the implementation
|
|
supports a character encoding of this type, all of the standard utilities in the Shell and Utilities volume of
|
|
IEEE Std 1003.1-2001 shall support it. Use of a single-shift encoding with any of the functions in the System Interfaces
|
|
volume of IEEE Std 1003.1-2001 that do not specifically mention the effects of state-dependent encoding is
|
|
implementation-defined.</p>
|
|
</li>
|
|
|
|
<li>
|
|
<p>A locking-shift encoding (where the state of the character is determined by a shift code that may affect more than the single
|
|
character following it) cannot be defined with the current character set description file format. Use of a locking-shift encoding
|
|
with any of the standard utilities in the Shell and Utilities volume of IEEE Std 1003.1-2001 or with any of the functions
|
|
in the System Interfaces volume of IEEE Std 1003.1-2001 that do not specifically mention the effects of state-dependent
|
|
encoding is implementation-defined.</p>
|
|
</li>
|
|
</ul>
|
|
|
|
<p>While in the initial shift state, all characters in the portable character set shall retain their usual interpretation and shall
|
|
not alter the shift state. The interpretation for subsequent bytes in the sequence shall be a function of the current shift state.
|
|
A byte with all bits zero shall be interpreted as the null character independent of shift state. Thus a byte with all bits zero
|
|
shall never occur in the second or subsequent bytes of a character.</p>
|
|
|
|
<p>The maximum allowable number of bytes in a character in the current locale shall be indicated by {MB_CUR_MAX}, defined in the <a
|
|
href="stdlib.h.html"><i><stdlib.h></i></a> header and by the <b><mb_cur_max></b> value in a character set description
|
|
file; see <a href="#tag_06_04">Character Set Description File</a> . The implementation's maximum number of bytes in a character
|
|
shall be defined by the C-language macro {MB_LEN_MAX}.</p>
|
|
|
|
<h3><a name="tag_06_03"></a>C Language Wide-Character Codes</h3>
|
|
|
|
<p>In the shell, the standard utilities are written so that the encodings of characters are described by the locale's
|
|
<i>LC_CTYPE</i> definition (see <a href="xbd_chap07.html#tag_07_03_01"><i>LC_CTYPE</i></a> ) and there is no differentiation
|
|
between characters consisting of single octets (8-bit bytes) or multiple bytes. However, in the C language, a differentiation is
|
|
made. To ease the handling of variable length characters, the C language has introduced the concept of wide-character codes.</p>
|
|
|
|
<p>All wide-character codes in a given process consist of an equal number of bits. This is in contrast to characters, which can
|
|
consist of a variable number of bytes. The byte or byte sequence that represents a character can also be represented as a
|
|
wide-character code. Wide-character codes thus provide a uniform size for manipulating text data. A wide-character code having all
|
|
bits zero is the null wide-character code (see <a href="xbd_chap03.html#tag_03_246"><i>Null Wide-Character Code</i></a> ), and
|
|
terminates wide-character strings (see <a href="xbd_chap03.html#tag_03_432"><i>Wide-Character Code (C Language)</i></a> ). The
|
|
wide-character value for each member of the portable character set shall equal its value when used as the lone character in an
|
|
integer character constant. Wide-character codes for other characters are locale and implementation-defined. State shift bytes
|
|
shall not have a wide-character code representation.</p>
|
|
|
|
<h3><a name="tag_06_04"></a>Character Set Description File</h3>
|
|
|
|
<p>Implementations shall provide a character set description file for at least one coded character set supported by the
|
|
implementation. These files are referred to elsewhere in IEEE Std 1003.1-2001 as <i>charmap</i> files. It is
|
|
implementation-defined whether or not users or applications can provide additional character set description files.</p>
|
|
|
|
<p>IEEE Std 1003.1-2001 does not require that multiple character sets or codesets be supported. Although multiple charmap
|
|
files are supported, it is the responsibility of the implementation to provide the file or files; if only one is provided, only
|
|
that one is accessible using the <a href="../utilities/localedef.html"><i>localedef</i></a> utility's <b>-f</b> option.</p>
|
|
|
|
<p>Each character set description file, except those that use the ISO/IEC 10646-1:2000 standard position values as the
|
|
encoding values, shall define characteristics for the coded character set and the encoding for the characters specified in <a href=
|
|
"#tagtcjh_3">Portable Character Set</a> , and may define encoding for additional characters supported by the implementation. Other
|
|
information about the coded character set may also be in the file. Coded character set character values shall be defined using
|
|
symbolic character names followed by character encoding values.</p>
|
|
|
|
<p>Each symbolic name specified in <a href="#tagtcjh_3">Portable Character Set</a> shall be included in the file and shall be
|
|
mapped to a unique coding value, except as noted below. The glyphs <tt>'{'</tt> , <tt>'}'</tt> , <tt>'_'</tt> , <tt>'-'</tt> ,
|
|
<tt>'/'</tt> , <tt>'\'</tt> , <tt>'.'</tt> , and <tt>'^'</tt> have more than one symbolic name; all symbolic names for each such
|
|
glyph shall be included, each with identical encoding. If some or all of the control characters identified in <a href=
|
|
"#tagtcjh_4">Control Character Set</a> are supported by the implementation, the symbolic names and their corresponding encoding
|
|
values shall be included in the file. Some of the encodings associated with the symbolic names in <a href="#tagtcjh_4">Control
|
|
Character Set</a> may be the same as characters found in <a href="#tagtcjh_3">Portable Character Set</a> ; both names shall be
|
|
provided for each encoding.<br>
|
|
</p>
|
|
|
|
<center><b><a name="tagtcjh_4"></a> Table: Control Character Set</b></center>
|
|
|
|
<center>
|
|
<table border="1" cellpadding="3" align="center">
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><ACK></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><DC2></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><ENQ></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><FS></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><IS4></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><SOH></p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><BEL></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><DC3></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><EOT></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><GS></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><LF></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><STX></p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><BS></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><DC4></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><ESC></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><HT></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><NAK></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><SUB></p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><CAN></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><DEL></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><ETB></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><IS1></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><RS></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><SYN></p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><CR></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><DLE></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><ETX></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><IS2></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><SI></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><US></p>
|
|
</td>
|
|
</tr>
|
|
|
|
<tr valign="top">
|
|
<td align="left">
|
|
<p class="tent"><DC1></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><EM></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><FF></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><IS3></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><SO></p>
|
|
</td>
|
|
<td align="left">
|
|
<p class="tent"><VT></p>
|
|
</td>
|
|
</tr>
|
|
</table>
|
|
</center>
|
|
|
|
<p>The following declarations can precede the character definitions. Each shall consist of the symbol shown in the following list,
|
|
starting in column 1, including the surrounding brackets, followed by one or more <blank>s, followed by the value to be
|
|
assigned to the symbol.</p>
|
|
|
|
<dl compact>
|
|
<dt><b><code_set_name></b></dt>
|
|
|
|
<dd>The name of the coded character set for which the character set description file is defined. The characters of the name shall
|
|
be taken from the set of characters with visible glyphs defined in <a href="#tagtcjh_3">Portable Character Set</a> .</dd>
|
|
|
|
<dt><b><mb_cur_max></b></dt>
|
|
|
|
<dd>The maximum number of bytes in a multi-byte character. This shall default to 1.</dd>
|
|
|
|
<dt><b><mb_cur_min></b></dt>
|
|
|
|
<dd>An unsigned positive integer value that defines the minimum number of bytes in a character for the encoded character set.
|
|
<sup>[<a href="javascript:open_code('XSI')">XSI</a>]</sup> <img src="../images/opt-start.gif" alt="[Option Start]" border="0">
|
|
On XSI-conformant systems, <b><mb_cur_min></b> shall always be 1. <img src="../images/opt-end.gif" alt="[Option End]"
|
|
border="0"></dd>
|
|
|
|
<dt><b><escape_char></b></dt>
|
|
|
|
<dd>The character used to indicate that the characters following shall be interpreted in a special way, as defined later in this
|
|
section. This shall default to backslash ( <tt>'\'</tt> ), which is the character used in all the following text and examples,
|
|
unless otherwise noted.</dd>
|
|
|
|
<dt><b><comment_char></b></dt>
|
|
|
|
<dd>The character that, when placed in column 1 of a charmap line, is used to indicate that the line shall be ignored. The default
|
|
character shall be the number sign ( <tt>'#'</tt> ).</dd>
|
|
</dl>
|
|
|
|
<p>The character set mapping definitions shall be all the lines immediately following an identifier line containing the string
|
|
<tt>"CHARMAP"</tt> starting in column 1, and preceding a trailer line containing the string <tt>"END CHARMAP"</tt> starting in
|
|
column 1. Empty lines and lines containing a <b><comment_char></b> in the first column shall be ignored. Each non-comment
|
|
line of the character set mapping definition (that is, between the <tt>"CHARMAP"</tt> and <tt>"END CHARMAP"</tt> lines of the file)
|
|
shall be in either of two forms:</p>
|
|
|
|
<blockquote>
|
|
<pre>
|
|
<tt>"%s %s %s\n", <</tt><i>symbolic-name</i><tt>>, <</tt><i>encoding</i><tt>>, <</tt><i>comments</i><tt>>
|
|
</tt>
|
|
</pre>
|
|
</blockquote>
|
|
|
|
<p>or:</p>
|
|
|
|
<blockquote>
|
|
<pre>
|
|
<tt>"%s...%s %s %s\n", <</tt><i>symbolic-name</i><tt>>, <</tt><i>symbolic-name</i><tt>>,
|
|
<</tt><i>encoding</i><tt>>, <</tt><i>comments</i><tt>>
|
|
</tt>
|
|
</pre>
|
|
</blockquote>
|
|
|
|
<p>In the first format, the line in the character set mapping definition shall define a single symbolic name and a corresponding
|
|
encoding. A symbolic name is one or more characters from the set shown with visible glyphs in <a href="#tagtcjh_3">Portable
|
|
Character Set</a> , enclosed between angle brackets. A character following an escape character is interpreted as itself; for
|
|
example, the sequence <tt>"<\\\>>"</tt> represents the symbolic name <tt>"\>"</tt> enclosed between angle brackets.</p>
|
|
|
|
<p>In the second format, the line in the character set mapping definition shall define a range of one or more symbolic names. In
|
|
this form, the symbolic names shall consist of zero or more non-numeric characters from the set shown with visible glyphs in <a
|
|
href="#tagtcjh_3">Portable Character Set</a> , followed by an integer formed by one or more decimal digits. Both integers shall
|
|
contain the same number of digits. The characters preceding the integer shall be identical in the two symbolic names, and the
|
|
integer formed by the digits in the second symbolic name shall be equal to or greater than the integer formed by the digits in the
|
|
first name. This shall be interpreted as a series of symbolic names formed from the common part and each of the integers between
|
|
the first and the second integer, inclusive. As an example, <j0101>...<j0104> is interpreted as the symbolic names
|
|
<j0101>, <j0102>, <j0103>, and <j0104>, in that order.</p>
|
|
|
|
<p>A character set mapping definition line shall exist for all symbolic names specified in <a href="#tagtcjh_3">Portable Character
|
|
Set</a> , and shall define the coded character value that corresponds to the character indicated in the table, or the coded
|
|
character value that corresponds to the control character symbolic name. If the control characters commonly associated with the
|
|
symbolic names in <a href="#tagtcjh_4">Control Character Set</a> are supported by the implementation, the symbolic name and the
|
|
corresponding encoding value shall be included in the file. Additional unique symbolic names may be included. A coded character
|
|
value can be represented by more than one symbolic name.</p>
|
|
|
|
<p>The encoding part is expressed as one (for single-byte character values) or more concatenated decimal, octal, or hexadecimal
|
|
constants in the following formats:</p>
|
|
|
|
<blockquote>
|
|
<pre>
|
|
<tt>"%cd%u", <</tt><i>escape_char</i><tt>>, <</tt><i>decimal byte value</i><tt>>
|
|
"%cx%x", <</tt><i>escape_char</i><tt>>, <</tt><i>hexadecimal byte value</i><tt>>
|
|
"%c%o", <</tt><i>escape_char</i><tt>>, <</tt><i>octal byte value</i><tt>>
|
|
</tt>
|
|
</pre>
|
|
</blockquote>
|
|
|
|
<p>Decimal constants shall be represented by two or three decimal digits, preceded by the escape character and the lowercase letter
|
|
<tt>'d'</tt> ; for example, <tt>"\d05"</tt> , <tt>"\d97"</tt> , or <tt>"\d143"</tt> . Hexadecimal constants shall be represented by
|
|
two hexadecimal digits, preceded by the escape character and the lowercase letter <tt>'x'</tt> ; for example, <tt>"\x05"</tt> ,
|
|
<tt>"\x61"</tt> , or <tt>"\x8f"</tt> . Octal constants shall be represented by two or three octal digits, preceded by the escape
|
|
character; for example, <tt>"\05"</tt> , <tt>"\141"</tt> , or <tt>"\217"</tt> . In a portable charmap file, each constant
|
|
represents an 8-bit byte. When constants are concatenated for multi-byte character values, they shall be of the same type, and
|
|
interpreted in byte order from first to last with the least significant byte of the multi-byte character specified by the last
|
|
constant. The manner in which these constants are represented in the character stored in the system is implementation-defined.
|
|
(This notation was chosen for reasons of portability. There is no requirement that the internal representation in the computer
|
|
memory be in this same order.) Omitting bytes from a multi-byte character definition produces undefined results.</p>
|
|
|
|
<p>In lines defining ranges of symbolic names, the encoded value shall be the value for the first symbolic name in the range (the
|
|
symbolic name preceding the ellipsis). Subsequent symbolic names defined by the range shall have encoding values in increasing
|
|
order. Bytes shall be treated as unsigned octets, and carry shall be propagated between the bytes as necessary to represent the
|
|
range. For example, the line:</p>
|
|
|
|
<blockquote>
|
|
<pre>
|
|
<tt><j0101>...<j0104> \d129\d254
|
|
</tt>
|
|
</pre>
|
|
</blockquote>
|
|
|
|
<p>is interpreted as:</p>
|
|
|
|
<blockquote>
|
|
<pre>
|
|
<tt><j0101> \d129\d254
|
|
<j0102> \d129\d255
|
|
<j0103> \d130\d0
|
|
<j0104> \d130\d1
|
|
</tt>
|
|
</pre>
|
|
</blockquote>
|
|
|
|
<p>The comment is optional.</p>
|
|
|
|
<p>The following declarations can follow the character set mapping definitions (after the <tt>"END CHARMAP"</tt> statement). Each
|
|
shall consist of the keyword shown in the following list, starting in column 1, followed by the value(s) to be associated to the
|
|
keyword, as defined below.</p>
|
|
|
|
<dl compact>
|
|
<dt><b>WIDTH</b></dt>
|
|
|
|
<dd>An unsigned positive integer value defining the column width (see <a href="xbd_chap03.html#tag_03_103"><i>Column
|
|
Position</i></a> ) for the printable characters in the coded character set specified in <a href="#tagtcjh_3">Portable Character
|
|
Set</a> and <a href="#tagtcjh_4">Control Character Set</a> . Coded character set character values shall be defined using symbolic
|
|
character names followed by column width values. Defining a character with more than one <b>WIDTH</b> produces undefined results.
|
|
The <b>END WIDTH</b> keyword shall be used to terminate the <b>WIDTH</b> definitions. Specifying the width of a non-printable
|
|
character in a <b>WIDTH</b> declaration produces undefined results.</dd>
|
|
|
|
<dt><b>WIDTH_DEFAULT</b></dt>
|
|
|
|
<dd><br>
|
|
An unsigned positive integer value defining the default column width for any printable character not listed by one of the
|
|
<b>WIDTH</b> keywords. If no <b>WIDTH_DEFAULT</b> keyword is included in the charmap, the default character width shall be 1.</dd>
|
|
</dl>
|
|
|
|
<hr>
|
|
<div class="box"><em>The following sections are informative.</em></div>
|
|
|
|
<h5><a name="tag_06_04_00_01"></a>Example</h5>
|
|
|
|
<p>After the <tt>"END CHARMAP"</tt> statement, a syntax for a width definition would be:</p>
|
|
|
|
<pre>
|
|
<tt>WIDTH
|
|
<A> 1
|
|
<B> 1
|
|
<C>...<Z> 1
|
|
...
|
|
<foo1>...<foon> 2
|
|
...
|
|
END WIDTH
|
|
</tt>
|
|
</pre>
|
|
|
|
<p>In this example, the numerical code point values represented by the symbols <b><A></b> and <b><B></b> are assigned a
|
|
width of 1. The code point values <b><C></b> to <b><Z></b> inclusive ( <b><C></b>, <b><D></b>,
|
|
<b><E></b>, and so on) are also assigned a width of 1. Using <b><A></b>... <b><Z></b> would have required fewer
|
|
lines, but the alternative was shown to demonstrate flexibility. The keyword <b>WIDTH_DEFAULT</b> could have been added as
|
|
appropriate.</p>
|
|
|
|
<div class="box"><em>End of informative text.</em></div>
|
|
|
|
<hr>
|
|
<h4><a name="tag_06_04_01"></a>State-Dependent Character Encodings</h4>
|
|
|
|
<p>This section addresses the use of state-dependent character encodings (that is, those in which the encoding of a character is
|
|
dependent on one or more shift codes that may precede it).</p>
|
|
|
|
<p>A single-shift encoding (where each character not in the initial shift state is preceded by a shift code) can be defined in the
|
|
charmap format if each shift-code/character sequence is considered a multi-byte character, defined using the concatenated-constant
|
|
format described in <a href="#tag_06_04">Character Set Description File</a> . If the implementation supports a character encoding
|
|
of this type, all of the standard utilities shall support it. A locking-shift encoding (where the state of the character is
|
|
determined by a shift code that may affect more than the single character following it) could be defined with an extension to the
|
|
charmap format described in <a href="#tag_06_04">Character Set Description File</a> . If the implementation supports a character
|
|
encoding of this type, any of the standard utilities that describe character (<i>versus</i> byte) or text-file manipulation shall
|
|
have the following characteristics:</p>
|
|
|
|
<ol>
|
|
<li>
|
|
<p>The utility shall process the statefully encoded data as a concatenation of state-independent characters. The presence of
|
|
redundant locking shifts shall not affect the comparison of two statefully encoded strings.</p>
|
|
</li>
|
|
|
|
<li>
|
|
<p>A utility that divides, truncates, or extracts substrings from statefully encoded data shall produce output that contains
|
|
locking shifts at the beginning or end of the resulting data, if appropriate, to retain correct state information.</p>
|
|
</li>
|
|
</ol>
|
|
|
|
<hr size="2" noshade>
|
|
<center><font size="2"><!--footer start-->
|
|
UNIX ® is a registered Trademark of The Open Group.<br>
|
|
POSIX ® is a registered Trademark of The IEEE.<br>
|
|
[ <a href="../mindex.html">Main Index</a> | <a href="../basedefs/contents.html">XBD</a> | <a href=
|
|
"../utilities/contents.html">XCU</a> | <a href="../functions/contents.html">XSH</a> | <a href="../xrat/contents.html">XRAT</a>
|
|
]</font></center>
|
|
|
|
<!--footer end-->
|
|
<hr size="2" noshade>
|
|
</body>
|
|
</html>
|
|
|