8/16/2000 nhotta@netscape.com
Link to the original Apple spec: Technote 1141, "Extending and Controlling Sherlock"
Charset-related tag attributes:
queryCharset - a charset name string for server queries; specify in the "SEARCH" section
charset - a charset name string used by the server's result HTML; specify in the "INTERPRET" section
queryEncoding - a charset encoding ID for server queries; specify in the "SEARCH" section
resultEncoding - a charset encoding ID used by the server's result HTML; specify in the "INTERPRET" section
Default
If no charset is specified, "ISO-8859-1" is used as the default charset.
Precedence
A charset name takes precedence over an encoding ID.
For example, if both "queryCharset" and "queryEncoding" are specified
in a data set file, "queryCharset" is used.
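The default and precedence rules above can be sketched as a small function. This is only an illustration, not code from the Mozilla source: ResolveCharset and its parameter names are hypothetical, and it assumes the encoding ID has already been translated to a charset name (an empty string meaning "not specified").

```cpp
#include <string>

// Hypothetical helper (not from nsInternetSearchService.cpp): picks the
// charset to use for one section of a data set file.
//   charsetAttr    - value of queryCharset / charset, empty if absent
//   mappedEncoding - charset name derived from queryEncoding /
//                    resultEncoding, empty if absent
std::string ResolveCharset(const std::string& charsetAttr,
                           const std::string& mappedEncoding)
{
    if (!charsetAttr.empty())
        return charsetAttr;       // a charset name wins over an encoding ID
    if (!mappedEncoding.empty())
        return mappedEncoding;    // otherwise use the encoding ID's charset
    return "ISO-8859-1";          // documented default when nothing is given
}
```

With both attributes present, ResolveCharset("UTF-8", "Shift_JIS") yields "UTF-8"; with neither, it yields the default.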
Example 1:
Specifying "UTF-8" as the query charset in the SEARCH section.
<SEARCH
queryCharset="UTF-8"
 name="Netscape Search"
 description = "Netscape Search"
...............................................
>
...............................................
</SEARCH>
 
Example 2:
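Specifying queryEncoding as "2561" (Shift_JIS) in the SEARCH section and resultEncoding
as "2336" (EUC-JP) in the INTERPRET section. The INTERPRET section also sets charset to
"EUC-JP", which agrees with the encoding ID.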
<SEARCH
...............................................
 queryEncoding="2561"
 ...............................................
<INTERPRET
 charset = "EUC-JP"
 resultEncoding = "2336"
 ...............................................
>
</SEARCH>
Mapping table between encoding IDs and charset names
(defined in MapEncoding() in nsInternetSearchService.cpp):
  { "0", "x-mac-roman" },
  { "6", "x-mac-greek" },
  { "35", "x-mac-turkish" },
  { "513", "ISO-8859-1" },
  { "514", "ISO-8859-2" },
  { "517", "ISO-8859-5" },
  { "518", "ISO-8859-6" },
  { "519", "ISO-8859-7" },
  { "520", "ISO-8859-8" },
  { "521", "ISO-8859-9" },
  { "1049", "IBM864" },
  { "1280", "windows-1252" },
  { "1281", "windows-1250" },
  { "1282", "windows-1251" },
  { "1283", "windows-1253" },
  { "1284", "windows-1254" },
  { "1285", "windows-1255" },
  { "1286", "windows-1256" },
  { "1536", "us-ascii" },
  { "1584", "GB2312" },
  { "1585", "x-gbk"  },
  { "1600", "EUC-KR" },
  { "2080", "ISO-2022-JP" },
  { "2096", "ISO-2022-CN" },
  { "2112", "ISO-2022-KR" },
  { "2336", "EUC-JP" },
  { "2352", "GB2312" },
  { "2353", "x-euc-tw" },
  { "2368", "EUC-KR" },
  { "2561", "Shift_JIS" },
  { "2562", "KOI8-R" },
  { "2563", "Big5"  },
  { "2565", "HZ-GB-2312" },
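A table like this is typically consulted with a simple linear search. The sketch below is an assumption about how such a lookup could work, not the actual body of MapEncoding(): the names EncodingEntry, kEncodings, and LookupCharset are hypothetical, only a few rows of the table are copied, and falling back to ISO-8859-1 for an unknown ID is an assumption based on the documented default.

```cpp
#include <string>

// One row of the table: encoding ID (as a string) and charset name.
struct EncodingEntry {
    const char* id;
    const char* charset;
};

// A few rows copied from the table above; the full table has more entries.
static const EncodingEntry kEncodings[] = {
    { "0",    "x-mac-roman" },
    { "513",  "ISO-8859-1"  },
    { "2336", "EUC-JP"      },
    { "2561", "Shift_JIS"   },
};

// Hypothetical lookup: returns the charset name for an encoding ID.
// Assumption: an ID that is not in the table falls back to ISO-8859-1.
std::string LookupCharset(const std::string& encodingId)
{
    for (const EncodingEntry& entry : kEncodings) {
        if (encodingId == entry.id)
            return entry.charset;
    }
    return "ISO-8859-1";
}
```

For the IDs used in Example 2, LookupCharset("2561") gives "Shift_JIS" and LookupCharset("2336") gives "EUC-JP".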