Non-ASCII Symbols in the SCA Armorial Database

by Iulstan Sigewealding
9 December 1997

Since January 1996, the SCA Ordinary database (oanda.db) has begun to encode non-ASCII symbols in names and blazons. The encoding is mostly complete for items registered since July 1983, but only sporadic before that date. In other words, about 80% of the database has been revised.

When a Latin-1 encoding exists, the non-ASCII symbol is encoded in accordance with that standard. The resulting code is an 8-bit byte with the most-significant bit set to 1, as detailed in the table below. (Unfortunately, these 8-bit codes are NOT compatible with the "437" code page normally active on PCs running MS-DOS in the United States.)

The columns in the table are as follows:

 Cnt Dec Oct Hx   Eq Daud Name..................................
   2 192 300 C0 À A  {'A} LATIN CAPITAL LETTER A WITH GRAVE
  45 193 301 C1 Á A  {A'} LATIN CAPITAL LETTER A WITH ACUTE
   3 194 302 C2 Â A  {A^} LATIN CAPITAL LETTER A WITH CIRCUMFLEX
   4 196 304 C4 Ä A  {A:} LATIN CAPITAL LETTER A WITH DIAERESIS
   5 197 305 C5 Å A  {Ao} LATIN CAPITAL LETTER A WITH RING ABOVE
 128 198 306 C6 Æ AE {AE} LATIN CAPITAL LIGATURE AE
   1 199 307 C7 Ç C  {C,} LATIN CAPITAL LETTER C WITH CEDILLA
 117 201 311 C9 É E  {E'} LATIN CAPITAL LETTER E WITH ACUTE
   6 205 315 CD Í I  {I'} LATIN CAPITAL LETTER I WITH ACUTE
   4 210 322 D2 Ò O  {'O} LATIN CAPITAL LETTER O WITH GRAVE
 164 211 323 D3 Ó O  {O'} LATIN CAPITAL LETTER O WITH ACUTE
  13 214 326 D6 Ö O  {O:} LATIN CAPITAL LETTER O WITH DIAERESIS
  21 216 330 D8 Ø O  {O/} LATIN CAPITAL LETTER O WITH STROKE
  14 218 332 DA Ú U  {U'} LATIN CAPITAL LETTER U WITH ACUTE
   1 221 335 DD Ý Y  {Y'} LATIN CAPITAL LETTER Y WITH ACUTE
  12 222 336 DE Þ Th {Th} LATIN CAPITAL LETTER THORN
   7 223 337 DF ß s  {sz} LATIN SMALL LETTER SHARP S
  58 224 340 E0 à a  {'a} LATIN SMALL LETTER A WITH GRAVE
1074 225 341 E1 á a  {a'} LATIN SMALL LETTER A WITH ACUTE
  45 226 342 E2 â a  {a^} LATIN SMALL LETTER A WITH CIRCUMFLEX
   1 227 343 E3 ã a  {a~} LATIN SMALL LETTER A WITH TILDE
 237 228 344 E4 ä a  {a:} LATIN SMALL LETTER A WITH DIAERESIS
  45 229 345 E5 å a  {ao} LATIN SMALL LETTER A WITH RING ABOVE
 102 230 346 E6 æ ae {ae} LATIN SMALL LIGATURE AE
  44 231 347 E7 ç c  {c,} LATIN SMALL LETTER C WITH CEDILLA
 161 232 350 E8 è e  {'e} LATIN SMALL LETTER E WITH GRAVE
 942 233 351 E9 é e  {e'} LATIN SMALL LETTER E WITH ACUTE
  47 234 352 EA ê e  {e^} LATIN SMALL LETTER E WITH CIRCUMFLEX
  14 235 353 EB ë e  {e:} LATIN SMALL LETTER E WITH DIAERESIS
  13 236 354 EC ì i  {'i} LATIN SMALL LETTER I WITH GRAVE
 847 237 355 ED í i  {i'} LATIN SMALL LETTER I WITH ACUTE
  12 238 356 EE î i  {i^} LATIN SMALL LETTER I WITH CIRCUMFLEX
   3 239 357 EF ï i  {i:} LATIN SMALL LETTER I WITH DIAERESIS
  51 240 360 F0 ð dh {dh} LATIN SMALL LETTER ETH
  24 241 361 F1 ñ n  {n~} LATIN SMALL LETTER N WITH TILDE
  60 242 362 F2 ò o  {'o} LATIN SMALL LETTER O WITH GRAVE
 515 243 363 F3 ó o  {o'} LATIN SMALL LETTER O WITH ACUTE
  22 244 364 F4 ô o  {o^} LATIN SMALL LETTER O WITH CIRCUMFLEX
 330 246 366 F6 ö o  {o:} LATIN SMALL LETTER O WITH DIAERESIS
  61 248 370 F8 ø o  {o/} LATIN SMALL LETTER O WITH STROKE
  28 249 371 F9 ù u  {'u} LATIN SMALL LETTER U WITH GRAVE
 151 250 372 FA ú u  {u'} LATIN SMALL LETTER U WITH ACUTE
  11 251 373 FB û u  {u^} LATIN SMALL LETTER U WITH CIRCUMFLEX
 264 252 374 FC ü u  {u:} LATIN SMALL LETTER U WITH DIAERESIS
  24 253 375 FD ý y  {y'} LATIN SMALL LETTER Y WITH ACUTE
   4 254 376 FE þ th {th} LATIN SMALL LETTER THORN
   2 255 377 FF ÿ y  {y:} LATIN SMALL LETTER Y WITH DIAERESIS

Twenty-four of the rarer symbols do not have Latin-1 encodings. In the database, these symbols have been reduced to the nearest ASCII equivalent:

Cnt Eq Daud Name................................
   2 C  {Cv} LATIN CAPITAL LETTER C WITH CARON
   2 OE {OE} LATIN CAPITAL LIGATURE OE
   1 S  {S'} LATIN CAPITAL LETTER S WITH ACUTE
   3 S  {Sv} LATIN CAPITAL LETTER S WITH CARON
   8 Z  {Zv} LATIN CAPITAL LETTER Z WITH CARON
  10 a  {a-} LATIN SMALL LETTER A WITH MACRON
   4 c  {cv} LATIN SMALL LETTER C WITH CARON
   2 e  {e,} LATIN SMALL LETTER E WITH OGONEK
   4 e  {e-} LATIN SMALL LETTER E WITH MACRON
   2 e  {ev} LATIN SMALL LETTER E WITH CARON
  12 i  {i-} LATIN SMALL LETTER I WITH MACRON
   5 m  {m.} LATIN SMALL LETTER M WITH DOT BELOW
   5 n  {n.} LATIN SMALL LETTER N WITH DOT BELOW
   2 o  {o,} LATIN SMALL LETTER O WITH OGONEK
   4 o  {o-} LATIN SMALL LETTER O WITH MACRON
   1 oe {oe} LATIN SMALL LIGATURE OE
   2 r  {rv} LATIN SMALL LETTER R WITH CARON
   5 s  {s.} LATIN SMALL LETTER S WITH DOT BELOW
   4 s  {sv} LATIN SMALL LETTER S WITH CARON
   2 u  {u,} LATIN SMALL LETTER U WITH OGONEK
   5 u  {u-} LATIN SMALL LETTER U WITH MACRON
   1 w  {w^} LATIN SMALL LETTER W WITH CIRCUMFLEX
   2 z  {z'} LATIN SMALL LETTER Z WITH ACUTE
   4 z  {zv} LATIN SMALL LETTER Z WITH CARON

Related web pages:


config.web version 980308 written by Stephen Gold / sgold@best.com
Copyright © 1996,1997 Society for Creative Anachronism, Inc.