Sather Home Page

Section 4.1:
Repertoire Map File Format

The repertoire map file contains date enabling the construction of two maps - one from character encodings to token and the other from token to character encoding. This is required when carrying out string ordering in accordance with ISO/IEC 14651 which defines ordering in a number of passes using tokens to indicate the weight to be used on each pass.

The file consists of the sections described in the three following tables in the order given.

Header
Entity Octets Name
map size 4 a
octets per token 1 x
octets per code 1 y
Inmap Table
Entity Octets Name
>a times<
list size 1 b
  >b times<
 
Entity Octets
token bit-pattern x
code bit-pattern y
Outmap Table
Entity Octets Name
>a times<
token bit-pattern x
list size 1 c
  >c times<
 
Entity Octets
code value y

Specification Index Resources Index
Comments or enquiries should be made to Keith Hopper.
Page last modified: Tuesday, 24 October 2000.
Produced with Amaya