net.sf.saxon.codenorm
public class NormalizerData extends Object
Copyright © 1998-1999 Unicode, Inc. All Rights Reserved.
The Unicode Consortium makes no expressed or implied warranty of any
kind, and assumes no liability for errors or omissions.
No liability is assumed for incidental and consequential damages
in connection with or arising out of the use of the information here.
Field Summary | |
---|---|
static String | copyright |
static int | NOT_COMPOSITE: Constant for use in getPairwiseComposition. |
Constructor Summary | |
---|---|
NormalizerData(IntToIntMap canonicalClass, IntHashMap decompose, IntToIntMap compose, BitSet isCompatibility, BitSet isExcluded) | Only accessed by NormalizerBuilder. |
Method Summary | |
---|---|
int | getCanonicalClass(int ch): Gets the combining class of a character from the Unicode Character Database. |
boolean | getExcluded(char ch): Accessible for testing only. |
char | getPairwiseComposition(int first, int second): Returns the composite of the two characters. |
String | getRawDecompositionMapping(char ch): Accessible for testing only. |
void | getRecursiveDecomposition(boolean canonical, int ch, StringBuffer buffer): Gets the recursive decomposition of a character from the Unicode Character Database. |
getCanonicalClass
Parameters: ch - the source character
Returns: value from 0 to 255
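The combining class is the value normalization uses to put combining marks into canonical order. The following is a minimal sketch of that idea, not Saxon's implementation: `CombiningClassSketch` and its `reorder` helper are hypothetical names, and the three-entry table is a hand-written stand-in for the data that the real class loads from the Unicode Character Database.

```java
import java.util.HashMap;
import java.util.Map;

final class CombiningClassSketch {
    // Hypothetical stand-in for the canonicalClass map; only three marks are listed.
    private static final Map<Integer, Integer> CLASSES = new HashMap<>();
    static {
        CLASSES.put(0x0327, 202); // COMBINING CEDILLA
        CLASSES.put(0x0323, 220); // COMBINING DOT BELOW
        CLASSES.put(0x0301, 230); // COMBINING ACUTE ACCENT
    }

    /** Returns the combining class (0 to 255); unlisted characters are treated as class 0. */
    static int getCanonicalClass(int ch) {
        return CLASSES.getOrDefault(ch, 0);
    }

    /** Stable insertion sort of combining marks by class; class-0 characters never move. */
    static String reorder(String s) {
        int[] cps = s.codePoints().toArray();
        for (int i = 1; i < cps.length; i++) {
            int cp = cps[i];
            int cc = getCanonicalClass(cp);
            if (cc == 0) {
                continue; // starters stay where they are
            }
            int j = i;
            // Shift marks with a strictly higher class to the right; stops at any starter.
            while (j > 0 && getCanonicalClass(cps[j - 1]) > cc) {
                cps[j] = cps[j - 1];
                j--;
            }
            cps[j] = cp;
        }
        return new String(cps, 0, cps.length);
    }
}
```

In this sketch, an acute accent (class 230) typed before a cedilla (class 202) is moved after it, because lower classes sort first within a run of marks.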
getPairwiseComposition
Parameters: first - first character (e.g. 'c'); second - second character (e.g. the combining cedilla, U+0327)
Returns: composite (e.g. 'ç')
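To make the pairwise composition concrete, here is a rough sketch that approximates it with the JDK's `java.text.Normalizer` rather than with Saxon's own tables. The class name `PairwiseCompositionSketch` is hypothetical, and the sentinel value used for `NOT_COMPOSITE` is an assumption, since this page does not state the constant's value.

```java
import java.text.Normalizer;

final class PairwiseCompositionSketch {
    // Assumed sentinel value; the real constant's value is not given on this page.
    static final char NOT_COMPOSITE = '\uFFFF';

    /** Approximates pairwise composition by NFC-normalizing the two-character sequence. */
    static char getPairwiseComposition(int first, int second) {
        String pair = new StringBuilder()
                .appendCodePoint(first)
                .appendCodePoint(second)
                .toString();
        String composed = Normalizer.normalize(pair, Normalizer.Form.NFC);
        // A result of exactly one char means the pair combined into a single BMP character.
        return composed.length() == 1 ? composed.charAt(0) : NOT_COMPOSITE;
    }
}
```

With this sketch, `getPairwiseComposition('c', 0x0327)` yields 'ç' (U+00E7), while a pair that does not combine, such as 'c' and 'x', yields the sentinel.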
getRecursiveDecomposition
Parameters: canonical - if true, selects the recursive canonical decomposition; otherwise selects the recursive compatibility and canonical decomposition
ch - the source character
buffer - buffer to be filled with the decomposition
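The difference between the two settings of `canonical` can be illustrated with the JDK's `java.text.Normalizer`, using NFD for the canonical case and NFKD for the compatibility case. This is only a sketch under that substitution: `RecursiveDecompositionSketch` is a hypothetical class, and the real method reads its own decomposition tables rather than calling the JDK.

```java
import java.text.Normalizer;

final class RecursiveDecompositionSketch {
    /** Appends the full decomposition of ch to buffer, using NFD or NFKD as a stand-in. */
    static void getRecursiveDecomposition(boolean canonical, int ch, StringBuffer buffer) {
        String source = new String(Character.toChars(ch));
        // canonical == true: canonical mappings only; false: compatibility mappings as well.
        Normalizer.Form form = canonical ? Normalizer.Form.NFD : Normalizer.Form.NFKD;
        buffer.append(Normalizer.normalize(source, form));
    }
}
```

For example, U+01D5 (Ǖ) decomposes recursively to U, U+0308, U+0304 under either setting, whereas the ligature U+FB01 (ﬁ) is left unchanged when `canonical` is true and becomes "fi" when it is false.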