Package com.ibm.text

Classes for RuleBasedBreakIterator, DictionaryBasedBreakIterator Normalizer, RuleBasedNumberFormat, BigNumberFormat, Unicode compression, International Searching, and Transliterator.

See:
          Description

Interface Summary
Replaceable Replaceable is an interface that supports the operation of replacing a substring with another piece of text.
SymbolTable An interface that maps strings to objects.
UnicodeFilter UnicodeFilter defines a protocol for selecting a subset of the full range (U+0000 to U+FFFF) of Unicode characters.
 

Class Summary
ArabicShaping Shape Arabic text on a character basis.
BreakDictionary This is the class that represents the list of known words used by DictionaryBasedBreakIterator.
BreakIterator A class that locates boundaries in text.
ChineseDateFormat A concrete DateFormat for ChineseCalendar.
ChineseDateFormatSymbols A subclass of DateFormatSymbols for ChineseDateFormat.
ComposedCharIter ComposedCharIter is an iterator class that returns all of the precomposed characters defined in the Unicode standard, along with their decomposed forms.
CompoundTransliterator A transliterator that is composed of two or more other transliterator objects linked together.
DateFormat DateFormat is an abstract class for date/time formatting subclasses which formats and parses dates or time in a language-independent manner.
DateFormatSymbols DateFormatSymbols is a public class for encapsulating localizable date-time formatting data, such as the names of the months, the names of the days of the week, and the time zone data.
DecimalFormat DecimalFormat is a concrete subclass of NumberFormat that formats decimal numbers.
DecimalFormatSymbols This class represents the set of symbols (such as the decimal separator, the grouping separator, and so on) needed by DecimalFormat to format numbers.
DictionaryBasedBreakIterator A subclass of RuleBasedBreakIterator that adds the ability to use a dictionary to further subdivide ranges of text beyond what is possible using just the state-table-based algorithm.
HangulJamoTransliterator A transliterator that converts Hangul to Jamo
HexToUnicodeTransliterator A transliterator that converts from hexadecimal Unicode escape sequences to the characters they represent.
JamoHangulTransliterator A transliterator that converts Jamo to Hangul
Normalizer Normalizer transforms Unicode text into an equivalent composed or decomposed form, allowing for easier sorting and searching of text.
Normalizer.Mode This class represents the mode of a Normalizer object, i.e. the Unicode Normalization Form of the text that the Normalizer produces.
NullTransliterator A transliterator that leaves text unchanged.
NumberFormat NumberFormat is the abstract base class for all number formats.
ReplaceableString ReplaceableString is an adapter class that implements the Replaceable API around an ordinary StringBuffer.
RuleBasedBreakIterator A subclass of BreakIterator whose behavior is specified using a list of rules.
RuleBasedNumberFormat A class that formats numbers according to a set of rules.
RuleBasedTransliterator RuleBasedTransliterator is a transliterator that reads a set of rules in order to determine how to perform translations.
SearchIterator SearchIterator is an abstract base class that provides methods to search for a pattern within a text string.
SimpleDateFormat SimpleDateFormat is a concrete class for formatting and parsing dates in a locale-sensitive manner.
StringCharacterIterator StringCharacterIterator implements the CharacterIterater protocol for a String.
StringSearch StringSearch is a SearchIterator that provides language-sensitive text searching based on the comparison rules defined in a RuleBasedCollator object.
Transliterator Transliterator is an abstract class that transliterates text from one format to another.
Transliterator.Position Position structure for incremental transliteration.
UCharacter A static class designed to be a generic code point information source that handles surrogate pairs.
Data for code point information originates from Unicode 3.0 data files, UnicodeData.txt and Mirror.txt, downloadable from the Unicode Consortium site ftp://ftp.unicode.org/Public/
ICU's gennames and genprops programs are used to compact the information from the above mentioned files before being used by this package.
UCharacterCategory Enumerated Unicode category types from the UnicodeData.txt file.
UCharacterDirection Enumerated Unicode character linguistic direction constants.
UnicodeCompressor A compression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6.
UnicodeDecompressor A decompression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6.
UnicodeFilterLogic UnicodeFilterLogic provides logical operators on UnicodeFilter objects.
UnicodeSet A mutable set of Unicode characters.
UnicodeToHexTransliterator A transliterator that converts from Unicode characters to hexadecimal Unicode escape sequences.
UTF16 Standalone utility class providing UTF16 character conversions and indexing conversions.
UTF16.StringComparator Compare strings using Unicode code point order, instead of UTF-16 code unit order.
 

Exception Summary
ArabicShapingException Thrown by ArabicShaping when there is a shaping error.
 

Package com.ibm.text Description

Classes for RuleBasedBreakIterator, DictionaryBasedBreakIterator Normalizer, RuleBasedNumberFormat, BigNumberFormat, Unicode compression, International Searching, and Transliterator.



Copyright (c) 1998-2000 IBM Corporation and others.