ICU 72.1
The CollationElementIterator class is used as an iterator to walk through
each character of an international string.
#include <coleitr.h>
Public Types
enum { NULLORDER = (int32_t)0xffffffff }
Public Member Functions
CollationElementIterator (const CollationElementIterator &other)
    Copy constructor.
virtual ~CollationElementIterator ()
    Destructor.
bool operator== (const CollationElementIterator &other) const
    Returns true if "other" is the same as "this".
bool operator!= (const CollationElementIterator &other) const
    Returns true if "other" is not the same as "this".
void reset (void)
    Resets the cursor to the beginning of the string.
int32_t next (UErrorCode &status)
    Gets the ordering priority of the next character in the string.
int32_t previous (UErrorCode &status)
    Gets the ordering priority of the previous collation element in the string.
int32_t getMaxExpansion (int32_t order) const
    Returns the maximum length of any expansion sequence that ends with the specified comparison order.
int32_t strengthOrder (int32_t order) const
    Gets the comparison order in the desired strength.
void setText (const UnicodeString &str, UErrorCode &status)
    Sets the source string.
void setText (CharacterIterator &str, UErrorCode &status)
    Sets the source string.
int32_t getOffset (void) const
    Gets the offset of the currently processed character in the source string.
void setOffset (int32_t newOffset, UErrorCode &status)
    Sets the offset of the currently processed character in the source string.
virtual UClassID getDynamicClassID () const override
    ICU "poor man's RTTI", returns a UClassID for the actual class.
UCollationElements * toUCollationElements ()
const UCollationElements * toUCollationElements () const
Public Member Functions inherited from icu::UObject
virtual ~UObject ()
    Destructor.
virtual UClassID getDynamicClassID () const
    ICU4C "poor man's RTTI", returns a UClassID for the actual ICU class.
Static Public Member Functions
static int32_t primaryOrder (int32_t order)
    Gets the primary order of a collation order.
static int32_t secondaryOrder (int32_t order)
    Gets the secondary order of a collation order.
static int32_t tertiaryOrder (int32_t order)
    Gets the tertiary order of a collation order.
static UBool isIgnorable (int32_t order)
    Checks if a comparison order is ignorable.
static UClassID getStaticClassID ()
    ICU "poor man's RTTI", returns a UClassID for this class.
static CollationElementIterator * fromUCollationElements (UCollationElements *uc)
static const CollationElementIterator * fromUCollationElements (const UCollationElements *uc)
Friends
class RuleBasedCollator
class UCollationPCE
Use the iterator to return the ordering priority of the positioned character. The ordering priority of a character, which we refer to as a key, defines how a character is collated in the given collation object. For example, consider the following in Slovak and in traditional Spanish collation:
"ca" -> the first key is key('c') and the second key is key('a').
"cha" -> the first key is key('ch') and the second key is key('a').
And in German phonebook collation,
"æb" -> the first key is key('a'), the second key is key('e'), and the third key is key('b').
The key of a character is an integer composed of a primary order (short), a secondary order (char), and a tertiary order (char). Java strictly defines the size and signedness of its primitive data types, whereas C++ does not. Therefore, the static functions primaryOrder(), secondaryOrder(), and tertiaryOrder() return int32_t to ensure the correctness of the key value.
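The composition of a key can be illustrated with a small standalone sketch. It assumes the primary order occupies the upper 16 bits of the int32_t, the secondary order the next 8 bits, and the tertiary order the low 8 bits, and that an ignorable order is one whose primary part is zero. The helper names packOrder, primaryOf, secondaryOf, tertiaryOf, and hasZeroPrimary are hypothetical and are not part of ICU; real code should call the static functions of CollationElementIterator instead.

```cpp
#include <cstdint>

// Hypothetical helpers that model a 16/8/8-bit collation order.
// They are illustrative only and are not ICU functions.

// Builds an order from its three parts (assumed layout: primary in the top 16 bits).
inline int32_t packOrder(uint16_t primary, uint8_t secondary, uint8_t tertiary) {
    uint32_t bits = (static_cast<uint32_t>(primary) << 16) |
                    (static_cast<uint32_t>(secondary) << 8) |
                    static_cast<uint32_t>(tertiary);
    return static_cast<int32_t>(bits);
}

// Each accessor works on the unsigned bit pattern and returns an int32_t,
// so the result does not depend on the platform's short or char.
inline int32_t primaryOf(int32_t order) {
    return static_cast<int32_t>((static_cast<uint32_t>(order) >> 16) & 0xffffu);
}

inline int32_t secondaryOf(int32_t order) {
    return static_cast<int32_t>((static_cast<uint32_t>(order) >> 8) & 0xffu);
}

inline int32_t tertiaryOf(int32_t order) {
    return static_cast<int32_t>(static_cast<uint32_t>(order) & 0xffu);
}

// Assumption: "ignorable" means the primary part is zero.
inline bool hasZeroPrimary(int32_t order) {
    return (static_cast<uint32_t>(order) & 0xffff0000u) == 0;
}
```

Under these assumptions, packOrder(0x1234, 0x56, 0x78) splits back into 0x1234, 0x56, and 0x78, and an order with a zero primary part is reported as ignorable.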
Example of the iterator usage: (without error checking)
#include <unicode/coleitr.h>
#include <unicode/tblcoll.h>
#include <unicode/unistr.h>
using namespace icu;
void CollationElementIterator_Example()
{
    UnicodeString str = "This is a test";
    UErrorCode success = U_ZERO_ERROR;
    // createInstance() returns a Collator*; the example assumes it is a RuleBasedCollator.
    RuleBasedCollator* rbc =
        (RuleBasedCollator*) RuleBasedCollator::createInstance(success);
    // The caller owns the returned iterator and must delete it.
    CollationElementIterator* c = rbc->createCollationElementIterator( str );
    int32_t order = c->next(success);   // first collation element
    c->reset();                         // required before changing direction
    order = c->previous(success);       // last collation element
    delete c;
    delete rbc;
}
The method next() returns the collation order of the next character based on the comparison level of the collator. The method previous() returns the collation order of the previous character based on the comparison level of the collator. The CollationElementIterator moves in only one direction between calls to reset(), setOffset(), or setText(); that is, calls to next() and previous() cannot be interleaved. Whenever previous() is to be called after next(), or vice versa, reset(), setOffset(), or setText() must be called first to reset the iteration state. After reset() or setText(), the following call returns the first collation order (for next()) or the last collation order (for previous()); after setOffset(), it returns the collation order at the specified position. If the direction is changed without one of these calls, the result is undefined.
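The direction rule can be modeled with a toy iterator. The sketch below is not ICU code: ToyElements, kNullOrder, drainForward, and drainBackward are hypothetical names, the iterator walks a plain vector of orders, and it assumes that the end of the sequence is signalled by a sentinel with the same value as NULLORDER. Where the real class leaves a direction change without reset() undefined, the toy simply returns the sentinel.

```cpp
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Toy stand-in for an element iterator; hypothetical, not an ICU class.
class ToyElements {
public:
    // Same bit pattern as (int32_t)0xffffffff.
    static constexpr int32_t kNullOrder = -1;

    explicit ToyElements(std::vector<int32_t> orders) : orders_(std::move(orders)) {}

    // Forgets the current direction; the next call decides where iteration starts.
    void reset() { dir_ = 0; pos_ = 0; }

    // Starts at the front if no direction is set, then moves forward only.
    int32_t next() {
        if (dir_ == 0) { pos_ = 0; dir_ = 1; }
        if (dir_ != 1 || pos_ >= orders_.size()) return kNullOrder;
        return orders_[pos_++];
    }

    // Starts at the back if no direction is set, then moves backward only.
    int32_t previous() {
        if (dir_ == 0) { pos_ = orders_.size(); dir_ = -1; }
        if (dir_ != -1 || pos_ == 0) return kNullOrder;
        return orders_[--pos_];
    }

private:
    std::vector<int32_t> orders_;
    std::size_t pos_ = 0;
    int dir_ = 0;  // 0 = undecided, 1 = forward, -1 = backward
};

// Collects every order front to back; reset() makes the direction change safe.
std::vector<int32_t> drainForward(ToyElements& it) {
    std::vector<int32_t> out;
    it.reset();
    for (int32_t o = it.next(); o != ToyElements::kNullOrder; o = it.next()) {
        out.push_back(o);
    }
    return out;
}

// Collects every order back to front, again starting from a reset.
std::vector<int32_t> drainBackward(ToyElements& it) {
    std::vector<int32_t> out;
    it.reset();
    for (int32_t o = it.previous(); o != ToyElements::kNullOrder; o = it.previous()) {
        out.push_back(o);
    }
    return out;
}
```

The point of the sketch is the call pattern: each traversal begins with reset(), so next() and previous() are never mixed within one pass.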
The result of a forward iteration (next()) and the reversed result of a backward iteration (previous()) over the same string are equivalent if collation orders with the value 0 are ignored. A collation order consists of a primary order, a secondary order, and a tertiary order. The data type of the collation order is int32_t.
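The equivalence can be checked mechanically once both traversals have been collected. The following sketch assumes the orders have already been gathered into two vectors, one from repeated next() calls and one from repeated previous() calls; withoutZeroOrders and sameElements are hypothetical helper names, not ICU functions.

```cpp
#include <algorithm>
#include <cstdint>
#include <iterator>
#include <vector>

// Hypothetical helper: returns a copy of the sequence without orders equal to 0.
std::vector<int32_t> withoutZeroOrders(const std::vector<int32_t>& orders) {
    std::vector<int32_t> kept;
    std::copy_if(orders.begin(), orders.end(), std::back_inserter(kept),
                 [](int32_t o) { return o != 0; });
    return kept;
}

// Hypothetical helper: true if the forward sequence equals the reversed
// backward sequence once zero-valued orders are ignored on both sides.
bool sameElements(const std::vector<int32_t>& forward, std::vector<int32_t> backward) {
    std::reverse(backward.begin(), backward.end());
    return withoutZeroOrders(forward) == withoutZeroOrders(backward);
}
```

A caller would fill the first vector with next() until the end of the string, reset the iterator, fill the second with previous(), and then pass both to sameElements.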
Note, CollationElementIterator should not be subclassed.
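anonymous enum
NULLORDER = (int32_t)0xffffffff
icu::CollationElementIterator::CollationElementIterator (const CollationElementIterator &other)
Copy constructor.
virtual icu::CollationElementIterator::~CollationElementIterator () [virtual]
Destructor.
static CollationElementIterator * icu::CollationElementIterator::fromUCollationElements (UCollationElements *uc) [inline, static]
static const CollationElementIterator * icu::CollationElementIterator::fromUCollationElements (const UCollationElements *uc) [inline, static]
virtual UClassID icu::CollationElementIterator::getDynamicClassID () const [override, virtual]
ICU "poor man's RTTI", returns a UClassID for the actual class.
Reimplemented from icu::UObject.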
int32_t icu::CollationElementIterator::getMaxExpansion (int32_t order) const
Returns the maximum length of any expansion sequence that ends with the specified comparison order.
Parameters:
    order: a collation order returned by previous() or next().
int32_t icu::CollationElementIterator::getOffset (void) const
Gets the offset of the currently processed character in the source string.
static UClassID icu::CollationElementIterator::getStaticClassID () [static]
ICU "poor man's RTTI", returns a UClassID for this class.
static UBool icu::CollationElementIterator::isIgnorable (int32_t order) [inline, static]
Checks if a comparison order is ignorable.
int32_t icu::CollationElementIterator::next (UErrorCode &status)
Gets the ordering priority of the next character in the string.
Parameters:
    status: the error code status.
bool icu::CollationElementIterator::operator!= (const CollationElementIterator &other) const
Returns true if "other" is not the same as "this".
Parameters:
    other: the object to be compared.
bool icu::CollationElementIterator::operator== (const CollationElementIterator &other) const
Returns true if "other" is the same as "this".
Parameters:
    other: the object to be compared.
int32_t icu::CollationElementIterator::previous (UErrorCode &status)
Gets the ordering priority of the previous collation element in the string.
Parameters:
    status: the error code status.
static int32_t icu::CollationElementIterator::primaryOrder (int32_t order) [inline, static]
Gets the primary order of a collation order.
void icu::CollationElementIterator::reset (void)
Resets the cursor to the beginning of the string.
static int32_t icu::CollationElementIterator::secondaryOrder (int32_t order) [inline, static]
Gets the secondary order of a collation order.
void icu::CollationElementIterator::setOffset (int32_t newOffset, UErrorCode &status)
Sets the offset of the currently processed character in the source string.
Parameters:
    newOffset: the new offset.
    status: the error code status.
void icu::CollationElementIterator::setText (CharacterIterator &str, UErrorCode &status)
Sets the source string.
Parameters:
    str: the source character iterator.
    status: the error code status.
void icu::CollationElementIterator::setText (const UnicodeString &str, UErrorCode &status)
Sets the source string.
Parameters:
    str: the source string.
    status: the error code status.
int32_t icu::CollationElementIterator::strengthOrder (int32_t order) const
Gets the comparison order in the desired strength.
Differences below that strength are ignored.
Parameters:
    order: the order value.
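One way to picture "ignoring the other differences" is as a bit mask over the order. The sketch below is an assumption-laden model, not the ICU implementation: it assumes the 16/8/8-bit layout (primary, secondary, tertiary), and the Strength enum and maskToStrength function are hypothetical names standing in for the collator's strength setting.

```cpp
#include <cstdint>

// Hypothetical stand-in for the collator's strength setting; not an ICU type.
enum class Strength { Primary, Secondary, Tertiary };

// Clears the parts of an order that the given strength does not compare.
// Assumed layout: primary in bits 31..16, secondary in 15..8, tertiary in 7..0.
inline int32_t maskToStrength(int32_t order, Strength s) {
    uint32_t bits = static_cast<uint32_t>(order);
    if (s == Strength::Primary) {
        bits &= 0xffff0000u;   // keep only the primary part
    } else if (s == Strength::Secondary) {
        bits &= 0xffffff00u;   // keep primary and secondary parts
    }
    // Tertiary strength keeps every part.
    return static_cast<int32_t>(bits);
}
```

With this model, two orders that differ only in their tertiary part become equal once both are masked to secondary strength.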
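static int32_t icu::CollationElementIterator::tertiaryOrder (int32_t order) [inline, static]
Gets the tertiary order of a collation order.
UCollationElements * icu::CollationElementIterator::toUCollationElements () [inline]
const UCollationElements * icu::CollationElementIterator::toUCollationElements () const [inline]
friend class RuleBasedCollator [friend]
friend class UCollationPCE [friend]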