ICU 50.1.2
The CollationElementIterator class is used as an iterator to walk through each character of an international string.
#include <coleitr.h>
Public Types | |
enum | { NULLORDER = (int32_t)0xffffffff } |
Public Member Functions | |
CollationElementIterator (const CollationElementIterator &other) | |
Copy constructor. | |
virtual | ~CollationElementIterator () |
Destructor. | |
UBool | operator== (const CollationElementIterator &other) const |
Returns true if "other" is the same as "this". | |
UBool | operator!= (const CollationElementIterator &other) const |
Returns true if "other" is not the same as "this". | |
void | reset (void) |
Resets the cursor to the beginning of the string. | |
int32_t | next (UErrorCode &status) |
Gets the ordering priority of the next character in the string. | |
int32_t | previous (UErrorCode &status) |
Gets the ordering priority of the previous collation element in the string. | |
int32_t | getMaxExpansion (int32_t order) const |
Returns the maximum length of any expansion sequence that ends with the specified comparison order. | |
int32_t | strengthOrder (int32_t order) const |
Gets the comparison order in the desired strength. | |
void | setText (const UnicodeString &str, UErrorCode &status) |
Sets the source string. | |
void | setText (CharacterIterator &str, UErrorCode &status) |
Sets the source string. | |
int32_t | getOffset (void) const |
Gets the offset of the currently processed character in the source string. | |
void | setOffset (int32_t newOffset, UErrorCode &status) |
Sets the offset of the currently processed character in the source string. | |
virtual UClassID | getDynamicClassID () const |
ICU "poor man's RTTI", returns a UClassID for the actual class. | |
Public Member Functions inherited from icu::UObject | |
virtual | ~UObject () |
Destructor. | |
Static Public Member Functions | |
static int32_t | primaryOrder (int32_t order) |
Gets the primary order of a collation order. | |
static int32_t | secondaryOrder (int32_t order) |
Gets the secondary order of a collation order. | |
static int32_t | tertiaryOrder (int32_t order) |
Gets the tertiary order of a collation order. | |
static UBool | isIgnorable (int32_t order) |
Checks if a comparison order is ignorable. | |
static UClassID | getStaticClassID () |
ICU "poor man's RTTI", returns a UClassID for this class. | |
Protected Member Functions | |
CollationElementIterator (const UnicodeString &sourceText, const RuleBasedCollator *order, UErrorCode &status) | |
CollationElementIterator constructor. | |
CollationElementIterator (const CharacterIterator &sourceText, const RuleBasedCollator *order, UErrorCode &status) | |
CollationElementIterator constructor. | |
const CollationElementIterator & | operator= (const CollationElementIterator &other) |
Assignment operator. | |
Friends | |
class | RuleBasedCollator |
The CollationElementIterator class is used as an iterator to walk through each character of an international string.
Use the iterator to return the ordering priority of the positioned character. The ordering priority of a character, which we refer to as a key, defines how a character is collated in the given collation object. For example, consider the following in Spanish:
"ca" -> the first key is key('c') and the second key is key('a').
"cha" -> the first key is key('ch') and the second key is key('a').
And in German,
"æb" -> the first key is key('a'), the second key is key('e'), and the third key is key('b').
The key of a character is an integer composed of a primary order (short), a secondary order (char), and a tertiary order (char). Java strictly defines the size and signedness of its primitive data types, whereas C++ does not. Therefore, the static functions primaryOrder(), secondaryOrder(), and tertiaryOrder() return int32_t to ensure the correctness of the key value.
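The composition of a key can be illustrated with a small standalone sketch. It assumes the layout implied by the description above (primary order in the upper 16 bits, secondary order in the next 8 bits, tertiary order in the lowest 8 bits) and that an ignorable order is one whose primary part is zero. The names makeOrder, primaryOf, secondaryOf, tertiaryOf and isIgnorableOrder are hypothetical stand-ins, not the ICU functions themselves.

```cpp
#include <cstdint>

// Hypothetical helpers that mirror the role of primaryOrder(),
// secondaryOrder(), tertiaryOrder() and isIgnorable().
// ASSUMPTION: 16-bit primary | 8-bit secondary | 8-bit tertiary.

// Pack three weights into one 32-bit collation order.
inline int32_t makeOrder(int32_t primary, int32_t secondary, int32_t tertiary) {
    uint32_t packed = ((static_cast<uint32_t>(primary) & 0xffffu) << 16) |
                      ((static_cast<uint32_t>(secondary) & 0xffu) << 8) |
                      (static_cast<uint32_t>(tertiary) & 0xffu);
    return static_cast<int32_t>(packed);
}

// Extract the upper 16 bits; the unsigned shift avoids sign extension.
inline int32_t primaryOf(int32_t order) {
    return static_cast<int32_t>((static_cast<uint32_t>(order) >> 16) & 0xffffu);
}

// Extract bits 8..15.
inline int32_t secondaryOf(int32_t order) {
    return static_cast<int32_t>((static_cast<uint32_t>(order) >> 8) & 0xffu);
}

// Extract the lowest 8 bits.
inline int32_t tertiaryOf(int32_t order) {
    return static_cast<int32_t>(static_cast<uint32_t>(order) & 0xffu);
}

// ASSUMPTION: an order is ignorable when its primary weight is zero.
inline bool isIgnorableOrder(int32_t order) {
    return primaryOf(order) == 0;
}
```

Returning a full int32_t from each accessor keeps the extracted weight non-negative regardless of whether char is signed on the target platform.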
Example of the iterator usage: (without error checking)
    void CollationElementIterator_Example()
    {
        UnicodeString str = "This is a test";
        UErrorCode success = U_ZERO_ERROR;
        RuleBasedCollator* rbc =
            (RuleBasedCollator*) RuleBasedCollator::createInstance(success);
        CollationElementIterator* c =
            rbc->createCollationElementIterator( str );
        int32_t order = c->next(success);
        c->reset();
        order = c->previous(success);
        delete c;
        delete rbc;
    }
CollationElementIterator::next returns the collation order of the next character based on the comparison level of the collator. CollationElementIterator::previous returns the collation order of the previous character based on the comparison level of the collator.
The Collation Element Iterator moves in only one direction between calls to CollationElementIterator::reset. That is, CollationElementIterator::next and CollationElementIterator::previous cannot be interleaved. Whenever CollationElementIterator::previous is to be called after CollationElementIterator::next or vice versa, CollationElementIterator::reset has to be called first to reset the status, shifting pointers to either the end or the start of the string. Hence the next call of CollationElementIterator::previous or CollationElementIterator::next returns the last or the first collation order, respectively. If the direction is changed without a CollationElementIterator::reset, the result is undefined.
The result of a forward iteration (CollationElementIterator::next) and the reversed result of a backward iteration (CollationElementIterator::previous) over the same string are equivalent if collation orders with the value UCOL_IGNORABLE are ignored. A collation order consists of a primary order, a secondary order and a tertiary order. The data type of the collation order is int32_t.
Note, CollationElementIterator should not be subclassed.
anonymous enum |
icu::CollationElementIterator::CollationElementIterator | ( | const CollationElementIterator & | other | ) |
virtual icu::CollationElementIterator::~CollationElementIterator | ( | ) |
virtual |
Destructor.
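icu::CollationElementIterator::CollationElementIterator | ( | const UnicodeString & | sourceText, const RuleBasedCollator * | order, UErrorCode & | status | ) |
protected |
CollationElementIterator constructor.
This takes the source string and the collation object. The cursor walks through the source string based on the predefined collation rules. If the source string is empty, NULLORDER is returned on calls to next().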
sourceText | the source string. |
order | the collation object. |
status | the error code status. |
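icu::CollationElementIterator::CollationElementIterator | ( | const CharacterIterator & | sourceText, const RuleBasedCollator * | order, UErrorCode & | status | ) |
protected |
CollationElementIterator constructor.
This takes the source string and the collation object. The cursor walks through the source string based on the predefined collation rules. If the source string is empty, NULLORDER is returned on calls to next().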
sourceText | the source string. |
order | the collation object. |
status | the error code status. |
virtual UClassID icu::CollationElementIterator::getDynamicClassID | ( | ) | const |
virtual |
ICU "poor man's RTTI", returns a UClassID for the actual class.
Implements icu::UObject.
int32_t icu::CollationElementIterator::getMaxExpansion | ( | int32_t | order | ) | const |
inline |
Returns the maximum length of any expansion sequence that ends with the specified comparison order.
order | a collation order returned by previous or next. |
Definition at line 386 of file coleitr.h.
References ucol_getMaxExpansion().
int32_t icu::CollationElementIterator::getOffset | ( | void | ) | const |
Gets the offset of the currently processed character in the source string.
static UClassID icu::CollationElementIterator::getStaticClassID | ( | ) |
static |
ICU "poor man's RTTI", returns a UClassID for this class.
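static UBool icu::CollationElementIterator::isIgnorable | ( | int32_t | order | ) |
inline static |
Checks if a comparison order is ignorable.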
int32_t icu::CollationElementIterator::next | ( | UErrorCode & | status | ) |
Gets the ordering priority of the next character in the string.
status | the error code status. |
UBool icu::CollationElementIterator::operator!= | ( | const CollationElementIterator & | other | ) | const |
Returns true if "other" is not the same as "this".
other | the object to be compared |
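const CollationElementIterator & icu::CollationElementIterator::operator= | ( | const CollationElementIterator & | other | ) |
protected |
Assignment operator.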
UBool icu::CollationElementIterator::operator== | ( | const CollationElementIterator & | other | ) | const |
Returns true if "other" is the same as "this".
other | the object to be compared |
int32_t icu::CollationElementIterator::previous | ( | UErrorCode & | status | ) |
Gets the ordering priority of the previous collation element in the string.
status | the error code status. |
static int32_t icu::CollationElementIterator::primaryOrder | ( | int32_t | order | ) |
inline static |
Gets the primary order of a collation order.
order | the collation order |
void icu::CollationElementIterator::reset | ( | void | ) |
Resets the cursor to the beginning of the string.
static int32_t icu::CollationElementIterator::secondaryOrder | ( | int32_t | order | ) |
inline static |
Gets the secondary order of a collation order.
order | the collation order |
void icu::CollationElementIterator::setOffset | ( | int32_t | newOffset, |
UErrorCode & | status | ||
) |
Sets the offset of the currently processed character in the source string.
newOffset | the new offset. |
status | the error code status. |
void icu::CollationElementIterator::setText | ( | const UnicodeString & | str, |
UErrorCode & | status | ||
) |
Sets the source string.
str | the source string. |
status | the error code status. |
void icu::CollationElementIterator::setText | ( | CharacterIterator & | str, |
UErrorCode & | status | ||
) |
Sets the source string.
str | the source character iterator. |
status | the error code status. |
int32_t icu::CollationElementIterator::strengthOrder | ( | int32_t | order | ) | const |
Gets the comparison order in the desired strength.
Differences beyond that strength are ignored.
order | The order value |
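One way to picture "the comparison order in the desired strength" is as a bit mask over the order. The sketch below is standalone and rests on two assumptions not stated on this page: that the order uses a 16/8/8-bit primary/secondary/tertiary layout, and that lower-strength differences are discarded by clearing the corresponding low bits. The Strength enum and maskToStrength are hypothetical names.

```cpp
#include <cstdint>

// Hypothetical stand-in for the collator's strength setting.
enum class Strength { Primary, Secondary, Tertiary };

// ASSUMPTION: 16-bit primary | 8-bit secondary | 8-bit tertiary layout.
// Clears the parts of `order` that are weaker than the requested strength.
inline int32_t maskToStrength(int32_t order, Strength strength) {
    uint32_t bits = static_cast<uint32_t>(order);
    if (strength == Strength::Primary) {
        bits &= 0xffff0000u;   // keep primary differences only
    } else if (strength == Strength::Secondary) {
        bits &= 0xffffff00u;   // keep primary and secondary differences
    }
    // Tertiary strength keeps the order unchanged.
    return static_cast<int32_t>(bits);
}
```

Two orders that differ only in tertiary weight compare equal after masking at secondary strength, which is the effect the description refers to.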
static int32_t icu::CollationElementIterator::tertiaryOrder | ( | int32_t | order | ) |
inline static |
Gets the tertiary order of a collation order.
order | the collation order |
friend class RuleBasedCollator |
friend |