Package net.sf.saxon.str
Class BMPString
java.lang.Object
net.sf.saxon.str.UnicodeString
net.sf.saxon.str.BMPString
- All Implemented Interfaces:
- Comparable<UnicodeString>,- AtomicMatchKey
An implementation of 
UnicodeString that wraps a Java string which is known to contain
 no surrogates. That is, all the characters in the string are in the Basic Multilingual
 Plane (their codepoints are in the range 0-65535 and not in the surrogate range).- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionintcodePointAt(long index) Get the code point at a given position in the stringGet an iterator over the code points present in the string.intcompareTo(UnicodeString other) Compare this string to another using codepoint comparisonconcat(UnicodeString other) Concatenate with another string, returning a new stringbooleanintgetWidth()Get the number of bits needed to hold all the characters in this stringinthashCode()Compute a hashCode.longindexOf(int codePoint) Get the position of the first occurrence of the specified codepoint, starting the search at the beginninglongindexOf(int codePoint, long from) Get the position of the first occurrence of the specified codepoint, starting the search at a given position in the stringlongindexWhere(IntPredicate predicate, long from) Get the position of the first occurrence of the specified codepoint, starting the search at a given position in the stringbooleanisEmpty()Ask whether the string is emptylonglength()Get the length of the stringstatic UnicodeStringWrap a String, which must contain no surrogatessubstring(long start, long end) Get a substring of this string, with a given start and end positiontoString()Methods inherited from class net.sf.saxon.str.UnicodeStringasAtomic, checkSubstringBounds, economize, estimatedLength, hasSubstring, indexOf, length32, prefix, requireInt, requireNonNegativeInt, substring, tidy, verifyCharacters
- 
Constructor Details- 
BMPStringProtected constructor- Parameters:
- baseString- the string to be wrapped: the caller is responsible for ensuring this contains no surrogates
 
 
- 
- 
Method Details- 
ofWrap a String, which must contain no surrogates- Parameters:
- base- the string. The caller warrants that this string contains no surrogates; this condition is checked only if Java assertions are enabled.
- Returns:
- the wrapped string.
 
- 
lengthpublic long length()Description copied from class:UnicodeStringGet the length of the string- Specified by:
- lengthin class- UnicodeString
- Returns:
- the number of code points in the string
 
- 
isEmptypublic boolean isEmpty()Description copied from class:UnicodeStringAsk whether the string is empty- Overrides:
- isEmptyin class- UnicodeString
- Returns:
- true if the length of the string is zero
 
- 
getWidthpublic int getWidth()Description copied from class:UnicodeStringGet the number of bits needed to hold all the characters in this string- Specified by:
- getWidthin class- UnicodeString
- Returns:
- 7 for ascii characters (not used??), 8 for latin-1, 16 for BMP, 24 for general Unicode.
 
- 
codePointsDescription copied from class:UnicodeStringGet an iterator over the code points present in the string.- Specified by:
- codePointsin class- UnicodeString
- Returns:
- an iterator that delivers the individual code points
 
- 
indexOfpublic long indexOf(int codePoint) Description copied from class:UnicodeStringGet the position of the first occurrence of the specified codepoint, starting the search at the beginning- Overrides:
- indexOfin class- UnicodeString
- Parameters:
- codePoint- the sought codePoint
- Returns:
- the position (0-based) of the first occurrence found, or -1 if not found, counting codePoints rather than UTF16 chars.
 
- 
indexOfpublic long indexOf(int codePoint, long from) Description copied from class:UnicodeStringGet the position of the first occurrence of the specified codepoint, starting the search at a given position in the string- Specified by:
- indexOfin class- UnicodeString
- Parameters:
- codePoint- the sought codePoint
- from- the position from which the search should start (0-based). A negative value is treated as zero. A position beyond the end of the string results in a return value of -1 (meaning not found).
- Returns:
- the position (0-based) of the first occurrence found, or -1 if not found
 
- 
indexWhereGet the position of the first occurrence of the specified codepoint, starting the search at a given position in the string- Specified by:
- indexWherein class- UnicodeString
- Parameters:
- predicate- condition that the codepoint must satisfy
- from- the position from which the search should start (0-based). A negative value is treated as zero. A position beyond the end of the string results in a return value of -1 (meaning not found).
- Returns:
- the position (0-based) of the first codepoint to match the predicate, or -1 if not found
 
- 
codePointAtpublic int codePointAt(long index) Description copied from class:UnicodeStringGet the code point at a given position in the string- Specified by:
- codePointAtin class- UnicodeString
- Parameters:
- index- the given position (0-based)
- Returns:
- the code point at the given position
 
- 
substringDescription copied from class:UnicodeStringGet a substring of this string, with a given start and end position- Specified by:
- substringin class- UnicodeString
- Parameters:
- start- the start position (0-based): that is, the position of the first code point to be included
- end- the end position (0-based): specifically, the position of the first code point not to be included
- Returns:
- the requested substring
 
- 
concatDescription copied from class:UnicodeStringConcatenate with another string, returning a new string- Overrides:
- concatin class- UnicodeString
- Parameters:
- other- the string to be appended
- Returns:
- the result of concatenating this string followed by the other
 
- 
compareToDescription copied from class:UnicodeStringCompare this string to another using codepoint comparison- Specified by:
- compareToin interface- Comparable<UnicodeString>
- Overrides:
- compareToin class- UnicodeString
- Parameters:
- other- the other string
- Returns:
- -1 if this string comes first, 0 if they are equal, +1 if the other string comes first
 
- 
equals- Overrides:
- equalsin class- UnicodeString
 
- 
hashCodepublic int hashCode()Description copied from class:UnicodeStringCompute a hashCode. All implementations ofUnicodeStringuse compatible hash codes and the hashing algorithm is therefore identical to that forjava.lang.String. This means that for strings containing Astral characters, the hash code needs to be computed by decomposing an Astral character into a surrogate pair.- Overrides:
- hashCodein class- UnicodeString
- Returns:
- the hash code
 
- 
toString
 
-