java.lang.Objectjava.lang.Character
All Implemented Interfaces:
Serializable, Comparable
Character class wraps a value of the primitive
type char in an object. An object of type
Character contains a single field whose type is
char.
In addition, this class provides several methods for determining a character's category (lowercase letter, digit, etc.) and for converting characters from uppercase to lowercase and vice versa.
Character information is based on the Unicode Standard, version 4.0.
The methods and data of class Character are defined by
the information in the UnicodeData file that is part of the
Unicode Character Database maintained by the Unicode
Consortium. This file specifies various properties including name
and general category for every defined Unicode code point or
character range.
The file and its description are available from the Unicode Consortium at:
The char data type (and therefore the value that a
Character object encapsulates) are based on the
original Unicode specification, which defined characters as
fixed-width 16-bit entities. The Unicode standard has since been
changed to allow for characters whose representation requires more
than 16 bits. The range of legal code points is now
U+0000 to U+10FFFF, known as Unicode scalar value.
(Refer to the
definition of the U+n notation in the Unicode
standard.)
The set of characters from U+0000 to U+FFFF is sometimes
referred to as the Basic Multilingual Plane (BMP). Characters whose code points are greater
than U+FFFF are called supplementary characters. The Java
2 platform uses the UTF-16 representation in char
arrays and in the String and StringBuffer
classes. In this representation, supplementary characters are
represented as a pair of char values, the first from
the high-surrogates range, (\uD800-\uDBFF), the
second from the low-surrogates range
(\uDC00-\uDFFF).
A char value, therefore, represents Basic
Multilingual Plane (BMP) code points, including the surrogate
code points, or code units of the UTF-16 encoding. An
int value represents all Unicode code points,
including supplementary code points. The lower (least significant)
21 bits of int are used to represent Unicode code
points and the upper (most significant) 11 bits must be zero.
Unless otherwise specified, the behavior with respect to
supplementary characters and surrogate char values is
as follows:
char value cannot support
supplementary characters. They treat char values from the
surrogate ranges as undefined characters. For example,
Character.isLetter('\uD840') returns false, even though
this specific value if followed by any low-surrogate value in a string
would represent a letter.
int value support all
Unicode characters, including supplementary characters. For
example, Character.isLetter(0x2F81A) returns
true because the code point value represents a letter
(a CJK ideograph).
In the Java SE API documentation, Unicode code point is
used for character values in the range between U+0000 and U+10FFFF,
and Unicode code unit is used for 16-bit
char values that are code units of the UTF-16
encoding. For more information on Unicode terminology, refer to the
Unicode Glossary.
Lee - BoyntonGuy - SteeleAkira - Tanaka1.0 - | Nested Class Summary: | ||
|---|---|---|
| public static class | Character.Subset | Instances of this class represent particular subsets of the Unicode
character set. The only family of subsets defined in the
Character class is {@link Character.UnicodeBlock
UnicodeBlock}. Other portions of the Java API may define other
subsets for their own purposes. |
| public static final class | Character.UnicodeBlock | A family of character subsets representing the character blocks in the Unicode specification. Character blocks generally define characters used for a specific script or purpose. A character is contained by at most one Unicode block. |
| Field Summary | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| public static final int | MIN_RADIX | The minimum radix available for conversion to and from strings.
The constant value of this field is the smallest value permitted
for the radix argument in radix-conversion methods such as the
digit method, the forDigit
method, and the toString method of class
Integer.
| |||||||||||||
| public static final int | MAX_RADIX | The maximum radix available for conversion to and from strings.
The constant value of this field is the largest value permitted
for the radix argument in radix-conversion methods such as the
digit method, the forDigit
method, and the toString method of class
Integer.
| |||||||||||||
| public static final char | MIN_VALUE | The constant value of this field is the smallest value of type
char, '\u0000'.
| |||||||||||||
| public static final char | MAX_VALUE | The constant value of this field is the largest value of type
char, '\uFFFF'.
| |||||||||||||
| public static final Class | TYPE | The Class instance representing the primitive type
char.
| |||||||||||||
| public static final byte | UNASSIGNED | General category "Cn" in the Unicode specification.
| |||||||||||||
| public static final byte | UPPERCASE_LETTER | General category "Lu" in the Unicode specification.
| |||||||||||||
| public static final byte | LOWERCASE_LETTER | General category "Ll" in the Unicode specification.
| |||||||||||||
| public static final byte | TITLECASE_LETTER | General category "Lt" in the Unicode specification.
| |||||||||||||
| public static final byte | MODIFIER_LETTER | General category "Lm" in the Unicode specification.
| |||||||||||||
| public static final byte | OTHER_LETTER | General category "Lo" in the Unicode specification.
| |||||||||||||
| public static final byte | NON_SPACING_MARK | General category "Mn" in the Unicode specification.
| |||||||||||||
| public static final byte | ENCLOSING_MARK | General category "Me" in the Unicode specification.
| |||||||||||||
| public static final byte | COMBINING_SPACING_MARK | General category "Mc" in the Unicode specification.
| |||||||||||||
| public static final byte | DECIMAL_DIGIT_NUMBER | General category "Nd" in the Unicode specification.
| |||||||||||||
| public static final byte | LETTER_NUMBER | General category "Nl" in the Unicode specification.
| |||||||||||||
| public static final byte | OTHER_NUMBER | General category "No" in the Unicode specification.
| |||||||||||||
| public static final byte | SPACE_SEPARATOR | General category "Zs" in the Unicode specification.
| |||||||||||||
| public static final byte | LINE_SEPARATOR | General category "Zl" in the Unicode specification.
| |||||||||||||
| public static final byte | PARAGRAPH_SEPARATOR | General category "Zp" in the Unicode specification.
| |||||||||||||
| public static final byte | CONTROL | General category "Cc" in the Unicode specification.
| |||||||||||||
| public static final byte | FORMAT | General category "Cf" in the Unicode specification.
| |||||||||||||
| public static final byte | PRIVATE_USE | General category "Co" in the Unicode specification.
| |||||||||||||
| public static final byte | SURROGATE | General category "Cs" in the Unicode specification.
| |||||||||||||
| public static final byte | DASH_PUNCTUATION | General category "Pd" in the Unicode specification.
| |||||||||||||
| public static final byte | START_PUNCTUATION | General category "Ps" in the Unicode specification.
| |||||||||||||
| public static final byte | END_PUNCTUATION | General category "Pe" in the Unicode specification.
| |||||||||||||
| public static final byte | CONNECTOR_PUNCTUATION | General category "Pc" in the Unicode specification.
| |||||||||||||
| public static final byte | OTHER_PUNCTUATION | General category "Po" in the Unicode specification.
| |||||||||||||
| public static final byte | MATH_SYMBOL | General category "Sm" in the Unicode specification.
| |||||||||||||
| public static final byte | CURRENCY_SYMBOL | General category "Sc" in the Unicode specification.
| |||||||||||||
| public static final byte | MODIFIER_SYMBOL | General category "Sk" in the Unicode specification.
| |||||||||||||
| public static final byte | OTHER_SYMBOL | General category "So" in the Unicode specification.
| |||||||||||||
| public static final byte | INITIAL_QUOTE_PUNCTUATION | General category "Pi" in the Unicode specification.
| |||||||||||||
| public static final byte | FINAL_QUOTE_PUNCTUATION | General category "Pf" in the Unicode specification.
| |||||||||||||
| static final int | ERROR | Error flag. Use int (code point) to avoid confusion with U+FFFF. | |||||||||||||
| public static final byte | DIRECTIONALITY_UNDEFINED | Undefined bidirectional character type. Undefined char
values have undefined directionality in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_LEFT_TO_RIGHT | Strong bidirectional character type "L" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_RIGHT_TO_LEFT | Strong bidirectional character type "R" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC | Strong bidirectional character type "AL" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_EUROPEAN_NUMBER | Weak bidirectional character type "EN" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR | Weak bidirectional character type "ES" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR | Weak bidirectional character type "ET" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_ARABIC_NUMBER | Weak bidirectional character type "AN" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_COMMON_NUMBER_SEPARATOR | Weak bidirectional character type "CS" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_NONSPACING_MARK | Weak bidirectional character type "NSM" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_BOUNDARY_NEUTRAL | Weak bidirectional character type "BN" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_PARAGRAPH_SEPARATOR | Neutral bidirectional character type "B" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_SEGMENT_SEPARATOR | Neutral bidirectional character type "S" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_WHITESPACE | Neutral bidirectional character type "WS" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_OTHER_NEUTRALS | Neutral bidirectional character type "ON" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING | Strong bidirectional character type "LRE" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE | Strong bidirectional character type "LRO" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING | Strong bidirectional character type "RLE" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE | Strong bidirectional character type "RLO" in the Unicode specification.
| |||||||||||||
| public static final byte | DIRECTIONALITY_POP_DIRECTIONAL_FORMAT | Weak bidirectional character type "PDF" in the Unicode specification.
| |||||||||||||
| public static final char | MIN_HIGH_SURROGATE | The minimum value of a Unicode high-surrogate code unit in the
UTF-16 encoding. A high-surrogate is also known as a
leading-surrogate.
| |||||||||||||
| public static final char | MAX_HIGH_SURROGATE | The maximum value of a Unicode high-surrogate code unit in the
UTF-16 encoding. A high-surrogate is also known as a
leading-surrogate.
| |||||||||||||
| public static final char | MIN_LOW_SURROGATE | The minimum value of a Unicode low-surrogate code unit in the
UTF-16 encoding. A low-surrogate is also known as a
trailing-surrogate.
| |||||||||||||
| public static final char | MAX_LOW_SURROGATE | The maximum value of a Unicode low-surrogate code unit in the
UTF-16 encoding. A low-surrogate is also known as a
trailing-surrogate.
| |||||||||||||
| public static final char | MIN_SURROGATE | The minimum value of a Unicode surrogate code unit in the UTF-16 encoding.
| |||||||||||||
| public static final char | MAX_SURROGATE | The maximum value of a Unicode surrogate code unit in the UTF-16 encoding.
| |||||||||||||
| public static final int | MIN_SUPPLEMENTARY_CODE_POINT | The minimum value of a supplementary code point.
| |||||||||||||
| public static final int | MIN_CODE_POINT | The minimum value of a Unicode code point.
| |||||||||||||
| public static final int | MAX_CODE_POINT | The maximum value of a Unicode code point.
| |||||||||||||
| public static final int | SIZE | The number of bits used to represent a char value in unsigned
binary form.
| |||||||||||||
| Constructor: |
|---|
Character object that
represents the specified char value.
|
| Methods from java.lang.Object: |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method from java.lang.Character Detail: | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
char values needed to
represent the specified character (Unicode code point). If the
specified character is equal to or greater than 0x10000, then
the method returns 2. Otherwise, the method returns 1.
This method doesn't validate the specified character to be a valid Unicode code point. The caller must validate the character value using isValidCodePoint if necessary. | |||||||||||||||
Character object. | |||||||||||||||
CharSequence. If the char value at
the given index in the CharSequence is in the
high-surrogate range, the following index is less than the
length of the CharSequence, and the
char value at the following index is in the
low-surrogate range, then the supplementary code point
corresponding to this surrogate pair is returned. Otherwise,
the char value at the given index is returned. | |||||||||||||||
char array. If the char value at
the given index in the char array is in the
high-surrogate range, the following index is less than the
length of the char array, and the
char value at the following index is in the
low-surrogate range, then the supplementary code point
corresponding to this surrogate pair is returned. Otherwise,
the char value at the given index is returned. | |||||||||||||||
char array, where only array elements with
index less than limit can be used. If
the char value at the given index in the
char array is in the high-surrogate range, the
following index is less than the limit, and the
char value at the following index is in the
low-surrogate range, then the supplementary code point
corresponding to this surrogate pair is returned. Otherwise,
the char value at the given index is returned. | |||||||||||||||
| |||||||||||||||
CharSequence. If the char value at
(index - 1) in the CharSequence is in
the low-surrogate range, (index - 2) is not
negative, and the char value at (index -
2) in the CharSequence is in the
high-surrogate range, then the supplementary code point
corresponding to this surrogate pair is returned. Otherwise,
the char value at (index - 1) is
returned. | |||||||||||||||
char array. If the char value at
(index - 1) in the char array is in
the low-surrogate range, (index - 2) is not
negative, and the char value at (index -
2) in the char array is in the
high-surrogate range, then the supplementary code point
corresponding to this surrogate pair is returned. Otherwise,
the char value at (index - 1) is
returned. | |||||||||||||||
char array, where only array elements with
index greater than or equal to start
can be used. If the char value at (index -
1) in the char array is in the
low-surrogate range, (index - 2) is not less than
start, and the char value at
(index - 2) in the char array is in
the high-surrogate range, then the supplementary code point
corresponding to this surrogate pair is returned. Otherwise,
the char value at (index - 1) is
returned. | |||||||||||||||
| |||||||||||||||
beginIndex and extends to the
char at index endIndex - 1. Thus the
length (in chars) of the text range is
endIndex-beginIndex. Unpaired surrogates within
the text range count as one code point each. | |||||||||||||||
char array argument. The offset
argument is the index of the first char of the
subarray and the count argument specifies the
length of the subarray in chars. Unpaired
surrogates within the subarray count as one code point each. | |||||||||||||||
| |||||||||||||||
Character objects numerically. | |||||||||||||||
ch in the
specified radix.
If the radix is not in the range Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #digit(int, int) method. | |||||||||||||||
If the radix is not in the range | |||||||||||||||
true if and only if the argument is not
null and is a Character object that
represents the same char value as this object. | |||||||||||||||
radix is not a
valid radix, or the value of digit is not a valid
digit in the specified radix, the null character
('\u0000') is returned.
The
If the digit is less than 10, then
| |||||||||||||||
char values is DIRECTIONALITY_UNDEFINED.
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #getDirectionality(int) method. | |||||||||||||||
| |||||||||||||||
int value that the specified Unicode
character represents. For example, the character
'\u216C' (the roman numeral fifty) will return
an int with a value of 50.
The letters A-Z in their uppercase ( If the character does not have a numeric value, then -1 is returned. If the character has a numeric value that cannot be represented as a nonnegative integer (for example, a fractional value), then -2 is returned. Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #getNumericValue(int) method. | |||||||||||||||
int value that the specified
character (Unicode code point) represents. For example, the character
'\u216C' (the Roman numeral fifty) will return
an int with a value of 50.
The letters A-Z in their uppercase ( If the character does not have a numeric value, then -1 is returned. If the character has a numeric value that cannot be represented as a nonnegative integer (for example, a fractional value), then -2 is returned. | |||||||||||||||
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #getType(int) method. | |||||||||||||||
| |||||||||||||||
Character. | |||||||||||||||
A character is defined if at least one of the following is true: Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isDefined(int) method. | |||||||||||||||
A character is defined if at least one of the following is true: | |||||||||||||||
A character is a digit if its general category type, provided
by Some Unicode character ranges that contain digits: Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isDigit(int) method. | |||||||||||||||
A character is a digit if its general category type, provided
by getType(codePoint) , is
Some Unicode character ranges that contain digits: | |||||||||||||||
char value is a
high-surrogate code unit (also known as leading-surrogate
code unit). Such values do not represent characters by
themselves, but are used in the representation of supplementary characters in the
UTF-16 encoding.
This method returns isch >= '\uD800' && ch <= '\uDBFF' true. | |||||||||||||||
'\u0000'
through '\u001F' or in the range
'\u007F' through '\u009F'.
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isISOControl(int) method. | |||||||||||||||
'\u0000'
through '\u001F' or in the range
'\u007F' through '\u009F'. | |||||||||||||||
The following Unicode characters are ignorable in a Java identifier or a Unicode identifier: Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isIdentifierIgnorable(int) method. | |||||||||||||||
The following Unicode characters are ignorable in a Java identifier or a Unicode identifier: | |||||||||||||||
A character may be part of a Java identifier if any of the following are true: Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isJavaIdentifierPart(int) method. | |||||||||||||||
A character may be part of a Java identifier if any of the following are true: | |||||||||||||||
A character may start a Java identifier if and only if one of the following conditions is true: Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isJavaIdentifierStart(int) method. | |||||||||||||||
A character may start a Java identifier if and only if one of the following conditions is true: | |||||||||||||||
Deprecated! Replaced - by isJavaIdentifierStart(char).
A character may start a Java identifier if and only if one of the following is true: | |||||||||||||||
Deprecated! Replaced - by isJavaIdentifierPart(char).
A character may be part of a Java identifier if and only if any of the following are true: | |||||||||||||||
A character is considered to be a letter if its general
category type, provided by Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isLetter(int) method. | |||||||||||||||
A character is considered to be a letter if its general category type, provided by getType(codePoint) , is any of the following: | |||||||||||||||
A character is considered to be a letter or digit if either
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isLetterOrDigit(int) method. | |||||||||||||||
A character is considered to be a letter or digit if either
isLetter(codePoint) or
isDigit(codePoint) returns
| |||||||||||||||
char value is a
low-surrogate code unit (also known as trailing-surrogate code
unit). Such values do not represent characters by themselves,
but are used in the representation of supplementary characters in the UTF-16 encoding.
This method returns isch >= '\uDC00' && ch <= '\uDFFF' true. | |||||||||||||||
A character is lowercase if its general category type, provided
by The following are examples of lowercase characters: a b c d e f g h i j k l m n o p q r s t u v w x y z '\u00DF' '\u00E0' '\u00E1' '\u00E2' '\u00E3' '\u00E4' '\u00E5' '\u00E6' '\u00E7' '\u00E8' '\u00E9' '\u00EA' '\u00EB' '\u00EC' '\u00ED' '\u00EE' '\u00EF' '\u00F0' '\u00F1' '\u00F2' '\u00F3' '\u00F4' '\u00F5' '\u00F6' '\u00F8' '\u00F9' '\u00FA' '\u00FB' '\u00FC' '\u00FD' '\u00FE' '\u00FF' Many other Unicode characters are lowercase too. Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isLowerCase(int) method. | |||||||||||||||
A character is lowercase if its general category type, provided
by getType(codePoint) , is
The following are examples of lowercase characters: a b c d e f g h i j k l m n o p q r s t u v w x y z '\u00DF' '\u00E0' '\u00E1' '\u00E2' '\u00E3' '\u00E4' '\u00E5' '\u00E6' '\u00E7' '\u00E8' '\u00E9' '\u00EA' '\u00EB' '\u00EC' '\u00ED' '\u00EE' '\u00EF' '\u00F0' '\u00F1' '\u00F2' '\u00F3' '\u00F4' '\u00F5' '\u00F6' '\u00F8' '\u00F9' '\u00FA' '\u00FB' '\u00FC' '\u00FD' '\u00FE' '\u00FF' Many other Unicode characters are lowercase too. | |||||||||||||||
'\u0028' LEFT
PARENTHESIS is semantically defined to be an opening
parenthesis. This will appear as a "(" in text that is
left-to-right but as a ")" in text that is right-to-left.
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isMirrored(int) method. | |||||||||||||||
'\u0028' LEFT PARENTHESIS is semantically
defined to be an opening parenthesis. This will appear
as a "(" in text that is left-to-right but as a ")" in text
that is right-to-left. | |||||||||||||||
Deprecated! Replaced - by isWhitespace(char).
true for the following five
characters only:
| |||||||||||||||
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isSpaceChar(int) method. | |||||||||||||||
| |||||||||||||||
codePoint >= 0x10000 && codePoint <= 0x10FFFF | |||||||||||||||
char
values is a valid surrogate pair. This method is equivalent to
the expression:
isHighSurrogate(high) && isLowSurrogate(low) | |||||||||||||||
A character is a titlecase character if its general
category type, provided by Some characters look like pairs of Latin letters. For example, there is an uppercase letter that looks like "LJ" and has a corresponding lowercase letter that looks like "lj". A third form, which looks like "Lj", is the appropriate form to use when rendering a word in lowercase with initial capitals, as for a book title.
These are some of the Unicode characters for which this method returns
Many other Unicode characters are titlecase too.
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isTitleCase(int) method. | |||||||||||||||
A character is a titlecase character if its general
category type, provided by getType(codePoint) ,
is Some characters look like pairs of Latin letters. For example, there is an uppercase letter that looks like "LJ" and has a corresponding lowercase letter that looks like "lj". A third form, which looks like "Lj", is the appropriate form to use when rendering a word in lowercase with initial capitals, as for a book title.
These are some of the Unicode characters for which this method returns
Many other Unicode characters are titlecase too. | |||||||||||||||
A character may be part of a Unicode identifier if and only if one of the following statements is true: Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isUnicodeIdentifierPart(int) method. | |||||||||||||||
A character may be part of a Unicode identifier if and only if one of the following statements is true: | |||||||||||||||
A character may start a Unicode identifier if and only if one of the following conditions is true: Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isUnicodeIdentifierStart(int) method. | |||||||||||||||
A character may start a Unicode identifier if and only if one of the following conditions is true: | |||||||||||||||
A character is uppercase if its general category type, provided by
The following are examples of uppercase characters: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z '\u00C0' '\u00C1' '\u00C2' '\u00C3' '\u00C4' '\u00C5' '\u00C6' '\u00C7' '\u00C8' '\u00C9' '\u00CA' '\u00CB' '\u00CC' '\u00CD' '\u00CE' '\u00CF' '\u00D0' '\u00D1' '\u00D2' '\u00D3' '\u00D4' '\u00D5' '\u00D6' '\u00D8' '\u00D9' '\u00DA' '\u00DB' '\u00DC' '\u00DD' '\u00DE' Many other Unicode characters are uppercase too.
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isUpperCase(int) method. | |||||||||||||||
A character is uppercase if its general category type, provided by
getType(codePoint) , is The following are examples of uppercase characters: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z '\u00C0' '\u00C1' '\u00C2' '\u00C3' '\u00C4' '\u00C5' '\u00C6' '\u00C7' '\u00C8' '\u00C9' '\u00CA' '\u00CB' '\u00CC' '\u00CD' '\u00CE' '\u00CF' '\u00D0' '\u00D1' '\u00D2' '\u00D3' '\u00D4' '\u00D5' '\u00D6' '\u00D8' '\u00D9' '\u00DA' '\u00DB' '\u00DC' '\u00DD' '\u00DE' Many other Unicode characters are uppercase too. | |||||||||||||||
0x0000 to
0x10FFFF inclusive. This method is equivalent to
the expression:
codePoint >= 0x0000 && codePoint <= 0x10FFFF | |||||||||||||||
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #isWhitespace(int) method. | |||||||||||||||
| |||||||||||||||
index by codePointOffset
code points. Unpaired surrogates within the text range given by
index and codePointOffset count as
one code point each. | |||||||||||||||
char subarray
that is offset from the given index by
codePointOffset code points. The
start and count arguments specify a
subarray of the char array. Unpaired surrogates
within the text range given by index and
codePointOffset count as one code point each. | |||||||||||||||
| |||||||||||||||
| |||||||||||||||
char array. If
the specified code point is a BMP (Basic Multilingual Plane or
Plane 0) value, the resulting char array has
the same value as codePoint. If the specified code
point is a supplementary code point, the resulting
char array has the corresponding surrogate pair. | |||||||||||||||
dst[dstIndex], and 1 is returned. If the
specified code point is a supplementary character, its
surrogate values are stored in dst[dstIndex]
(high-surrogate) and dst[dstIndex+1]
(low-surrogate), and 2 is returned. | |||||||||||||||
| |||||||||||||||
Note that
In general, java.lang.String#toLowerCase() should be used to map
characters to lowercase. Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #toLowerCase(int) method. | |||||||||||||||
Note that
In general, java.lang.String#toLowerCase() should be used to map
characters to lowercase. | |||||||||||||||
String object representing this
Character's value. The result is a string of
length 1 whose sole component is the primitive
char value represented by this
Character object. | |||||||||||||||
String object representing the
specified char. The result is a string of length
1 consisting solely of the specified char. | |||||||||||||||
| |||||||||||||||
char argument is already a titlecase
char, the same char value will be
returned.
Note that
Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #toTitleCase(int) method. | |||||||||||||||
Note that
| |||||||||||||||
Note that
In general, java.lang.String#toUpperCase() should be used to map
characters to uppercase. Note: This method cannot handle supplementary characters. To support all Unicode characters, including supplementary characters, use the #toUpperCase(int) method. | |||||||||||||||
Note that
In general, java.lang.String#toUpperCase() should be used to map
characters to uppercase. | |||||||||||||||
char itself is returned in the
char[]. | |||||||||||||||
| |||||||||||||||
|