Java Character Class: Enhancing Character Handling Beyond Primitive Types
Discover the Java Character class, which provides useful methods for manipulating characters beyond the primitive char
data type. Learn how to leverage this class to perform operations such as case conversion, character classification, and more, improving your character handling capabilities in Java programming.
Java Character Class
Normally, when we work with characters, we use primitive data types char
.
char ch = 'a';
// Unicode for uppercase Greek omega character
char uniChar = '\u039A';
// an array of chars
char[] charArray = {'a', 'b', 'c', 'd', 'e'};
Use of Character Class in Java
In development, we come across situations where we need to use objects instead of primitive data types. Java provides a wrapper class Character
for the primitive data type char
.
The Character
class offers a number of useful class (i.e., static) methods for manipulating characters. You can create a Character
object with the Character
constructor:
Character ch = new Character('a');
The Java compiler will also create a Character
object for you under some circumstances. For example, if you pass a primitive char
into a method that expects an object, the compiler automatically converts the char
to a Character
for you. This feature is called autoboxing or unboxing, if the conversion goes the other way.
Example of Java Character Class
// Here following primitive char 'a'
// is boxed into the Character object ch
Character ch = 'a';
// Here primitive 'x' is boxed for method test,
// return is unboxed to char 'c'
char c = test('x');
Escape Sequences
A character preceded by a backslash (\
) is an escape sequence and has a special meaning to the compiler.
The newline character (\n
) has been used frequently in this tutorial in System.out.println()
statements to advance to the next line after the string is printed.
Escape Sequence | Description |
---|---|
\t | Inserts a tab in the text at this point. |
\b | Inserts a backspace in the text at this point. |
\n | Inserts a newline in the text at this point. |
\r | Inserts a carriage return in the text at this point. |
\f | Inserts a form feed in the text at this point. |
\' | Inserts a single quote character in the text at this point. |
\" | Inserts a double quote character in the text at this point. |
\\ | Inserts a backslash character in the text at this point. |
When an escape sequence is encountered in a print statement, the compiler interprets it accordingly.
Example: Escape Sequences
If you want to put quotes within quotes, you must use the escape sequence, \"
, on the interior quotes:
public class Test {
public static void main(String args[]) {
System.out.println("She said \"Hello!\" to me.");
}
}
Output
She said "Hello!" to me.
Character Class Declaration
Following is the declaration for java.lang.Character
class:
public final class Character
extends Object
implements Serializable, Comparable<Character>
Fields
Following are the fields for java.lang.Character
class:
Function Name | Description |
---|---|
COMBINING_SPACING_MARK | This is the General category "Mc" in the Unicode specification. |
CONNECTOR_PUNCTUATION | This is the General category "Pc" in the Unicode specification. |
CONTROL | This is the General category "Cc" in the Unicode specification. |
CURRENCY_SYMBOL | This is the General category "Sc" in the Unicode specification. |
DASH_PUNCTUATION | This is the General category "Pd" in the Unicode specification. |
DECIMAL_DIGIT_NUMBER | This is the General category "Nd" in the Unicode specification. |
DIRECTIONALITY_ARABIC_NUMBER | This is the Weak bidirectional character type "AN" in the Unicode specification. |
DIRECTIONALITY_BOUNDARY_NEUTRAL | This is the Weak bidirectional character type "BN" in the Unicode specification. |
DIRECTIONALITY_COMMON_NUMBER_SEPARATOR | This is the Weak bidirectional character type "CS" in the Unicode specification. |
DIRECTIONALITY_EUROPEAN_NUMBER | This is the Weak bidirectional character type "EN" in the Unicode specification. |
DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR | This is the Weak bidirectional character type "ES" in the Unicode specification. |
DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR | This is the Weak bidirectional character type "ET" in the Unicode specification. |
DIRECTIONALITY_LEFT_TO_RIGHT | This is the Strong bidirectional character type "L" in the Unicode specification. |
DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING | This is the Strong bidirectional character type "LRE" in the Unicode specification. |
DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE | This is the Strong bidirectional character type "LRO" in the Unicode specification. |
DIRECTIONALITY_NONSPACING_MARK | This is the Weak bidirectional character type "NSM" in the Unicode specification. |
DIRECTIONALITY_OTHER_NEUTRALS | This is the Neutral bidirectional character type "ON" in the Unicode specification. |
DIRECTIONALITY_PARAGRAPH_SEPARATOR | This is the Neutral bidirectional character type "B" in the Unicode specification. |
DIRECTIONALITY_POP_DIRECTIONAL_FORMAT | This is the Weak bidirectional character type "PDF" in the Unicode specification. |
DIRECTIONALITY_RIGHT_TO_LEFT | This is the Strong bidirectional character type "R" in the Unicode specification. |
DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC | This is the Strong bidirectional character type "AL" in the Unicode specification. |
DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING | This is the Strong bidirectional character type "RLE" in the Unicode specification. |
DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE | This is the Strong bidirectional character type "RLO" in the Unicode specification. |
DIRECTIONALITY_SEGMENT_SEPARATOR | This is the Neutral bidirectional character type "S" in the Unicode specification. |
DIRECTIONALITY_UNDEFINED | This is the Undefined bidirectional character type. |
DIRECTIONALITY_WHITESPACE | This is the Neutral bidirectional character type "WS" in the Unicode specification. |
ENCLOSING_MARK | This is the General category "Me" in the Unicode specification. |
END_PUNCTUATION | This is the General category "Pe" in the Unicode specification. |
FINAL_QUOTE_PUNCTUATION | This is the General category "Pf" in the Unicode specification. |
FORMAT | This is the General category "Cf" in the Unicode specification. |
INITIAL_QUOTE_PUNCTUATION | This is the General category "Pi" in the Unicode specification. |
LETTER_NUMBER | This is the General category "Nl" in the Unicode specification. |
LINE_SEPARATOR | This is the Neutral bidirectional character type "B" in the Unicode specification. |
LOWERCASE_LETTER | This is the General category "Ll" in the Unicode specification. |
MATH_SYMBOL | This is the General category "Sm" in the Unicode specification. |
MODIFIER_LETTER | This is the General category "Lm" in the Unicode specification. |
MODIFIER_SYMBOL | This is the General category "Sk" in the Unicode specification. |
NON_SPACING_MARK | This is the General category "Mn" in the Unicode specification. |
OTHER_LETTER | This is the General category "Lo" in the Unicode specification. |
OTHER_NUMBER | This is the General category "No" in the Unicode specification. |
OTHER_PUNCTUATION | This is the General category "Po" in the Unicode specification. |
OTHER_SYMBOL | This is the General category "So" in the Unicode specification. |
PARAGRAPH_SEPARATOR | This is the Neutral bidirectional character type "B" in the Unicode specification. |
PRIVATE_USE | This is the General category "Co" in the Unicode specification. |
SPACE_SEPARATOR | This is the General category "Zs" in the Unicode specification. |
START_PUNCTUATION | This is the General category "Ps" in the Unicode specification. |
SURROGATE | This is the General category "Cs" in the Unicode specification. |
TITLECASE_LETTER | This is the General category "Lt" in the Unicode specification. |
UNASSIGNED | This is the General category "Cn" in the Unicode specification. |
UPPERCASE_LETTER | This is the General category "Lu" in the Unicode specification. |