Java character code

Everyone hates character codes, don't they? With talk of UTF-8 becoming the default in Java (JEP 400, which landed in Java 18), I wanted to write up some samples that deal with character codes in Java.

When should I be aware of the character code?

First of all, when do you need to be aware of the character code? Whenever data comes in from, or goes out to, something other than your own Java application.

Input example

Input data from the client in a server-client system
Reading external files such as CSV

Output example

Responses returned from the server in a server-client system
File export
DB registration

Processing using character code

I think there are many others, but here are some that I often use.

String.getBytes

A method that converts a string into a byte array.


Pass the character code as the argument to getBytes, for example "TEST".getBytes(StandardCharsets.UTF_8). This encodes the character string "TEST" as UTF-8 and returns the resulting bytes. getBytes can also be called with no argument, in which case the default character code of the execution environment is used. If you want to know what that default is, you can check it with Charset.defaultCharset().
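To make the difference visible, here is a minimal sketch rather than a definitive sample. The class name GetBytesSketch, the encode helper and the TEXT constant are my own names for illustration, and I use a non-ASCII character because "TEST" produces the same bytes in most character codes.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class GetBytesSketch {
    // "\u3042" is HIRAGANA LETTER A. It is written as an escape so the
    // sample does not depend on the encoding of this source file.
    static final String TEXT = "\u3042";

    // Hypothetical helper: encodes TEXT with the given character code.
    static byte[] encode(Charset charset) {
        return TEXT.getBytes(charset);
    }

    public static void main(String[] args) {
        // The same one-character string becomes different bytes per character code.
        System.out.println("UTF-8      : " + Arrays.toString(encode(StandardCharsets.UTF_8)));      // [-29, -127, -126]
        System.out.println("UTF-16BE   : " + Arrays.toString(encode(StandardCharsets.UTF_16BE)));   // [48, 66]
        // ISO-8859-1 cannot represent the character, so getBytes substitutes '?'.
        System.out.println("ISO-8859-1 : " + Arrays.toString(encode(StandardCharsets.ISO_8859_1))); // [63]

        // With no argument, the environment's default character code is used,
        // so this line can print different bytes on different machines.
        System.out.println("default (" + Charset.defaultCharset() + ") : "
                + Arrays.toString(TEXT.getBytes()));
    }
}
```

The last line is the one to watch: its output depends on where you run it, which is exactly why the no-argument form is risky.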


If you want to change the default character code, set the file.encoding system property when starting the JVM, for example with the option -Dfile.encoding=UTF-8.


String constructor

If you pass a byte[] and a character code to the String constructor, the bytes are decoded using that character code and you get the resulting character string.

// "MS932" has no constant in StandardCharsets, so look it up by name.
byte[] byte1 = "TEST".getBytes(StandardCharsets.UTF_8);
String decoded = new String(byte1, Charset.forName("MS932"));

As with getBytes, if you do not specify a character code in this constructor, the default character code of the execution environment is used.
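What happens when the character code used for decoding does not match the one used for encoding? The sketch below shows one case, under a couple of assumptions of mine: DecodeSketch and decode are hypothetical names, and I swapped MS932 for ISO-8859-1 because it is guaranteed to exist on every Java runtime.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class DecodeSketch {
    // Hypothetical helper: decodes bytes with the given character code
    // using the String(byte[], Charset) constructor.
    static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        String original = "\u3042"; // HIRAGANA LETTER A
        byte[] utf8 = original.getBytes(StandardCharsets.UTF_8);

        // Decoding with the same character code restores the original string.
        String matched = decode(utf8, StandardCharsets.UTF_8);
        // Decoding with a different one turns each of the 3 bytes into its own character.
        String mismatched = decode(utf8, StandardCharsets.ISO_8859_1);

        System.out.println(original.equals(matched));    // true
        System.out.println(original.equals(mismatched)); // false
        System.out.println(mismatched.length());         // 3
    }
}
```

No exception is thrown in the mismatched case. You simply get garbled text, so this kind of mistake tends to surface far from where it was made.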

Read file

There are many ways to read a file, but I will show just one here.

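// Paths.get("") is a placeholder; put the path of the file to read here.
try (BufferedReader bufferedReader = Files.newBufferedReader(Paths.get(""), StandardCharsets.UTF_8)) {
    // Read from bufferedReader here; try-with-resources closes it afterwards.
} catch (IOException e) {
    // TODO: handle the I/O error
}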

Pass the file to read as the first argument of Files.newBufferedReader and the character code as the second. The character code can be omitted, in which case UTF-8 is used in every environment. (This one-argument overload was added in Java 8.) It looks like this.

// Shape of the one-argument overload in java.nio.file.Files:
public static BufferedReader newBufferedReader(Path path) throws IOException {
    return newBufferedReader(path, StandardCharsets.UTF_8);
}
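To try this end to end, here is a small sketch that writes a temporary file in UTF-8 and reads it back both ways. ReadFileSketch, firstLine and the temp file prefix are names I made up for illustration, not anything from the JDK.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

class ReadFileSketch {
    // Hypothetical helper: reads the first line of a file, stating the
    // character code explicitly instead of relying on any default.
    static String firstLine(Path path) throws IOException {
        try (BufferedReader reader = Files.newBufferedReader(path, StandardCharsets.UTF_8)) {
            return reader.readLine();
        }
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("charset-sketch", ".txt");
        try {
            // Write one CSV-like line as UTF-8 bytes.
            Files.write(tmp, "\u3042,TEST\n".getBytes(StandardCharsets.UTF_8));

            // Explicit character code.
            System.out.println(firstLine(tmp).equals("\u3042,TEST")); // true

            // Omitted character code: this overload also decodes as UTF-8.
            try (BufferedReader reader = Files.newBufferedReader(tmp)) {
                System.out.println(reader.readLine().equals("\u3042,TEST")); // true
            }
        } finally {
            Files.deleteIfExists(tmp);
        }
    }
}
```

Both reads give the same result here only because the file really is UTF-8. For a file saved in another character code, such as a CSV exported on Windows, you would need to pass that character code explicitly.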


That's all for today. Files.newBufferedReader defaults to UTF-8, while getBytes and the String constructor default to whatever the execution environment uses, so be aware that the default differs depending on the method you call. Personally, I think you should always specify the character code explicitly.
