Java new String getBytes 使用utf8静态枚举 StandardCharsets.UTF

本文主要是介绍Java new String getBytes 使用utf8静态枚举 StandardCharsets.UTF_8，希望对大家解决编程问题提供一定的参考价值，需要的开发者们随着小编来一起学习吧！

Java new String 使用utf8静态枚举用 `StandardCharsets.UTF_8`

new String(byteArray "utf8")或 getByte("utf8")要捕获异常

new String(byteArray, StandardCharsets.UTF_8) 或 getBytes(StandardCharsets.UTF_8) 就不用捕获异常了

`StandardCharsets.UTF_8`

StandardCharsets.UTF_8

在这里插入图片描述

StandardCharsets

StandardCharsets 是 Java 7 引入的一个实用类，位于 java.nio.charset 包中。这个类提供了对几种最常用的字符编码的静态引用，这些字符编码以 Charset 对象的形式表示。使用 StandardCharsets 而不是手动创建或查找字符集对象的好处是，它可以提高性能（因为 StandardCharsets 缓存了这些对象）并减少错误（因为您不需要记住或输入确切的字符集名称）。

以下是 StandardCharsets 类中定义的一些常量：

US_ASCII：表示 ASCII 字符集，这是一个 7 位的字符集，包含 128 个字符。
ISO_8859_1：表示 ISO-8859-1 字符集，这是一个 8 位的字符集，包含 256 个字符，是 ASCII 的超集。
UTF_8：表示 UTF-8 字符集，这是一种变长字节表示的 Unicode 字符集，非常流行，因为它能够兼容 ASCII 并且能够表示所有的 Unicode 字符。
UTF_16：表示 UTF-16 字符集，这是一种使用 16 位单元表示的 Unicode 字符集。注意，StandardCharsets 没有直接提供 UTF_16BE（大端序）或 UTF_16LE（小端序）的常量，因为 UTF_16 本身不指定字节顺序；它通常与 ByteBuffer 的字节顺序设置一起使用。
UTF_16BE（Java 9 及以上）：在 Java 9 及更高版本中，StandardCharsets 类增加了对 UTF-16BE（大端序）的直接支持。
UTF_16LE（Java 9 及以上）：同样，在 Java 9 及更高版本中，StandardCharsets 类也增加了对 UTF-16LE（小端序）的直接支持。

请注意，最后两个常量（UTF_16BE 和 UTF_16LE）是在 Java 9 中引入的。如果您使用的是 Java 8 或更早的版本，这些常量将不可用。

使用 StandardCharsets 类的示例：

import java.nio.charset.StandardCharsets;
import java.nio.ByteBuffer;
import java.nio.CharBuffer;public class StandardCharsetsExample {public static void main(String[] args) {byte[] bytes = "Hello, World!".getBytes(StandardCharsets.UTF_8);String decoded = new String(bytes, StandardCharsets.UTF_8);System.out.println(decoded); // 输出: Hello, World!// 使用 ByteBuffer 和 CharBuffer 进行编码和解码ByteBuffer byteBuffer = ByteBuffer.wrap(bytes);CharBuffer charBuffer = StandardCharsets.UTF_8.decode(byteBuffer);System.out.println(charBuffer.toString()); // 输出: Hello, World!}
}

在这个示例中，我们使用 StandardCharsets.UTF_8 来编码和解码字符串。这种方式比使用字符串字面量（如 "UTF-8"）来指定字符集更加高效且安全。

StandardCharsets类是Java中的一个工具类，它提供了一组标准字符集常量。这些常量用于在Java程序中指定字符集，以便在处理文本数据时进行编码和解码操作。StandardCharsets类位于java.nio.charset包中，是一个final类，不能被继承。

StandardCharsets类提供了以下常量：

UTF_8：表示UTF-8字符集。
UTF_16：表示UTF-16字符集。
UTF_16BE：表示UTF-16 Big Endian字符集。
UTF_16LE：表示UTF-16 Little Endian字符集。
US_ASCII：表示US-ASCII字符集。
ISO_8859_1：表示ISO-8859-1字符集。

使用StandardCharsets类可以方便地指定字符集，例如在读取或写入文件时指定字符集，或者在网络通信中指定字符集。下面是一个使用StandardCharsets类的示例代码：

import java.nio.charset.StandardCharsets;public class Example {public static void main(String[] args) {String str = "Hello, World!";// 使用UTF-8编码将字符串转换为字节数组byte[] bytes = str.getBytes(StandardCharsets.UTF_8);// 使用UTF-8解码字节数组为字符串String decodedStr = new String(bytes, StandardCharsets.UTF_8);System.out.println("Encoded bytes: " + bytes);System.out.println("Decoded string: " + decodedStr);}
}

这个示例演示了如何使用StandardCharsets类将字符串编码为字节数组，并将字节数组解码为字符串。在这个示例中，我们使用UTF-8字符集进行编码和解码操作。

jdk17的 `StandardCharsets` 类的源码

/** Copyright (c) 2011, 2019, Oracle and/or its affiliates. All rights reserved.* DO NOT ALTER OR REMOVE COPYRIGHT NOTICES OR THIS FILE HEADER.** This code is free software; you can redistribute it and/or modify it* under the terms of the GNU General Public License version 2 only, as* published by the Free Software Foundation.  Oracle designates this* particular file as subject to the "Classpath" exception as provided* by Oracle in the LICENSE file that accompanied this code.** This code is distributed in the hope that it will be useful, but WITHOUT* ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or* FITNESS FOR A PARTICULAR PURPOSE.  See the GNU General Public License* version 2 for more details (a copy is included in the LICENSE file that* accompanied this code).** You should have received a copy of the GNU General Public License version* 2 along with this work; if not, write to the Free Software Foundation,* Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA.** Please contact Oracle, 500 Oracle Parkway, Redwood Shores, CA 94065 USA* or visit www.oracle.com if you need additional information or have any* questions.*/
package java.nio.charset;/*** Constant definitions for the standard {@link Charset charsets}. These* charsets are guaranteed to be available on every implementation of the Java* platform.** @see <a href="Charset.html#standard">Standard Charsets</a>* @since 1.7*/
public final class StandardCharsets {// To avoid accidental eager initialization of often unused Charsets// from happening while the VM is booting up, which may delay// initialization of VM components, we should generally avoid depending// on this class from elsewhere in java.base.private StandardCharsets() {throw new AssertionError("No java.nio.charset.StandardCharsets instances for you!");}/*** Seven-bit ASCII, also known as ISO646-US, also known as the* Basic Latin block of the Unicode character set.*/public static final Charset US_ASCII = sun.nio.cs.US_ASCII.INSTANCE;/*** ISO Latin Alphabet {@literal No. 1}, also known as ISO-LATIN-1.*/public static final Charset ISO_8859_1 = sun.nio.cs.ISO_8859_1.INSTANCE;/*** Eight-bit UCS Transformation Format.*/public static final Charset UTF_8 = sun.nio.cs.UTF_8.INSTANCE;/*** Sixteen-bit UCS Transformation Format, big-endian byte order.*/public static final Charset UTF_16BE = new sun.nio.cs.UTF_16BE();/*** Sixteen-bit UCS Transformation Format, little-endian byte order.*/public static final Charset UTF_16LE = new sun.nio.cs.UTF_16LE();/*** Sixteen-bit UCS Transformation Format, byte order identified by an* optional byte-order mark.*/public static final Charset UTF_16 = new sun.nio.cs.UTF_16();
}