调用火山云的语音生成TTS和语音识别STT

2024-09-05 13:36

本文主要是介绍调用火山云的语音生成TTS和语音识别STT,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

首先需要去火山云的控制台开通TTS和STT服务语音技术 (volcengine.com)

火山这里都提供了免费的额度可以使用

我这里是使用了java来调用API

目前我还了解到阿里的开源项目SenseVoice(STT)和CosyVoice(TTS)非常的不错,但是都是使用Python开发的。可以做到说话情绪的识别,感兴趣可以去github上了解一下。


TTS(首先需要导入它给的类)

package com.erroright.backend_server_java.pojo.util;import java.util.UUID;public class TtsRequest {public static final String APP_ID = "控制台的APPID";public static final String CLUSTER = "";public static final String Token = "";public static final String VoiceType = "BV001_streaming";//生成声音的选择(如果生成语音报错,就是你没开通这个音色的权限)public static final String Emotion = "angry";//语气public TtsRequest() {}public TtsRequest(String text) {this.request.text = text;}private App app = new App();private User user = new User();private Audio audio = new Audio();private Request request = new Request();public App getApp() {return app;}public void setApp(App app) {this.app = app;}public User getUser() {return user;}public void setUser(User user) {this.user = user;}public Audio getAudio() {return audio;}public void setAudio(Audio audio) {this.audio = audio;}public Request getRequest() {return request;}public void setRequest(Request request) {this.request = request;}public class App {private String appid = APP_ID;private String token = Token; // 目前未生效,填写默认值:access_tokenprivate String cluster = CLUSTER;public String getAppid() {return appid;}public void setAppid(String appid) {this.appid = appid;}public String getToken() {return token;}public void setToken(String token) {this.token = token;}public String getCluster() {return cluster;}public void setCluster(String cluster) {this.cluster = cluster;}}public class User {private String uid = "388808087185088"; // 目前未生效,填写一个默认值就可以public String getUid() {return uid;}public void setUid(String uid) {this.uid = uid;}}public class Audio {private String voice_type = VoiceType;private String encoding = "wav";private float speed_ratio = 1.0F;private float volume_ratio = 10;private float pitch_ratio = 10;private String emotion = Emotion;public String getVoice_type() {return voice_type;}public void setVoice_type(String voice_type) {this.voice_type = voice_type;}public String getEncoding() {return encoding;}public void setEncoding(String encoding) {this.encoding = encoding;}public float getSpeedRatio() {return speed_ratio;}public void setSpeedRatio(int speed_ratio) {this.speed_ratio = speed_ratio;}public float getVolumeRatio() {return volume_ratio;}public void setVolumeRatio(int volume_ratio) {this.volume_ratio = volume_ratio;}public float getPitchRatio() {return pitch_ratio;}public void setPitchRatio(int pitch_ratio) {this.pitch_ratio = pitch_ratio;}public String getEmotion() {return emotion;}public void setEmotion(int emotion) {this.emotion = String.valueOf(emotion);}}public class Request {private String reqid = UUID.randomUUID().toString();private String text;private String text_type = "plain";private String operation = "query";public String getReqid() {return reqid;}public void setReqid(String reqid) {this.reqid = reqid;}public String getText() {return text;}public void setText(String text) {this.text = text;}public String getText_type() {return text_type;}public void setText_type(String text_type) {this.text_type = text_type;}public String getOperation() {return operation;}public void setOperation(String operation) {this.operation = operation;}}
}

调用代码

package com.erroright.backend_server_java.util;import com.alibaba.fastjson.JSON;
import com.erroright.backend_server_java.pojo.util.TtsRequest;
import lombok.extern.slf4j.Slf4j;
import okhttp3.*;
import org.springframework.stereotype.Component;import java.io.FileOutputStream;
import java.io.IOException;
import java.util.Base64;@Component
@Slf4j
public class TtsHttpClient {public static final String API_URL = "https://openspeech.bytedance.com/api/v1/tts";public static final String ACCESS_TOKEN = "填入火山云开通项目的Token";public static byte[] getTts(String content) throws IOException {log.info("TTS生成:"+content);TtsRequest ttsRequest = new TtsRequest(content);String json= JSON.toJSONString(ttsRequest);OkHttpClient client = new OkHttpClient();RequestBody body = RequestBody.create(json, MediaType.get("application/json; charset=utf-8"));Request request = new Request.Builder().url(API_URL).post(body).header("Authorization", "Bearer; " + ACCESS_TOKEN).build();try (Response response = client.newCall(request).execute()) {String TtsRresponse=response.body().string();// 提取 "data" 字段的值String data = TtsRresponse.split("\"data\":\"")[1].split("\"")[0];//保存生成的文件try (FileOutputStream fos = new FileOutputStream("output.wav")) {fos.write(Base64.getDecoder().decode(data));}// 解码 Base64 数据return Base64.getDecoder().decode(data);}}}

STT(导入类,在官方文档中是三个类,为了在springBoot中封装,分开了一个)
 

package com.erroright.backend_server_java.pojo.util;import com.fasterxml.jackson.databind.DeserializationFeature;
import com.fasterxml.jackson.databind.ObjectMapper;
import org.java_websocket.client.WebSocketClient;
import org.java_websocket.handshake.ServerHandshake;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.net.URI;
import java.net.URISyntaxException;
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.InvalidKeyException;
import java.security.NoSuchAlgorithmException;
import java.util.Base64;
import java.util.UUID;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;class AsrParams {private App app;private User user;private Request request;private Audio audio;public AsrParams(App app, User user, Request request, Audio audio) {this.app = app;this.user = user;this.request = request;this.audio = audio;}public App getApp() {return app;}public void setApp(App app) {this.app = app;}public User getUser() {return user;}public void setUser(User user) {this.user = user;}public Request getRequest() {return request;}public void setRequest(Request request) {this.request = request;}public Audio getAudio() {return audio;}public void setAudio(Audio audio) {this.audio = audio;}public static class App {private String appid;private String cluster;private String token;public App(String appid, String cluster, String token) {this.appid = appid;this.cluster = cluster;this.token = token;}public String getAppid() {return appid;}public void setAppid(String appid) {this.appid = appid;}public String getCluster() {return cluster;}public void setCluster(String cluster) {this.cluster = cluster;}public String getToken() {return token;}public void setToken(String token) {this.token = token;}}public static class User {private String uid;public User(String uid) {this.uid = uid;}public String getUid() {return uid;}public void setUid(String uid) {this.uid = uid;}}public static class Request {private String reqid;private String workflow;private int nbest;private boolean show_utterances;private String result_type;private int sequence;public Request(String reqid, String workflow, int nbest, boolean show_utterances, String result_type, int sequence) {this.reqid = reqid;this.workflow = workflow;this.nbest = nbest;this.show_utterances = show_utterances;this.result_type = result_type;this.sequence = sequence;}public String getReqid() {return reqid;}public void setReqid(String reqid) {this.reqid = reqid;}public String getWorkflow() {return workflow;}public void setWorkflow(String workflow) {this.workflow = workflow;}public int getNbest() {return nbest;}public void setNbest(int nbest) {this.nbest = nbest;}public boolean isShow_utterances() {return show_utterances;}public void setShow_utterances(boolean show_utterances) {this.show_utterances = show_utterances;}public String getResult_type() {return result_type;}public void setResult_type(String result_type) {this.result_type = result_type;}public int getSequence() {return sequence;}public void setSequence(int sequence) {this.sequence = sequence;}}public static class Audio {private String format;private String codec;private int rate;private int bits;private int channels;public Audio(String format, String codec, int rate, int bits, int channels) {this.format = format;this.codec = codec;this.rate = rate;this.bits = bits;this.channels = channels;}public String getFormat() {return format;}public void setFormat(String format) {this.format = format;}public String getCodec() {return codec;}public void setCodec(String codec) {this.codec = codec;}public int getRate() {return rate;}public void setRate(int rate) {this.rate = rate;}public int getBits() {return bits;}public void setBits(int bits) {this.bits = bits;}public int getChannels() {return channels;}public void setChannels(int channels) {this.channels = channels;}}
}public class AsrClient extends WebSocketClient {private static final String URL = "wss://openspeech.bytedance.com/api/v2/asr";private static final Logger logger = LoggerFactory.getLogger(WebSocketClient.class);private String appid;private String token;private String sk;private String cluster;private String workflow = "audio_in,resample,partition,vad,fe,decode,nlu_punctuate";private String uid = "usesr_id";private int nhest = 1;private boolean show_utterances = true;private String result_type = "full";private String format = "wav";private String codec = "raw";private int sample_rate = 16000;private int channels = 1;private int bits = 16;private AuthType authType = AuthType.TOKEN;private byte[] params_msg = null;private AsrResponse asr_response;private CountDownLatch recv_latch = null;private int recv_timeout = 5;private boolean recv_suc = true;public static AsrClient build() throws URISyntaxException {URI uri = new URI(URL);return new AsrClient(uri);}// TODO 接受一个 listener 监听消息, onOpen, onMessage, onError, onCompleteprivate AsrClient(URI uri) {super(uri);}public  static class ProtocolVersion {static public int PROTOCOL_VERSION = 0b0001;}public static class MessageType {static public int FULL_CLIENT_REQUEST = 0b0001;static public int AUDIO_ONLY_CLIENT_REQUEST = 0b0010;static public int FULL_SERVER_RESPONSE = 0b1001;static public int SERVER_ACK = 0b1011;static public int ERROR_MESSAGE_FROM_SERVER = 0b1111;}public static class MessageTypeFlag {static public int NO_SEQUENCE_NUMBER = 0b0000;static public int POSITIVE_SEQUENCE_CLIENT_ASSGIN = 0b0001;static public int NEGATIVE_SEQUENCE_SERVER_ASSGIN = 0b0010;static public int NEGATIVE_SEQUENCE_CLIENT_ASSGIN = 0b0011;}public static class MessageSerial {public int NO_SERIAL = 0b0000;public static int JSON = 0b0001;public int CUSTOM_SERIAL = 0b1111;}public  static class MessageCompress {public int NO_COMPRESS = 0b0000;public static int GZIP = 0b0001;public int CUSTOM_COMPRESS = 0b1111;}public enum AuthType {TOKEN,SIGNATURE;}@Overridepublic void onOpen(ServerHandshake serverHandshake) {logger.info("asr client onOpen");}@Overridepublic void onMessage(String s) {logger.info("onMessage String, should be onMessage(ByteBuffer) called");
//        try {
//            if (parse_response(s) != 0) {
//                logger.error("error happends to close connection");
//                close();
//            }
//        } catch (IOException e) {
//            e.printStackTrace();
//        }}@Overridepublic void onMessage(ByteBuffer bytes) {try {if (parse_response(bytes) != 0) {recv_suc = false;logger.error("error happends to close connection");close();}recv_latch.countDown();} catch (IOException e) {e.printStackTrace();}}@Overridepublic void onClose(int i, String s, boolean b) {logger.info("asr onClose {}, {}, {}", i, s, b);}@Overridepublic void onError(Exception e) {logger.info("asr onError {}", e.getMessage());recv_suc = false;recv_latch.countDown();this.close();}//    public int asr_connect() throws IOException, NoSuchAlgorithmException, InvalidKeyException {
//        this.params_msg = construct_param();
//        set_auth_header();
//        this.connect();
//        return 0;
//    }public boolean asr_sync_connect() throws IOException, InterruptedException, NoSuchAlgorithmException, InvalidKeyException {this.params_msg = construct_param();set_auth_header();boolean ret = this.connectBlocking();if (!ret) {return ret;}recv_latch = new CountDownLatch(1);this.send(this.params_msg);ret = recv_latch.await(recv_timeout, TimeUnit.SECONDS);return ret && recv_suc;}public AsrResponse asr_send(byte[] audio, boolean is_last) throws IOException, InterruptedException {recv_latch = new CountDownLatch(1);byte[] payload = construct_audio_payload(audio, is_last);this.send(payload);boolean ret = recv_latch.await(recv_timeout, TimeUnit.SECONDS);if (!ret) {logger.error("recv message timeout");this.close();return new AsrResponse();}return asr_response;}public int asr_close() {this.close();return 0;}private void set_auth_header() throws NoSuchAlgorithmException, InvalidKeyException {if (authType == AuthType.TOKEN) {this.addHeader("Authorization", "Bearer; " + token);return;}String custom_header = "Custom";String custom_cont = "auth_custom";this.addHeader(custom_header, custom_cont);String str = "GET " + getURI().getPath() + " HTTP/1.1\n"+ custom_cont + "\n";byte[] str_byte = str.getBytes(StandardCharsets.UTF_8);byte[] data = concat_byte(str_byte, this.params_msg);byte[] sk_byte = this.sk.getBytes(StandardCharsets.UTF_8);String HMAC_SHA256 = "HmacSHA256";Mac sha256Hmac = Mac.getInstance(HMAC_SHA256);SecretKeySpec keySpec = new SecretKeySpec(sk_byte, HMAC_SHA256);sha256Hmac.init(keySpec);byte[] mac_data = sha256Hmac.doFinal(data);String base64_data = Base64.getUrlEncoder().encodeToString(mac_data);String auth_cont = "HMAC256; access_token=\"" + this.token+ "\"; mac=\"" + base64_data+ "\"; h=\"" + custom_header + "\"";this.addHeader("Authorization", auth_cont);}private byte[] gzip_compress(byte[] content) throws IOException {ByteArrayOutputStream out = new ByteArrayOutputStream(content.length);GZIPOutputStream gzip = new GZIPOutputStream(out);gzip.write(content);gzip.close();byte[] result = out.toByteArray();out.close();return result;}private byte[] gzip_decompress(byte[] content) throws IOException {ByteArrayInputStream in = new ByteArrayInputStream(content);GZIPInputStream gzip = new GZIPInputStream(in);ByteArrayOutputStream out = new ByteArrayOutputStream();byte[] buff = new byte[1024];int len = 0;while ((len = gzip.read(buff, 0, buff.length)) > 0) {out.write(buff, 0, len);}byte[] result = out.toByteArray();in.close();gzip.close();out.close();return result;}private byte[] construct_param() throws IOException {int header_len = 4;byte[] header = new byte[header_len];header[0] = (byte) (ProtocolVersion.PROTOCOL_VERSION << 4 | (header_len >> 2));header[1] = (byte) (MessageType.FULL_CLIENT_REQUEST << 4 | MessageTypeFlag.NO_SEQUENCE_NUMBER);header[2] = (byte) (MessageSerial.JSON << 4 | MessageCompress.GZIP);header[3] = 0;String reqid = UUID.randomUUID().toString();AsrParams.App app = new AsrParams.App(appid, cluster, token);AsrParams.User user = new AsrParams.User(uid);AsrParams.Request request = new AsrParams.Request(reqid, workflow, 1, show_utterances, result_type, 1);AsrParams.Audio audio = new AsrParams.Audio(format, codec, sample_rate, bits, channels);AsrParams asr_params = new AsrParams(app, user, request, audio);ObjectMapper mapper = new ObjectMapper();
//        String params_json = mapper.writeValueAsString(asr_params);byte[] payload = mapper.writeValueAsBytes(asr_params);logger.info("params_json {}", new String(payload));payload = gzip_compress(payload);// java big-endian defaultint payload_len = payload.length;ByteBuffer bb = ByteBuffer.allocate(4);//b.order(ByteOrder.BIG_ENDIAN); // optional, the initial order of a byte buffer is always BIG_ENDIAN.bb.putInt(payload_len);byte[] pl_byte = bb.array();return concat_byte(header, pl_byte, payload);}private int parse_response(ByteBuffer msg) throws IOException {byte[] msg_byte = msg.array();int header_len = (msg_byte[0] & 0x0f) << 2;int message_type = (msg_byte[1] & 0xf0) >> 4;int message_type_flag = msg_byte[1] & 0x0f;int message_serial = (msg_byte[2] & 0xf0) >> 4;int message_compress = msg_byte[2] & 0x0f;byte[] payload = null;int payload_len = 0;int payload_offset = header_len;if (message_type == MessageType.FULL_SERVER_RESPONSE) {ByteBuffer bb = ByteBuffer.wrap(msg_byte, payload_offset, 4);payload_len = bb.getInt();payload_offset += 4;} else if (message_type == MessageType.SERVER_ACK) {ByteBuffer bb = ByteBuffer.wrap(msg_byte, payload_offset, 4);int seq = bb.getInt();payload_offset += 4;if (msg_byte.length > 8) {payload_len = ByteBuffer.wrap(msg_byte, payload_offset, 4).getInt();payload_offset += 4;}} else if (message_type == MessageType.ERROR_MESSAGE_FROM_SERVER) {int error_code = ByteBuffer.wrap(msg_byte, payload_offset, 4).getInt();payload_offset += 4;payload_len = ByteBuffer.wrap(msg_byte, payload_offset, 4).getInt();payload_offset += 4;} else {logger.error("unsupported message type {}", message_type);return -1;}payload = new byte[msg_byte.length - payload_offset];System.arraycopy(msg_byte, payload_offset, payload, 0, payload.length);if (message_compress == MessageCompress.GZIP) {payload = gzip_decompress(payload);}if (message_serial == MessageSerial.JSON) {ObjectMapper mapper = new ObjectMapper().disable(DeserializationFeature.FAIL_ON_UNKNOWN_PROPERTIES);asr_response = mapper.readValue(payload, 0, payload.length, AsrResponse.class);}if (asr_response.getCode() != 1000) {logger.error("asr resposne {}", new String(payload));return -1;}if (asr_response.getSequence() < 0) {logger.debug("get last response");}// logger.info("asr response {}", new String(payload));return 0;}private byte[] construct_audio_payload(byte[] audio, boolean is_last) throws IOException {int header_len = 4;byte[] header = new byte[header_len];header[0] = (byte) (ProtocolVersion.PROTOCOL_VERSION << 4 | (header_len >> 2));if (!is_last) {header[1] = (byte) (MessageType.AUDIO_ONLY_CLIENT_REQUEST << 4 | MessageTypeFlag.NO_SEQUENCE_NUMBER);} else {header[1] = (byte) (MessageType.AUDIO_ONLY_CLIENT_REQUEST << 4 | MessageTypeFlag.NEGATIVE_SEQUENCE_SERVER_ASSGIN);}header[2] = (byte) (MessageSerial.JSON << 4 | MessageCompress.GZIP);header[3] = 0;byte[] payload = gzip_compress(audio);int payload_len = payload.length;ByteBuffer bb = ByteBuffer.allocate(4);bb.putInt(payload_len);byte[] pl_byte = bb.array();return concat_byte(header, pl_byte, payload);}public void setAppid(String appid) {this.appid = appid;}public void setToken(String token) {this.token = token;}public void setSk(String sk) {this.sk = sk;}public void setCluster(String cluster) {this.cluster = cluster;}public void setWorkflow(String workflow) {this.workflow = workflow;}public void setUid(String uid) {this.uid = uid;}public void setShow_utterances(boolean show_utterances) {this.show_utterances = show_utterances;}public void setResult_type(String result_type) {this.result_type = result_type;}public void setFormat(String format) {this.format = format;}public void setCodec(String codec) {this.codec = codec;}public void setSample_rate(int sample_rate) {this.sample_rate = sample_rate;}public void setChannels(int channels) {this.channels = channels;}public void setBits(int bits) {this.bits = bits;}public AuthType getAuthType() {return authType;}public void setAuthType(AuthType authType) {this.authType = authType;}public AsrResponse getAsrResponse() {return asr_response;}private byte[] concat_byte(byte[] first, byte[] second) {byte[] result = new byte[first.length + second.length];System.arraycopy(first, 0, result, 0, first.length);System.arraycopy(second, 0, result, first.length, second.length);return result;}private byte[] concat_byte(byte[] first, byte[] second, byte[] third) {byte[] result = new byte[first.length + second.length + third.length];System.arraycopy(first, 0, result, 0, first.length);System.arraycopy(second, 0, result, first.length, second.length);System.arraycopy(third, 0, result, first.length+second.length, third.length);return result;}
}
package com.erroright.backend_server_java.pojo.util;public class AsrResponse {private String reqid = "unknow";private int code = 0;private String message = "";private int sequence = 0;private Result[] result;private Addition addition;public String getReqid() {return reqid;}public void setReqid(String reqid) {this.reqid = reqid;}public int getCode() {return code;}public void setCode(int code) {this.code = code;}public String getMessage() {return message;}public void setMessage(String message) {this.message = message;}public int getSequence() {return sequence;}public void setSequence(int sequence) {this.sequence = sequence;}public Result[] getResult() {return result;}public void setResult(Result[] result) {this.result = result;}public Addition getAddition() {return addition;}public void setAddition(Addition addition) {this.addition = addition;}public static class Result {private String text;private int confidence;private String language;private Utterances[] utterances;private float global_confidence;public String getText() {return text;}public void setText(String text) {this.text = text;}public int getConfidence() {return confidence;}public void setConfidence(int confidence) {this.confidence = confidence;}public String getLanguage() {return language;}public void setLanguage(String language) {this.language = language;}public Utterances[] getUtterances() {return utterances;}public void setUtterances(Utterances[] utterances) {this.utterances = utterances;}public float getGlobal_confidence() {return global_confidence;}public void setGlobal_confidence(float global_confidence) {this.global_confidence = global_confidence;}}public static class Utterances {private String text;private int start_time;private int end_time;private boolean definite;private String language;private Words[] words;public String getText() {return text;}public void setText(String text) {this.text = text;}public int getStart_time() {return start_time;}public void setStart_time(int start_time) {this.start_time = start_time;}public int getEnd_time() {return end_time;}public void setEnd_time(int end_time) {this.end_time = end_time;}public boolean isDefinite() {return definite;}public void setDefinite(boolean definite) {this.definite = definite;}public String getLanguage() {return language;}public void setLanguage(String language) {this.language = language;}public Words[] getWords() {return words;}public void setWords(Words[] words) {this.words = words;}}public static class Words {private String text;private int start_time;private int end_time;private int blank_duration;public String getText() {return text;}public void setText(String text) {this.text = text;}public int getStart_time() {return start_time;}public void setStart_time(int start_time) {this.start_time = start_time;}public int getEnd_time() {return end_time;}public void setEnd_time(int end_time) {this.end_time = end_time;}public int getBlank_duration() {return blank_duration;}public void setBlank_duration(int blank_duration) {this.blank_duration = blank_duration;}}public static class Addition {private String duration;public String getDuration() {return duration;}public void setDuration(String duration) {this.duration = duration;}}
}

调用方式

package com.erroright.backend_server_java.util;import com.erroright.backend_server_java.pojo.util.AsrClient;
import com.erroright.backend_server_java.pojo.util.AsrResponse;
import com.fasterxml.jackson.core.JsonProcessingException;
import lombok.extern.slf4j.Slf4j;
import org.springframework.stereotype.Component;import java.io.File;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.IOException;
import java.net.URISyntaxException;
import java.security.InvalidKeyException;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;@Slf4j
@Component
public class SttStreamClient {String appid = "";  // 项目的 appidString token = "";  // 项目的 tokenString cluster = "";  // 请求的集群String audio_format = "wav";  // wav 或者 mp3, 根据音频类型设置AsrClient asr_client = null;SttStreamClient() throws URISyntaxException, IOException, NoSuchAlgorithmException, InvalidKeyException, InterruptedException {asr_client = AsrClient.build();asr_client.setAppid(appid);asr_client.setToken(token);asr_client.setCluster(cluster);asr_client.setFormat(audio_format);asr_client.setShow_utterances(true);asr_client.asr_sync_connect();}public  String STT(  byte[] file ) throws URISyntaxException, JsonProcessingException, FileNotFoundException {long startTime = System.currentTimeMillis();String STTResult="";try {// File file = new File(audio_path);// FileInputStream fp = new FileInputStream(file);byte[] b = new byte[64000];int len = 0;int count = 0;AsrResponse asr_response = new AsrResponse();// while ((len = fp.read(b)) > 0) {//     count += 1;//     asr_response = asr_client.asr_send(Arrays.copyOfRange(b, 0, len), fp.available() == 0);// }while (len < file.length) {int bytesToRead = Math.min(b.length, file.length - len);System.arraycopy(file, len, b, 0, bytesToRead);len += bytesToRead;asr_response = asr_client.asr_send(Arrays.copyOfRange(b, 0, bytesToRead), len == file.length);count += 1;}// get asr text// AsrResponse response = asr_client.getAsrResponse();for (AsrResponse.Result result: asr_response.getResult()) {STTResult+=result.getText();}} catch (Exception e) {System.err.println(e.getMessage());} finally {if (asr_client != null) {asr_client.asr_close();}long endTime = System.currentTimeMillis();log.info("语音识别执行时间: " +( endTime - startTime) / 1000.0);return STTResult;}}
}



 

这篇关于调用火山云的语音生成TTS和语音识别STT的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/1139100

相关文章

MybatisGenerator文件生成不出对应文件的问题

《MybatisGenerator文件生成不出对应文件的问题》本文介绍了使用MybatisGenerator生成文件时遇到的问题及解决方法,主要步骤包括检查目标表是否存在、是否能连接到数据库、配置生成... 目录MyBATisGenerator 文件生成不出对应文件先在项目结构里引入“targetProje

Python使用qrcode库实现生成二维码的操作指南

《Python使用qrcode库实现生成二维码的操作指南》二维码是一种广泛使用的二维条码,因其高效的数据存储能力和易于扫描的特点,广泛应用于支付、身份验证、营销推广等领域,Pythonqrcode库是... 目录一、安装 python qrcode 库二、基本使用方法1. 生成简单二维码2. 生成带 Log

Idea调用WebService的关键步骤和注意事项

《Idea调用WebService的关键步骤和注意事项》:本文主要介绍如何在Idea中调用WebService,包括理解WebService的基本概念、获取WSDL文件、阅读和理解WSDL文件、选... 目录前言一、理解WebService的基本概念二、获取WSDL文件三、阅读和理解WSDL文件四、选择对接

Python使用Pandas库将Excel数据叠加生成新DataFrame的操作指南

《Python使用Pandas库将Excel数据叠加生成新DataFrame的操作指南》在日常数据处理工作中,我们经常需要将不同Excel文档中的数据整合到一个新的DataFrame中,以便进行进一步... 目录一、准备工作二、读取Excel文件三、数据叠加四、处理重复数据(可选)五、保存新DataFram

SpringBoot生成和操作PDF的代码详解

《SpringBoot生成和操作PDF的代码详解》本文主要介绍了在SpringBoot项目下,通过代码和操作步骤,详细的介绍了如何操作PDF,希望可以帮助到准备通过JAVA操作PDF的你,项目框架用的... 目录本文简介PDF文件简介代码实现PDF操作基于PDF模板生成,并下载完全基于代码生成,并保存合并P

Java调用Python代码的几种方法小结

《Java调用Python代码的几种方法小结》Python语言有丰富的系统管理、数据处理、统计类软件包,因此从java应用中调用Python代码的需求很常见、实用,本文介绍几种方法从java调用Pyt... 目录引言Java core使用ProcessBuilder使用Java脚本引擎总结引言python

Python基于火山引擎豆包大模型搭建QQ机器人详细教程(2024年最新)

《Python基于火山引擎豆包大模型搭建QQ机器人详细教程(2024年最新)》:本文主要介绍Python基于火山引擎豆包大模型搭建QQ机器人详细的相关资料,包括开通模型、配置APIKEY鉴权和SD... 目录豆包大模型概述开通模型付费安装 SDK 环境配置 API KEY 鉴权Ark 模型接口Prompt

详解Java中如何使用JFreeChart生成甘特图

《详解Java中如何使用JFreeChart生成甘特图》甘特图是一种流行的项目管理工具,用于显示项目的进度和任务分配,在Java开发中,JFreeChart是一个强大的开源图表库,能够生成各种类型的图... 目录引言一、JFreeChart简介二、准备工作三、创建甘特图1. 定义数据集2. 创建甘特图3.

java如何调用kettle设置变量和参数

《java如何调用kettle设置变量和参数》文章简要介绍了如何在Java中调用Kettle,并重点讨论了变量和参数的区别,以及在Java代码中如何正确设置和使用这些变量,避免覆盖Kettle中已设置... 目录Java调用kettle设置变量和参数java代码中变量会覆盖kettle里面设置的变量总结ja

AI一键生成 PPT

AI一键生成 PPT 操作步骤 作为一名打工人,是不是经常需要制作各种PPT来分享我的生活和想法。但是,你们知道,有时候灵感来了,时间却不够用了!😩直到我发现了Kimi AI——一个能够自动生成PPT的神奇助手!🌟 什么是Kimi? 一款月之暗面科技有限公司开发的AI办公工具,帮助用户快速生成高质量的演示文稿。 无论你是职场人士、学生还是教师,Kimi都能够为你的办公文