Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

语音 robonix/service/speech

语音服务提供 ASR(识别)、TTS(合成)、以及端到端对话。一元与流式都有:asr / tts 是一元 RPC,asr_stream / tts_stream / dialog 是流式,给低延迟语音交互用。它是 Liaison 语音 pipeline 的 ASR / TTS 两段。

能力约定 TOML 在 capabilities/service/speech/,IDL 在 capabilities/lib/{speech,asr,tts}/

接口

能力约定 ID模式载荷(IDL)能力约定 TOML
robonix/service/speech/driverrpclifecycle/Driverservice/speech/driver.v1.toml
robonix/service/speech/asrrpcasr/Recognizeservice/speech/asr.v1.toml
robonix/service/speech/asr_streamrpc_bidirectional_streamasr/RecognizeStreamservice/speech/asr_stream.v1.toml
robonix/service/speech/ttsrpctts/Synthesizeservice/speech/tts.v1.toml
robonix/service/speech/tts_streamrpc_server_streamtts/SynthesizeStreamservice/speech/tts_stream.v1.toml
robonix/service/speech/dialogrpc_server_streamspeech/StartDialogservice/speech/dialog.v1.toml
robonix/service/speech/speakrpcspeech/Speakservice/speech/speak.v1.toml
robonix/service/speech/list_speakersrpcspeech/ListSpeakersservice/speech/list_speakers.v1.toml

speak 是“合成并直接播放“的便捷入口(服务内部接扬声器);tts / tts_stream 只回音频流,由调用方播放。list_speakers 列可用音色。流式能力约定只能走 gRPC(ROS 2 不原生支持流式 RPC)。

参考实现:services/speech(FunASR ASR + TTS)。