about python SDK speaker.
Options
Hello, there
Referring to the GitHub below, I received the user's audio input in bytes format using Python SDK speaker.
To make sure the audio data is correct, I converted the collection of data into mp3 files and proceeded with the STT request, but an error is occurring in the mp3 generation.
Bytes when status is speaking were combined to create the entire audio data.
Is there a way to get the user's audio input as it is through python SDK?
Thanks
Dominic
0
Answers
-
Hi @Dominic . The bytes received from a virtual speaker are PCM S16 not MP3. Does your STT service accept PCM (or WAV)? This example shows how to send audio to Google STT:
0