Hello,
I'm using Twilio to record video and audio conversations and when they are finished, I need to transcribe them. Twilio saves the files with the opus codec and with .mka format.
When I make the API call I pass this "audio/ogg; codecs=opus" as Content-Type but I get a bad request. If I convert the file to .wav (using an online converter) and use "audio/wav; codec=audio/pcm; samplerate=16000" as Content-Type it returns the transcription text correctly.
Am I doing something wrong? Or will I have to convert the file to .wav format every time?