The request takes a very long time, even when I'm giving it a tiny audio file and using the tiny.en model, and it eventually always ends in an internal server error.
Could it be that the micro-service only accepts audio files in some subset of file formats / codecs? If yes, which ones?
I've tried .mp3 and .wav so far.
I was hoping that running the tiny.en model locally in my docker environment would return a result faster than using OpenAI's whisper API, but that does not seem to be the case.