This is a niche that seems to lack portable options; most are web-based services like AWS https://aws.amazon.com/transcribe/ or YouTube.
But sometimes you need to use an offline generator for sensitive data.
Some further links that may be useful:
https://towardsdatascience.com/generati ... 2c633936a7
https://github.com/TensorSpeech/TensorFlowASR