I've not had much luck with the automatic 'generate text' functionality. It takes ages and often returns garbage when used with accents or low level audio.
As a tip, I'd recommend this:
https://www.apowersoft.com/speech-to-text-onlineFree, faster and much better speech recognition. You can then just copy and paste the resulting text.
Unity Virtual Reality Developer