For Converting Text documents to MP3, you are getting into TTS (Text To Speech) domain. This is what people technically call it. There are two TTS methods available Human Software. Human Get someone from Fiverr, Upwork. This comes at a cost. If you want to save cost and are comfortable recording yourself, go for it. You’d need a decent (somewhere between good to great) microphone, low noise (or no noise) recording room and a god software to play with the audio recorded. Audacity is one such recording software, it’s a great tool & comes for FREE. Software There are lots of software solutions available, much more than even I know. Sharing info on some, which are free to a certain limit & after that there is a cost. Google Cloud, Amazon AWS, IBM Watson are the ones which I consider good. The cost is low per any standards. e.g. for pricing of Amazon Polly (Amazon’s TTS solution) - Amazon Polly Pricing There are some desktop software which can be fed text and t spit out good quality audio. My knowledge is limited to just the one I shared. You may or may not like the Synthetic voice (TTS Software). It’s the cost benefit analysis you have to do. Do let us know, what choice you made.
I guess the main takeaway from this story is that you have to be brave to try something like that. I am not really comfortable with it, but it is a personal quest, I suppose. Now that we have the background, what we need to know about using a TTS (text to speech) software, its advantages and disadvantages, let us get into details. Why using a TTS Software? TTS Software vs. Synthetic Voice (TTS Software) There are good and bad reasons and advantages to using a TTS Software, let us know whether you need one or the other for your content creation. Pros: There are fewer requirements and thus time saved. It is cheaper if used on occasional basis. You can be creative. Cons: You could be creating something which is really different from others. It has no real time lag and no actual time. It could sound fake on occasion. You could end up losing track of your target audience. TTS vs. HSP.