The most important choice when producing speech is most likely the selection of the speaker.
At the moment api.audio offers speech models from the following providers:
AWS Polly, Google Text-To-Speech, Microsoft Azure, IBM and Messner. A total of 250+ voice!
Further, api.audio allows you to clone a voice, as well as the voices of your users, which will then become available in your organisation for speech creation.
You can retrieve a list of all the speakers that are available in your organisation:
# Get all available voices and print them all_voices = aflr.Voice().list() print(all_voices)
Remember to always use your API key!
aflr.api_key = "your-key"
Some voice models allow additional parameter tuning beyond standard SSML (see here) annotation.
Speaker selection can get unwieldy quickly. Hence api.audio offers additional information for each speaker which allows you to filter, sort, assess and choose. This is especially useful when you are building a frontend that allows your users to make such a choice.
Updated about 1 month ago
See a list of top 20 speakers