Generates extremely realistic text to speech, using GPT-4o Audio. Unlike 11labs, this has no limit to usage, and can even be used as API.