Link: https://copilot.microsoft.com/labs/audio-expression
Microsoft just released a groundbreaking text-to-speech (TTS) model, and I'm super impressed by how crisp it sounds. It's not robotic at all and can express a variety of styles.
I used it to generate some announcement/marketing audio. I can't say I'll use it in the future, but it definitely helps guide which expressions I want in an advertisement.
I was initially confused about why it said a bunch of things when I put in "Hi", because TTS usually just repeats the text you enter, but the prompt there is just a guide for what to generate. It does repeat what you put in, but with more oomph!
Google's Veo 3 is good too, since it generates both video and audio, but it could be better. If it could do what this TTS does, Google would have a truly groundbreaking video generation service.
I have to say, AI is definitely the great deflator, as it drags down the cost of digital content creation. Too bad we live in a world where things keep getting more expensive, so there won't be much, if any, cost savings passed on from all of this. No company is going to charge less for its services just because AI increased productivity and decreased the cost of production. It's just not happening.
I love seeing this kind of tech, but I want to see its impact on the economy even more.