AI systems that convert written text into natural-sounding spoken audio.
AI systems that convert written text into natural-sounding spoken audio. Modern TTS like ElevenLabs and OpenAI's voice models produce speech that's nearly indistinguishable from human recordings. Used in virtual assistants, accessibility tools, content creation, and dubbing.
Converting spoken audio into written text.
Using AI to create a synthetic copy of someone's voice from a small sample of their speech.
AI models that can understand and generate multiple types of data — text, images, audio, video.
A mathematical function applied to a neuron's output that introduces non-linearity into the network.
An optimization algorithm that combines the best parts of two other methods — AdaGrad and RMSProp.
Artificial General Intelligence.
Browse our complete glossary or subscribe to our newsletter for the latest AI news and insights.