Synthesia Unveils AI Upgrade Enabling Avatars to Mimic Human Emotions and Movements
Summary:
Synthesia, an AI startup backed by Nvidia, has launched "Expressive Avatars," an upgrade that enables avatars to mimic human emotions and movements. The enhancement aims to produce more accurate depictions of humans, rectifying issues such as distorted body parts. The new avatars can respond to instructions reflecting emotions, support over 130 languages, provide closed captions and mimic user's voices. Synthesia, valued at nearly $1 billion, caters to more than 55,000 firms, including many Fortune 100 companies.
Synthesia, an artificial intelligence (AI) company backed by Nvidia, has unveiled an innovative upgrade which equips AI avatars to mimic human emotions and actions. On April 25, the firm launched its "Expressive Avatars," engineered to display feelings in line with text commands, used primarily for enterprise presentations, marketing, and instructional functions.
Despite generative AI being hailed for its capability to fabricate convincing motion graphics, as seen with OpenAI’s Sora video generator, it's still far from perfect, particularly when it comes to emulating humans, often resulting in distorted body parts, mismatched backgrounds, or uncoordinated lip movements with speech. Synthesia seeks to rectify these inaccuracies in its most recent iteration developed using actual people reading scripts. This approach aids bots in perfecting lip movement accuracy and in refining their emotional representation. Synthesia's CEO and co-founder, Victor Ribarbelli, highlighted in a video that unlike humans, avatars previously lacked comprehension of their speech, robbing them of appropriate facial reactions.
Training in the studio involved avatars successful response to basic directives like "I am happy. I am sad. I am frustrated", resulting in accurate replication of associated facial cues and intonation. The company’s new avatars support over 130 languages, offer automatic captioning and can duplicate a user’s voice. The English language model proves to be the most realistic and sophisticated amongst other language models based on a test by Cointelegraph.
Synthesia, listed with at least half of the Fortune 100 companies as its customers, caters to more than 55,000 businesses, including market leaders like Zoom, Xerox, Microsoft and Reuters, to name a few. Founded in 2017 in the United Kingdom, the company has rocketed to a near $1 billion valuation, driven by the recent AI surge and the backing of significant players like Nvidia who lead AI semiconductor chip innovation. With its focused mission of devising lifelike avatars for commercial applications, Synthesia has managed to circumvent some of the overheated rivalry seen in the chatbot space, where models like OpenAI's ChatGPT and Google's Gemini chatbot are head-to-head.
Published At
4/26/2024 2:51:52 PM
Disclaimer: Algoine does not endorse any content or product on this page. Readers should conduct their own research before taking any actions related to the asset, company, or any information in this article and assume full responsibility for their decisions. This article should not be considered as investment advice. Our news is prepared with AI support.
Do you suspect this content may be misleading, incomplete, or inappropriate in any way, requiring modification or removal?
We appreciate your report.