Speech and facial recognition combine to boost AI emotion detection



Researchers have combined speech and facial recognition data to improve the emotion detection abilities of AIs.

The ability to recognise emotions is a longstanding goal of AI researchers. Accurate recognition could enable applications such as detecting tiredness at the wheel, spotting anger that might precede a crime, or perhaps even picking up signs of sadness or depression at suicide hotspots.

Nuances in how people speak and move their facial muscles to express moods have presented a challenge. In a paper (PDF) on arXiv, researchers at the University of Science and Technology of China in Hefei detail some progress.

In the paper, the researchers wrote:

“Automatic emotion recognition (AER) is a challenging task due to the abstract concept and multiple expressions of emotion.

Inspired by this cognitive process in human beings, it’s natural to simultaneously utilize audio and visual information in AER … The whole pipeline can be completed in a neural network.”

Breaking the process down as simply as possible, the system is made of two parts: one for visual input and one for audio.

For the video system, frames of faces are run through two further stages: a basic face detection algorithm, followed by three facial recognition networks optimised to extract 'emotion-relevant' features.
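To make that concrete, here is a minimal Python sketch of what such a visual stream could look like. The face detector (OpenCV's Haar cascade) and the untrained ResNet backbones are stand-ins of my own, not the exact models used in the paper.

# Visual stream sketch: detect a face in each frame, then extract features
# with three CNN backbones standing in for the paper's 'emotion-relevant'
# networks. Detector and backbones are illustrative assumptions.
import cv2
import torch
import torchvision.models as models

face_detector = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")

# Three generic backbones as placeholders for the emotion-optimised networks.
backbones = [models.resnet18().eval() for _ in range(3)]

def visual_features(frame_bgr):
    """Return concatenated per-frame face features, or None if no face is found."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    faces = face_detector.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    if len(faces) == 0:
        return None
    x, y, w, h = faces[0]                      # keep the first detected face
    crop = cv2.resize(frame_bgr[y:y+h, x:x+w], (224, 224))
    tensor = torch.from_numpy(crop).permute(2, 0, 1).float().unsqueeze(0) / 255.0
    with torch.no_grad():
        feats = [net(tensor).flatten() for net in backbones]
    return torch.cat(feats)                    # one feature vector per frame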

As for the audio system, speech spectrograms are fed into sound-processing algorithms that help the AI model focus on the regions most relevant to emotion.
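A log-mel spectrogram is the typical time-frequency image fed to audio networks of this kind. The sketch below, using librosa, is only illustrative; the sample rate and number of mel bands are assumptions rather than the paper's settings.

# Audio stream sketch: convert a speech clip into a log-mel spectrogram,
# the kind of image the audio network consumes. Parameters are illustrative.
import librosa
import numpy as np

def speech_spectrogram(path, sr=16000, n_mels=64):
    y, sr = librosa.load(path, sr=sr)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)   # shape: (n_mels, time_frames)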

Measurable characteristics are extracted by the four facial networks in the video system and matched with features from the speech side to capture associations between them, producing a final emotion prediction.
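As a rough illustration of that final step, the sketch below concatenates a visual feature vector and an audio feature vector and maps them to the seven emotion classes used in the challenge. The layer sizes are assumptions, and the paper's fusion may well use attention between the streams rather than plain concatenation.

# Fusion sketch: project visual and audio features into a shared space,
# concatenate them, and predict one of seven emotion classes.
import torch
import torch.nn as nn

class AudioVisualEmotionNet(nn.Module):
    def __init__(self, visual_dim=3000, audio_dim=512, hidden=256, n_classes=7):
        super().__init__()
        self.visual_proj = nn.Linear(visual_dim, hidden)
        self.audio_proj = nn.Linear(audio_dim, hidden)
        self.classifier = nn.Sequential(
            nn.ReLU(),
            # 7 classes: angry, disgust, fear, happy, neutral, sad, surprise
            nn.Linear(2 * hidden, n_classes),
        )

    def forward(self, visual_feats, audio_feats):
        v = self.visual_proj(visual_feats)
        a = self.audio_proj(audio_feats)
        return self.classifier(torch.cat([v, a], dim=-1))

# Usage with dummy features for a batch of 4 clips:
model = AudioVisualEmotionNet()
logits = model(torch.randn(4, 3000), torch.randn(4, 512))
emotion = logits.argmax(dim=-1)                 # predicted class index per clip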

The AI was fed 653 video clips and their corresponding audio from AFEW 8.0, a database of film and television clips used for a sub-challenge of EmotiW 2018.

In the challenge, the researchers' AI performed admirably: it correctly determined the emotions ‘angry,’ ‘disgust,’ ‘fear,’ ‘happy,’ ‘neutral,’ ‘sad,’ and ‘surprise’ about 62.48 percent of the time.

Overall, the AI performed better on emotions with obvious characteristics, such as ‘angry,’ ‘happy,’ and ‘neutral.’ It struggled more with subtler emotions like ‘disgust’ and ‘surprise.’

