Audio Cues for Detecting Deepfakes

This page contains videos which discuss audio cues for detecting deepfakes – known as Expert-Defined Linguistic Features (EDLFs).

Prior studies show some linguistic features are shared between deepfake audio, and natural human speech. Some of those features are listed here, along with how you can differentiate between a real person speaking, and computer-generated speech.

Learn more about how you can identify each linguistic feature for spoofed audio detection with the videos below.

Audio Cues for Detecting Deepfakes (EDLFs) – An Overview

How can you tell a voice you’re listening to might be fake? Our multidisciplinary team of sociolinguists and machine learning experts collaborated to come up with a tool to help you tell fake speech from real speech. This video gives an introduction to our five Expert-Defined Linguistic Features (EDLFs) of fake and real speech, providing you with a strategy for spotting audio deepfakes.

Video Transcript

Listening for Pitch

This video goes into detail about one of our Expert-Defined Linguistic Features (EDLFs), Pitch, showing you how to listen for this feature when you hear potentially fake audio.

Video Transcript

Listening for Pause

This video gives a more in-depth discussion of Pause, one of our Expert-Defined Linguistic Features (EDLFs) and how to listen for this feature in a speech sample.

Video Transcript

Listening for Consonant Bursts

This video provides a definition of Initial and Final Stop Consonant Bursts, one of our Expert-Defined Linguistic Features (EDLFs) and shows you how to identify this feature in stop consonants in spoken English.

Video Transcript

Listening for Breath

In this video, we explain one of our Expert-Defined Linguistic Features (EDLFs), Breath, and show you how to listen for it in a speech sample.

Video Transcript

Listening for Audio Quality

This video discusses Audio Quality, one of our Expert-Defined Linguistic Features (EDLFs) that can be used to spot fake speech.

Video Transcript

Audio Cues for Detecting Deepfakes, in Review

In this video, we’ll briefly review how to use our five Expert-Defined Linguistic Features (EDLFs) as a tool to spot fake audio. Remember, human language is complex, and variation in speech is normal and natural. As you use EDLFs to help you spot misleading content, it’s important to always keep context–information about the individual speaker, intended audience, and setting–in mind when discerning real from fake.

Video Transcript

NSF Award #2346473

Infographics by Pragya Pandit
Website Design by Lavanya Neelakandan

Search UMBC

Audio Cues for Detecting Deepfakes (EDLFs) – An Overview

Listening for Pitch

Listening for Pause

Listening for Consonant Bursts

Listening for Breath

Listening for Audio Quality

Audio Cues for Detecting Deepfakes, in Review

Subscribe to UMBC Weekly Top Stories

I am interested in: