Hide and Speak: Deep Neural Networks for Speech Steganography

Felix Kreuk*, Yossi Adi*, Bhiksha Raj, Rita Singh, Joseph Keshet

*Corresponding authors.

Below are some samples from our steganography system for you to listen. "Original Carrier" refers to the original audio in which we conceal a hidden message. "Embedded Carrier" refers to the audio with a hidden message, "Original Message" referes to the original message we wish to hide inside "Original Carrier" and "Decoded Message" is the message decoded from "Embedded Carrier".

Note,

  • "Original Carrier" and "Embedded Carrier" should sound identical so as to not raise suspicion.
  • "Decoded Message" should be intelligble for a human listener but is allowed to suffer some degradtion in quality (espicially for high compression rates).

TIMIT Single Message

Original Carrier Embedded Carrier Original Message Decoded Message








YOHO Single Message

Original Carrier Embedded Carrier Original Message Decoded Message








TIMIT 3 Messages (x3 compression)

Original Carrier Embedded Carrier Original Message Decoded Message








YOHO 3 Messages (x3 compression)

Original Carrier Embedded Carrier Original Message Decoded Message








YOHO 5 Messages (x5 compression)

Original Carrier Embedded Carrier Original Message Decoded Message