Journal of Communications and Information Networks

Figure1 System model for acoustics-based short-range communication system.

In the acoustics-based short-range communication system, we mainly consider a passive eavesdropper whose objective is to obtain the exchanged data during the acoustic transmissions. The threat model is based on the scenario where adversary may deploy multiple sensors (hidden microphones) at any fixed location in priori to the acoustic communication. A more detailed threat model will be discussed at the following section.

3 Secure acoustics-based shortrange communications

3. 1 A brief view of secure acoustics-based short-range communications

Acoustic communication has received continuous and extensive attention and it has been widely used in many underwater wireless communication systems. To secure these acoustics-based short-range communication systems, researchers are utilizing the physical layer of wireless communication to design novel ways to protect communications without using any prior shared key. We take a glance of those stateof-the-art friendly-jamming systems first.

Dhwani: The first acoustics-based short-range communication system, called Dhwani, uses the special random white Gaussian noise as jamming signal, which consists of parts of several one-tie random white Gaussian noises. The main components of Dhwani include an ingress filter, an OFDM (Orthogonal Frequency Division Multiplexing) based software module, and a self-jamming module. It applies PSK (Phase-Shift Keying) as the digital modulation way to module the data.

PriWhisper: As a parallel and independent work, another acoustics-based short range communication system called PriWhisper was proposed in Ref. 2]. PriWhisper also uses the random white Gaussian noise as jamming signal. Different from Dhwani, which focuses on more implementation aspects, the authors aim to provide rigorous security analysis of friendly-jamming technology in acoustics-based short-range communication systems. In their model, they well designs the generated noise to cover the frequencies selected by FSK ( Frequency-Shift Keying) modulation scheme.

3. 2 Threat model and design challenges

Prior work has considered the problem of eavesdropping over acoustic emanations as a side channel. The system security is analyzed in the standard LoS channel model. Both PriWhisper and Dhwani are expected to provide secure communication in the presence of either single or multiple passive eavesdropper(s). A typical scenario is that the eavesdropper places one or more remotely controlled wireless microphone(s) near a user’s workspace in priori and records the acoustic signal during the transmission. In particular, multiple-sensor eavesdroppers may try to separate the data signal from its recorded mixture signals. Fig.2 illustrates a typical attack scenario where the attacker utilizes multiple microphones for eavesdropping (R₁, R₂). Note that s₁and s₂are two original signals (data signal and jamming signal), x₁and x₂are two received mixed signals.

Figure2

Figure2 System workflow for PriWhisper

To fully separate s₁and s₂from x₁and x₂, the adversary continuously launching a separation attack. In this attack, the adversary tries to estimate the data signal and jamming signal by using BSS techniques. FD-ICA (Frequency Domain Independent Component Analysis ) is one of the most famous algorithms. Upon success, the adversary can separate the transmitted data from the mixed signal.

BSS is a technique for estimating original source signals using only observed mixtures. It has a wide range of applications including high-quality telecommunication system and robust speech recognition. FD-ICA is one of the most popular blind signal segmentation techniques. Assume that the original data signal and jamming signal are s_i(t) (i=1, …, N), the signals which are observed by the eavesdropper using microphone j are x_j(t)(j=1, …, M), and the separated signals are y_k(t)(k=1, …, N), the BSS model can be described by the following equations:

x_{j} (t) = \sum_{i = 1}^{N} (h_{i j} * s_{i}) (t),

y_{k} (t) = \sum_{j = 1}^{N} (ω_{k j} * x_{j}) (t),

where h_ji represents the coefficient from source i to the eavesdropper’s microphone j, ω_kj is the coefficient for the FIR (Finite Impulse Response) filter, and *denotes the convolution operator. Instead of applying an ordinary ICA algorithm in the time domain to solve the BSS problem, we can first use a STDFT (Short-Time Discrete Fourier Transform ) to convert the time domain signal into frequency domain, and then apply ICA in the frequency domain. The model is approximated as:

X (ω, n) = H (ω) S (ω, n),

where, ω is the angular frequency, and n denotes the frame index of the recorded audio. S(ω, n) = [S₁(ω, n), …, S_N(ω, n)]^T is the source signal in frequency binω, X(ω, n)=[X₁(ω, n), …, X_M(ω, n)]^Tdenotes the mixed signals. The separating process can be represented by the following equation:

Y (ω, n) = W (ω) X (ω, n),

where Y(ω, n) = [Y₁(ω, n), …, Y_N(ω, n)]^Tdenotes the estimated data and jamming signal, and W(ω) represents the separating matrix. The goal of FD-ICA is to determine the W(ω) so that Y_i(ω, n) and Y_j(ω, n) become mutually independent. Fig.3 shows the whole process.

Figure3

Figure3 Model of BSS system

3. 3 System architecture

The general architecture of self-jamming communication system is shown in Fig.4. Both PriWhisper and Dhwani are designed to enable key-less secure acoustic short-range communication in smartphonesmartphone and smartphone-terminal scenarios. The distance between two devices should be in a few cm. At the beginning of the transmission, the receiver sends jamming noise and the transmitter sends data signal simultaneously. Once the receiver gets the mixture signal, the receiver removes the jamming noise with the help of its own knowledge. However, due to the phase distortion, the receiver can only remove parts of the jamming noise. To protect the confidentiality of the message, the duration time of the jamming noise should always cover the duration time of the message signal.

Figure4

Figure4 Acoustic self-Jamming communication Architecture

The workflows of PriWhisper are shown in Fig.5. For PriWhisper, the transmitter first broadcast a start signal to inform the receiver to prepare for receiving messages. Then the transmitter starts detecting jamming signal. The receiver, once detect the starting signal, begins recording audio signals. In the next step, the receiver plays a synchronization sound for itself to mark the beginning spot of the jamming signal. Once finished, the receiver immediately plays the jamming signal. The power of the jamming signal is the maximum power that the receiver can reach. Once the transmitter detects the jamming signal, it begins to transmit messages. Finally, the receiver removes the jamming signal from its received mixture signal and decode it to get the message.

Figure5

Figure5 System workflow for PriWhisper

4 Future research issues

The practical use of these acoustics-based short range communication systems in a real-world scenario relies on the system usability, data through-put and the security strength provided by the friendly jamming technique. Further research study needs to be done in the following areas.

4. 1 Non-invasive design for acoustics-based communication

To make a non-invasive design of acoustics-based communication, the transmission is made in the ultrasonic or near ultrasonic frequency range that humans cannot hear. The carrier frequency of the smartphone usually has a wider range. However, as most off-the-shelf smartphone speakers are only capable of producing sound with a 44. 1 KHz sample rate, based on Nyquist-Shannon sampling theorem, this allows us to use a maximum frequency of about 22 KHz for transmitting ultrasonic sound on smartphones. The ultrasonic range for most humans is above 19. 5 KHz. Motivated by this observation, it is possible to have a non-invasive design of acousticsbased communication system using the non-voice frequency band, achieving even higher security guarantee.

To implement non-invasive design into these existing systems, new and novel mechanisms of jamming signal generation needed to be designed. Recently, researchers have proposed the perceiving power ratio threshold in Ref. 9]. According to Ref. 9], human can perceive specific audio from background wide-band noise if SNR (Signal-to-Noise-Ratio) of single-frequency audio is above13 dB. While for multi-frequency audio, SNR should be larger than 0 dB. One solution is to adopt spread spectrum encoding where each bit of message is transmitted as random noise that has been filtered to be only in the ultrasonic spectrum 19. 5 KHz to 22 KHz. Modulated noise rather than signal is chosen so that it blends more discreetly into the background. Using the entire spectrum also allows for a low power but high SNR signal. To defend against activate adversaries using ultrasonic frequency, a novel physical layer integrity checking scheme need to be further designed.

4. 2 Enforced proximity communications

Existing acoustic friendly jamming system is assumed that the transmitter and the receiver are both in a fixed location. This makes the system vulnerable to multi-eavesdropping attack if the distance d between the transmitter and the receiver is larger than a threshold h_d (usually 0.5 cm). To enforce proximity communications, researchers rely on two main techniques: distance estimation and bounding protocol and contextual co-presence approach. Distance estimation and bounding protocol aims to cryptographically bound the distance between transmitter and receiver by measuring the response time. However, exiting distance estimation protocols^[10,11,12]and distance bounding protocols^[13] either can only work in special environments or require specialized hardware. Contextual co-presence approach^[10, ^[011, ^[12, ^[14], on the other hand, comparing the ambient information (e. g. , RSSI level, GPS, etc. ) sensed by transmitter and receiver to enforce proximity. These contextual co-presence approach also suffer from a few limitations. First, they aim to determine relative distance (e. g. which device is closer) instead of absolute distance between transmitter and receiver. Second, these approaches are insecure because attackers can modify the ambience around the transmitter and receiver. Recently, a system named Dolphin^[15]has been proposed to restrict distance. Dolphin utilizes the fast decay property of acoustic signals to ensure the distance and uses full-duplex communication to defend against eavesdropping attacks. As a future research direction, it is important and necessary to investigate the effectiveness of these acoustic near filed assertion system, under a very powerful attacker equipped with multiple microphones, by conducting both analytical evaluation and experiments.

4. 3 Channel randomization

Another solution for improving system security is to randomize the acoustic channels from the receiver to the eavesdropper to prevent accurately decoding the message. To implement channel randomization in practice, one possible solution is to leverage recent results in under water acoustic communication^[16]. Ref. 16] shows that due to multi-path effect, even small motion of the transmitter and receiver can create large variation in the acoustic channel. Thus, we could place the transmitter and receiver both on a rotating frame, and randomly change the relative location of both transmitter and receiver to randomize the signal. This creates fast varying acoustic channels with a random distribution. It also provides the channel diversity of an acoustic transmitter with a huge number of possible locations, which renders an eavesdropper unable to separate and decode.

In fact, the BSS algorithm running by the eavesdropper is highly directional and the solution of the FD-ICA is as the same way as an adaptive beamformer. Because of this special characteristic, FD-ICA is robust as regards a moving transmitter. But it may fail to decode when we add a moving receiver since the moving receiver add randomizing to the acoustic channels which makes the eavesdropper impossible to form a spatial null towards the jamming signal. Thus, to defend against multiple passive adversaries, the use of channel randomization techniques may help effectively protect today's widely used commercial acoustic systems from eavesdroppers without degrading usability.

4. 4 Performance enhancement

For performance enhancement design, it has been observed that the data rate, maximum transmission range and robustness of the system is closely related to the frequency modulation scheme and error correction scheme. To further improve the system performance and dependability, it is possible to explore advanced modulation methods (e. g. multiple frequencies amplitude modulation) and repetition error correction scheme. On top of this, a systematic study of system SNR, indoor and outdoor multi-path effect and frequency amplitude modulation methods is also very critical to evaluate the robustness of the acoustics-based short communication system. Moreover, other practical countermeasures and security enhanced schemes may also need to be designed to improve the overall system security.

5 Concluding remarks

In this article we presented an overview of existing acoustics-based short range communication systems, specifically, we picked two state-of-the-art acoustic friendly jamming systems, Dhwani and PriWhisper. They both provide near field communication functionalities and enable stronger security guarantees but require less strict hardware support. However, as we pointed out in the discussion section (Section 4), there is still much room to improve the usability and practicality of these acoustics-based systems in terms of security strength, channel security, transmission data rate and so on. Further, we demonstrated that the non-invasive design, enforced proximity techniques and acoustic channel randomization technique can be combined with many existing security primitives, which opens doors to the designing of a variety of safe acoustic short-range communication systems. We also believe that these acoustics-based short range communication systems will be widely adopting in our daily life after we successfully overcome these technical challenges.

The authors have declared that no competing interests exist.

作者已声明无竞争性利益关系。

Reference

By original order

By published year

By cited within times

By Impact factor

[1]

CHEN

, LI

, QIN

, et al.

AcousAuth: an acoustic-based mobile application for user authentication

[C]// IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), 2014: 215-216.

[2]

ZHANG

, ZHAN

, CHEN

, et al.

Enabling keyless secure acoustic communication for smartphones

[J]. IEEE internet of things journal, 2014, 1(1): 33-45.

[Cited within: 3]

[3]

NANDAKUMAR

, CHINTALAPUDI

K K

, PADMANABHAN

, et al.

Dhwani: secure peer-to-peer acoustic NFC

[C]// ACM SIGCOMM Computer Communication Review, 2013, 43(4): 63-74.

[4]

WEI

, WANG

, ZHOU

, et al.

Acoustic eavesdropping through wireless vibrome-try

[C]// The 21st Annual International Conference on Mobile Computing and Networking, 2015: 130-141.

[5]

KORTVEDT

, MJOLSNES

Eavesdropping near field communication

[C]// 2009: 57.The Norwegian Information Security Conference (NISK), 2009: 57

[6]

GOEL

, NEGI

Guaranteeing secrecy using artificial noise

[J]. IEEE transactions on wireless communications, 2008, 7(6): 2180-9.

[7]

KUMAR

, UMA

M K

, KUMAR

Peer-to-peer acoustic near field communication

[J]. IOSR journal of electronics and communication engineering.

[8]

SKLAR

Digital communications

[M]. Upper Saddle River: Prentice HallPress, 2001.

[9]

STUART

J R

Noise: methods for estimating detectability and threshold

[J]. Journal of the audio engineering society, 1994, 42(3): 124-40.

[10]

, SAXENA

, XIANG

, et al.

Location-aware and safer cards: enhancing RFID security and privacy via location sensing

[J]. IEEE transactions on dependable and secure computing, 2013, 10(2): 57-69.

[11]

SCHRMANN

, SIGG

Secure communication based on ambient audio

[J]. IEEE transactions on mobile computing, 2013, 12(2): 358-70.

[12]

HALEVI

, MA

, SAXENA

, et al.

Secure proximity detection for NFC devices based on ambient sensor data in computer security C ESORICS 2012

[M]. Berlin: Springer Berlin HeidelbergPress, 2012: 379-396.

[13]

BRANDS

, CHAUM

Distance-bounding protocols

[C]// Workshop on the Theory and Application of of Cryptographic Techniques, 1993: 344-359.

[14]

KRUMM

, HINCKLEY

The nearme wireless proximity server

[C]// InInternational Conference on Ubiquitous Computing, 2004: 283-300.

[15]

, XUE

, ZHAO

The Power of Whispering: Near Field Assertions via Acoustic Communications

[C]// The 10th ACM y on Information,Computer and Communications Security, 2015: 627-632.

[16]

STOJANOVIC

, BEAUJEAN

P P

Acoustic communication

[R]. DOI 10.1109/ACCESS.2016.2552538,IEEE Access, 2016.