Journal of Communications and Information Networks, 2017, 2(1): 41-51 doi: 10.1007/s41650-017-0004-z

Research papers

Wireless channel feature extraction via GMM and CNN in the tomographic channel model

Haihan Li,1,2, Yunzhou Li,2, Shidong Zhou1,2, Jing Wang2

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China

Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China

Corresponding authors: Yunzhou Li,liyunzhou@mail.tsinghua.edu.cn

Fund supported: the National Natural Science Foundation of China61631013
National Key Basic Research Program of China (973 Program)2013CB329002
National Major Project2014ZX03003002-002
Program for New Century Excellent Talents in UniversityNCET-13-0321

作者简介 About authors

Haihan Li received the B. S. and M. S. degree from Tsinghua University, Bei-jing China in 2011 and 2014 respec-tively. He is currently working toward the Ph. D. degree. His research now focuses on channel modeling and radio channel measurements, as well as the data mining of statistical CSI. E-mail:lihh14@mails.tsinghua.edu.cn.

Yunzhou Li re-ceived his Ph. D. degree from Tsinghua University in 2004. currently, he is an associate professor of Tsinghua Univer-sity. Dr. Li mainly focus on signal pro-cessing technologies in wireless and mo-bile communications, including spatial-time signal processing, channel estima-tion, multi-user detection, and synchronization algorithms for CDMA/OFDM system. Also he is interested in analysis, op-timization design and enhancement of cellular system and WLAN. E-mail:liyunzhou@mail.tsinghua.edu.cn.

Shidong Zhou received the B. S. and M. S. degrees in wireless communica-tions from Southeast University, Nanjing, China, in 1991 and 1994, respectively, and the Ph. D. degree in communication and information systems from Tsinghua Uni-versity, Beijing, China, in 1998. From 1999 to 2001, he was in charge of sev-eral projects in the China 3G Mobile Communication R&D Project. He is currently a professor at Tsinghua University. He is currently a member of the China Fu-TURE Project. His research interests are in the area of wireless and mobile communications. .

Jing Wang received the B. S. and M. S. degree in electronic engineering from Ts-inghua University, Beijing China in 1983 and 1986 respectively. He has been on the faculty at the Tsinghua University since 1986. He is currently a professor of the School of Information Science and Tech-nology. He serves as the vice director of Tsinghua National Lab for Information Science and Technol-ogy. His research interests are in the area of wireless commu-nications, including transmission and networking technologies of 4G/B4G. He has published more than 150 conference and journal papers. .

Abstract

Abstract: Wireless channel modeling has always been one of the most fundamental highlights of the wireless communication research. The performance of new advanced models and technologies heavily depends on the accuracy of the wireless CSI (Channel State Information). This study examined the randomness of the wireless channel parameters based on the characteristics of the radio propagation environment. The diversity of the statistical properties of wireless channel parameters inspired us to introduce the concept of the tomographic channel model. With this model, the static part of the CSI can be extracted from the huge amount of existing CSI data of previous measurements, which can be defined as the wireless channel feature. In the proposed scheme for obtaining CSI with the tomographic channel model, the GMM(Gaussian Mixture Model)is applied to acquire the distribution of the wireless channel parameters, and the CNN(Convolutional Neural Network) is applied to automatically distinguish different wireless channels. The wireless channel feature information can be stored offline to guide the design of pilot symbols and save pilot resources. The numerical results based on actual measurements demonstrated the clear diversity of the statistical properties of wireless channel parameters and that the proposed scheme can extract the wireless channel feature automatically with fewer pilot resources. Thus, computing and storage resources can be exchanged for the finite and precious spectrum resource.

Keywords: wireless channel modeling ; tomographic channel model ; Gaussian mixture model ; convolutional neural network

PDF (1528KB) Metadata Metrics Related articles Export EndNote| Ris| Bibtex  Favorite

Cite this article

Haihan Li. Wireless channel feature extraction via GMM and CNN in the tomographic channel model. [J], 2017, 2(1): 41-51 doi:10.1007/s41650-017-0004-z

1 Introduction

The wireless channel is the most important feature of wireless communication and the fundamental difference between wireless and wired communications. Wireless channel modeling plays a very important role in both theoretical and practical research on modern wireless technology. Thus, channel modeling has always held a unique appeal for wireless communication researchers. Accurate and efficient wireless channel models help meet the demands of advanced technologies for CSI(Channel State Information)to achieve adequate performance in 5G and future wireless communication systems. In other words, the performance of new advanced technologies heavily depends on the accuracy of CSI.

The common way to acquire CSI is by using pilot symbols or a training sequence. A more complex wireless communication system means a higher frequency cost. Spectrum resources are limited and precious. Thus, the pilot overhead can become intolerable, especially for massive MIMO (MultipleInput Multiple-Output)antenna systems and UDNs (Ultra-Dense Networks).

Different kinds of wireless channel models have been proposed to better understand the wireless channel. Proper wireless channel models are the basis of accurate CSI. Given the statistical properties of channel parameters, wireless channel models can generally be divided into deterministic and stochastic models.

Deterministic models are based on the belief that wireless channel parameters can be calculated from the propagation environment information and propagation law of the electromagnetic wave. The raytracing channel model and its extensions are representative deterministic models[1]. The ray-tracing model can simulate almost all of the reflection, diffraction, refraction, and scattering by using wellknown radio propagation environment information between the transmitter and receiver. Then, the information of almost every propagation path can be calculated theoretically to obtain the CSI of the channel.

GSCMs (Geometry-based Stochastic Channel Models) are one of the most popular stochastic models[2]. GSCMs can reproduce the stochastic properties of different kinds of wireless channels over time, frequency, and space. Because of the scalability and reasonable complexity of GSCM, it has been widely adopted by many common wireless channel models, such as WINNER II/+, COST2100, and IEEE 802. 11ad[3,4,5,6,7].

The key for deterministic models to acquire sufficiently accurate CSI is sufficiently abundant information on the radio propagation environment. In other words, the accuracy of the CSI calculation result is closely related to the electromagnetic param eters that can be determined from the radio propagation environment, such as the reflection coefficient of different materials[1]. As the required level of accuracy for CSI increases, the calculation process of a deterministic model becomes more complex. Thus, deterministic models are feasible but much too complex to be practical in real wireless communication systems. Stochastic models are more widely used for system design and performance testing because the reflection, diffraction, refraction, and scattering in the radio propagation environment are completely random[8,9]. Thus, GSCMs are simpler but do not reveal information on the radio propagation environment. Such models cannot reflect the actual CSI in real communication systems.

Obviously, there is a tradeoff between the accuracy of CSI and the computational complexity or spectrum resources. For pilot-aided channel estimation, massive antennas and complex networks cost a huge amount of spectrum resources. Thus, new channel models and methods for acquiring CSI are necessary to reduce the pilot cost and improve the performance of the communication system. Typical propagation scenarios have their own unique radio propagation environment. There is a trend where radio propagation environment information of typical propagation scenarios is used to apply prior information to help model a wireless channel and reduce the cost used to determine the CSI. Some HST (High-Speed Train) channel models divide the HST propagation environment into different typical scenarios, such as open space, viaducts, cutting, hilly terrain, tunnels, and stations[10,11,12]. The METIS(Mobile and wireless communications Enablers for the Twenty-twenty Information Society)has introduced the METIS channel models, which consist of a map-based model, a stochastic model, and a hybrid model combin ing the previous two[13,14,15]. The map-based model is based on ray tracing and uses a simplified threedimensional geometric description of the propagation environment. Thus, it inherently accounts for significant propagation mechanisms such as diffraction, specular reflection, diffuse scattering, and blocking. They reported that the map-based model enjoys ex cellent support for massive-MIMO, spherical waves, dual mobility, and other advanced technologies. A few studies based on the map-based model have produced large-scale measurement results. Detailed research on small-scale information or full CSI is needed. Refs. [16,17]presented a general framework for modeling and reproducing environment-specific channel properties in V2V(Vehicle-to-Vehicle)communication. In contrast to the traditional deterministic models or stochastic models, these new models try to introduce some deterministic pattern or feature into the stochastic models to obtain extra information with the purpose of acquiring CSI acquisition at a lower cost. Such advances inspired us to focus on the relationship between the radio propagation environment and CSI in order to incorporate radio propagation environment information of a wireless channel to acquire the CSI at lower cost and with greater accuracy. However, most of these new wireless channel models only focus on large-scale information[18,19]. Detailed research on small-scale information is needed.

In this study, the relationship between the radio propagation environment and CSI and the statistical properties of wireless channel parameters were analyzed. The diversity of wireless channel parameters was used to help explain the concept of the tomographic channel model. The tomographic channel model consists of static, dynamic, and disturbing CSI[20]. Based on the tomographic channel model, the GMM and EM(Expectation Maximization)were applied to extract the static CSI from the existing channel measurement campaign data. The statistical properties of the static CSI can guide the pilot design. An online-offline scheme to acquire CSI via the tomographic channel model is proposed. A CNN (Convolutional Neural Network)was applied to identify different wireless channels automatically. The numerical results of the channel measurement campaigns showed that the extracted and stored wireless channel features introduces extra information on the radio propagation environment and helps reduce the pilot cost. In addition, the CNN can automatically identify different wireless channels and help decide which relevant wireless channel features should be used.

The paper is organized as follows. Section 2 presents the tomographic channel model and onlineoffline scheme for CSI acquisition. Section 3 describes the application of the GMM-EM algorithm to obtaining static CSI (wireless channel feature) from the huge amount of existing wireless channel data. Section 4 describes how the CNN was applied to identify different wireless channels automatically. Section 5 presents the results of repeated measurements and the performance of the method.

2 Tomographic channel model

The classic multi-path channel model can be expressed as[21]

h(τ;t)=l=1Lβl(t)δ(ττl),     (1)

βl(t)=αl(t)ej(2πflt+ϕl)=βl(t)˜ej(2πflt),     (2)

βl(t)˜=αl(t)ejϕl,      (3)

fl=fcvccosθl.      (4)

where L is the number of MPCs (MultiPath Components), βl(t) is the complex amplitude and τl is the time delay of the lth path. fl is the Doppler frequency and φl is the phase of the lth path. βl(t)˜ is the complex amplitude upgraded with the phase of the lth path. fc is the carrier frequency and θl is the DOA(Direction of Arrival)of the lth path. c is the speed of light and ν is the velocity of the MS(Mobile Station). If the MS moves at a low speed (v ? c), fl ≈ 0 and βl(t)=αl(t)ejϕl. Now, the βl(t) is the complex amplitude regardless of the change of DOA and velocity of the MS.

If the channel impulse response is assumed to be wide sense stationary over a small-scale time or distance, the h(τ; t)can be simplified as

h(τ)=l=1Lβlδ(ττl).     (5)

Theoretically, the parameters of each path and the CSI are determined by the radio propagation environment. The radio propagation environment is physically defined by the position and motion of the BS (Base Station), MS, and surrounding scatterers. The scatterers can basically be divided into two kinds: static and moving. Static scatterers mainly include buildings, trees, and so on in the radio propagation environment. Moving scatterers mainly consist of moving people and cars. In a real radio propagation environment, the BS and most of the scatterers are usually static. When the MS moves a short distance, the static scatterers change slowly compared with the receivers motion. Previous research revealed that, because the BS and main scatterers(e. g. , outdoor buildings and indoor walls) are static, some variables in the multi-path channel model mainly affected by static scatterers can be considered static[22,23]. Thus, the number of static scatterers and the DOAs of rays from these scatterers, which are referred to as static parameters, stay relatively invariable. Thus, L and θl are usually static and τl and βl can also be regarded as static. The velocity of the receiver ν is a dynamic parameter and needs to be obtained with GPS (Global Positioning System) or other speed testing instruments. Finally, the disturbing CSI refers to the reflection, diffraction, refraction and scattering caused by scatterers that appear randomly and may only affect part of the multiple paths. As mentioned above, the main static scatterers like buildings or walls are much taller than the MS. Thus, the power of the paths caused by the static scatterers is much stronger than that of the single-reflection signal from random scatterers. Therefore, Eq. (1)can be approximately expressed as

h(τ;t)=l=1Lβl(t)δ(ττl)+l=L1+1L2+L1βl(t)δ(ττl),     (6)

where L1 is the number of MPCs of the static scatterers and L2 is the number of MPCs of the random scatterers. Static scatterers make up most of the CSI. Luckily, they are static, so they can be extracted precisely via repeated prior measurements. This is the“feature”of the channel in each specific real propagation scenario. When the feature of the channel is obtained, the pilot can be designed at the main static time and frequency positions. This means that we already know the basic structure of the wireless channel and can put the pilot at a position where the wireless channel parameters have a large role. Tong et al. [24] showed that a good pilot design can help obtain a more accurate CSI.

The tomographic channel model consists of static, dynamic, and disturbing channel information. Static channel information is CSI caused by static scatterers and can be extracted from a large amount of existing CSI data offline. The motion of the MS leads to dynamic channel information. The velocity of the MS can be determined with the help of GPS or other speed-testing instruments. Thus, the dynamic channel information can be calculated. Because paths caused by random scatterers seem rare and weak in some propagation scenarios, disturbing channel information can be ignored or estimated by a smaller pilot in practice.

Fig.1 shows our online-offline scheme to acquire CSI. First, the static parameters can be extracted from the data of previous repeated experiments. The SAGE(Space Alternating Generalized ExpectationMaximization)algorithm is used to restore the multipath parameters; then, the GMM is applied to obtain the distribution of the multi-path parameters based on the EM principle. The above process is carried out offline. With the distribution of the multi-path parameters, prior information can help in the pilot design. Second, we can obtain the velocity information of the receiver with GPS or other speed-testing instruments. With the velocity information, the dynamic part can be calculated. Finally, the disturbing CSI can be estimated with a well-designed pilot. Thus, the full CSI should be acquired based on the above three steps. Note that the static multi-path information is additive and the dynamic part is multiplicative. The disturbing CSI is always considered additive.

Figure 1

Figure 1   The scheme to acquire CSI in the tomographic channel model


3 Scheme to acquire CSI with the tomographic channel model

3. 1 SAGE algorithm

The SAGE algorithm is often applied to parameter extraction from channel measurement data. As an extension of the EM algorithm, the SAGE algorithm classifies parameters (e. g. , time delay, amplitude, Doppler frequency, AOA)that need to be estimated into subsets to improve the convergence speed compared with the EM algorithm[25].

The vector Θl=(βl, θl, τl, ϕl)T is the set of wireless channel parameters. The SAGE algorithm classifies Θl into S subsets {Θs, s=1, …, S}. Thus, the log-likelihood function can be expressed as[25]

Λ(Θs;xls)=1N0[2D0Real{H(Θs|Θ^s¯,t)H}xls(t)dt

D0H(Θs|Θ^s¯,t)2dt],     (7)

(Θs^)ML(xls)=argmax{Λ(Θs;xls)},     (8)

xl is the observable data. xls is the admissible hidden-data space with respect to Θs in the lth path. N0 is the noise variance. D0 is the interval of observation. Θ^s¯ is the given complementary set of Θs in Θl.

The M L estimate of the subsets Θs instead of Θl is needed to improve the convergence speed. L is determined by the MDL (Minimum Description Length)criterion. With L is identified, βl, θl, τl, φl can be estimated off-line via SAGE algorithm. This algorithm estimates and updates Θl iteratively until it achieves convergence.

3. 2 Gaussian mixture model

The GMM is a popular parametric density model for approximating arbitrary continuous multivariate PDFs (Probability Density Functions) in unsupervised learning or clustering procedures[26,27]. The GMM is made up of K multivariate Gaussian distributions known as mixture components. Each component has its own {µk, Σk}, k =1, …, K, where µk is the mean value vector and Σk is the covariance matrix of the multivariate Gaussian component. Each mixture component also has an associated nonnegative mixing weight πk. The PDF (Probability Density Function)for GMM can be expressed as

p(x)=k=1KπkN(x|μk,Σk).     (9)

Θ=(β, θ, τ, φ)T can be regarded as four dimensional variables. K ≈L. The PDF of Θ can be expressed as

p(Θ)=k=1LπlN(Θ|μl,Σl).     (10)

3. 3 EM algorithm

The next step is to estimate{πl, µl, Σl}to maximize the log-likelihood function i=1Np(Θi). The EM algorithm can be carried out as follows[28]:

Expectation step: estimate the probability of Θi generated by the lth mixture Gaussian component.

γ(i,l)=πlN(Θi|μl,Σl)i=1LπjN(Θi|μj,Σj),   (11)

πj, µj, Σj, j=1, …, L need to be initialized.

Maximization Step: Estimate the πl, µl, Σl.

μl=1Nli=1Nγ(i,l)Θi,     (12)

Σl=1Nli=1Nγ(i,l)(Θaiμl)(Θiμl)T,    (13)

πl=NlN,    (14)

N is the number of training multi-path data sets. Nl=i=1Nγ(i,l).l, µl, Σl} can be estimated iteratively.

4 Channel recognition via CNN

Deep learning enjoys excellent performance in automatic speech recognition, face recognition, and text recognition. Deep learning enables an artificial neural network to automatically extract features from the input training data and reduce the incompleteness brought by the artificial settings[29]. Fig.2 shows a DNN.

Figure 2

Figure 2   Deep neural network


A CNN is one kind of widely used deep learning structure. A CNN is characterized by consisting of one or more pairs of convolution and pooling layers. The pair of a convolution layer and pooling layer is used for feature extraction[30]. In the convolution layer, a set of filters is applied to process small local parts of the input data. These filters are replicated along the whole input space. Neurons in the convolution layer multiply. In the pooling layer, a lower-resolution version of the convolution layer activation is generated by taking the maximum filter activation from different positions within a specified window. The number of neurons in the pooling layer stays the same. The fully connected layer combines inputs from all positions to classify the overall inputs. A CNN has three unique concepts: local filters, max-pooling, and weight sharing. The paths in wireless channel are always clustered. Thus, the nearby paths can share a similar nature and influence each other. As a result, the unique characteristics of a CNN allow it to extract the wireless channel features better than other deep learning algorithms. Different propagation environments lead to different wireless channels. Based on the tomographic channel model, the static part of the CSI data can be regarded as the wireless channel feature. The CSI data we measured under different measurement conditions may contain different wireless channel features. The datasets of multi-path parameters (e. g. , time delay, amplitude, Doppler frequency)Θ={Θ1, …, Θl, …, ΘL}, Θl = (βl, θl, τl, φl)T, l =1, …, L are the input data in the CNN for wireless channel feature extraction, as shown in Fig.3. The input data sets comprise many multi-path parameter samples. One sample is a matrix whose row vectors can be the time delay, amplitude, or Doppler frequency. The length of each row vector is the number of paths L. If L=10 and the data of the time delay, amplitude, and Doppler frequency are given, one input data sample can be a 10 × 3 matrix. The output is the number of wireless channels to which the input dataset belongs. For example, the input dataset may consist of three different wireless channels, which means that the data of the wireless parameters are obtained in three different positions or under three different measurement conditions. The output can be three bits: 001, 010, or 100.

For the feedforward pass, the convolution layer activations can be computed as a convolution-like oper ation of each filter on the lower layer bands followed by a nonlinear activation function f(Θ)[31],

Figure 3

Figure 3   CNN for multi-path data


net=k=1nwixi+b,    (15)

where xi can be any datum in Θ.

sigmoid(t)=11+et      (16)

f(Θ)=sigmoid(net)=11+exp[(i=1Nwixi+b)]   (17)

For backpropagation, the loss function is defined to estimate the prediction error. There are many kinds of loss functions. In our CNN, the loss function is defined E(Θ)as

E(Θ)=12yf(Θ)22,    (18)

where y is the correct channel number and f(Θ) is the output result of CNN.

The derivative of E(Θ) passes along the CNN from the output layer to the input layer.

Ewi=EfEnetEwi.      (19)

The parameters of CNN update in the negative gradient direction and gradually approximate the optimal solution with iterations. A backpropagation algorithm with an appropriate learning rate can guarantee convergence to the local minimum but not convergence to the global minimum.

5 Experimental evaluation

To verify the performance of the proposed method, indoor and outdoor channel measurement campaigns were carried out in and around the FIT building in Tsinghua University, Beijing. Tab.1 presents the system configuration.

Table 1   System configuration of measurement campaigns

indooroutdoor
transmitter power20 dBm20 dBm
carrier frequency900 MHz900 MHz/2.4 GHz
bandwidth20 MHz20 MHz
transmitter Antenna Height1m10 m
receiver Antenna Height1m1m
LOS/NLOSLOSLOS

New window| CSV


The indoor channel measurement campaigns were carried out in a meeting room. The room was 8. 57 m long and 3. 91 m wide, as shown in Fig.4. Tx and Rx were placed on the desk in the center of the room. The horizontal distance between Tx and Rx was 4 m. The desk was about 1 m high. The outdoor channel measurement campaigns were carried out in the small yard in the middle of the building. Tx was placed outside the window on the fourth floor at a height of about 10 m. Rx was placed in the center of the yard in the FIT building at a height of about 1 m. The horizontal distance between Tx and Rx was about 39 m. Figs. 4 and 5 show the indoor and outdoor layouts, respectively. Throughout the measurements, Tx and Rx stayed still.

Figure 4

Figure 4   Layout of the indoor experiment environment


Figure 5

Figure 5   Layout of the outdoor experiment environment


GMM-EM was applied to obtain the PDF of wireless channel parameters. Based on the channel feature, fewer but redesigned pilots were used for channel estimation. There were 64 pilot symbols in the original system. Based on the distribution of the paths, 48/32/24/16 pilot symbols were used. The normalized mean square error (NMSE) was used as an evaluation standard to compare channel estimation results with the original 64 pilot symbols and with the redesigned 48/32/24/16 pilot symbols.

NMSE=10lgk=1K|h(k^)h(k)|2k=1K|h(k)|2,    (20)

h(k)is the real channel response estimated with original 64 pilot symbols. h(k^) is the channel response estimated with redesigned 48/32/24/16 pilot symbols. K is the number of sample data.

Figs. 6 and 7 show the CDF of NMSE with the redesigned 48/32/24/16 pilot symbols in the indoor and outdoor measurements. The goal of channel estimation with prior information extracted from the huge amount of existing CSI data was achieved. Fewer pilot symbols reduced the performance. Thus, there is a tradeoff between the accuracy of the channel estimation and the pilot cost. The outdoor propagation environment changed in a more complex manner than the indoor propagation environment. The proposed scheme performed better in the indoor propagation environment.

Figure 6

Figure 6   CDF of NMSE in the outdoor measurement


Figure 7

Figure 7   CDF of NMSE in the indoor measurement


The channel recognition performance via the CNN was astonishing. Datasets of three different wireless channels were input to the input layer: the outdoor channel (channel number: 100) with a carrier frequency of 900 MHz, the outdoor channel (channel number: 010) with a carrier frequency of 2. 4 GHz, and the indoor channel(channel number: 001)with a carrier frequency of 900 MHz. The rest of the sys tem configuration was the same as that presented in Tab.1. Tab.2 presents the channel recognition results.

Table 2   Channel recognition result via CNN

hiddenlayerswidthepochsalphaerror
CNN166,12,241 0000.10.039 2
CNN246,121 0000.10.044 2
CNN346,1210010.078 3
CNN446,121000.10.045 2
CNN546,121000.0010.259 0
CNN648,1610010.058 2

New window| CSV


hiddenlayers represents the number of hidden layers in the CNN. As noted above, the hidden layers in a CNN come in pairs(C layer and S layer). Thus, hiddenlayers = 4 means that there are two pairs of convolution and sub-pooling layers. width means the number of connections in the convolution layer. epochs can be regarded as the training times. alpha is the learning rate with the range of (0, 1). The comparison between CNN1 and CNN2 showed that the deeper CNN performed better. The comparison between CNN2 and CNN4 shows that training a CNN more improves its performance. The comparison between CNN3 and CNN6 shows that the wider CNN performed better. CNN3, CNN4, and CNN5 showed that alpha cannot be too big or too small.

6 Conclusions and future work

This study focused on wireless channel feature extraction and recognition via a machine learning algorithm. A GMM was applied to acquire the distribution of wireless channel parameters, and a CNN was applied to distinguish different wireless channels automatically. The results based on indoor and outdoor channel measurement data showed that these machine learning algorithms can enjoy an astonishing level of performance in wireless communication. The contribution of this study is the application of a CNN to dealing with multi-path data. The numerical results help prove the concept of the wireless channel feature and the idea of the tomographic channel model. The recognition of different wireless channels can help with applying prior information to guide the pilot design and save pilot resources.

Research and development of wireless big data is actively ongoing. Most wireless big data research is on the huge amount of service data, and few studies have focused on channel data. Most research has focused on the challenges of fast-growing big data in wireless networks, or“wireless for big data”. In this study, we attempted to apply machine learning to wireless communication, which we term“big data for wireless”.

In the future, our main focus will be carrying out more channel measurement campaigns in different typical propagation scenarios to build a sufficiently complete CSI database. More deep learning algorithms can be applied to deal with wireless communication problems with the database, such as the construction of the magnetic map.

The authors have declared that no competing interests exist.

Reference

G. E. Athanasiadou , A. R. Nix .

A novel 3-D indoor raytracing propagation model: the path generator and evaluation of narrow-band and wide-band predictions

[J]. IEEE transactions on vehicular technology, 2000, 49(4): 1152-1168.

[Cited within: 2]

J. Poutanen .

Geometry-based radio channel modeling: propagation analysis and concept development

[D]. Helsinki: Aalto University, 2011.

[Cited within: 1]

R. Verdone , A. Zanella .

Pervasive mobile and ambient wireless communications: COST action 2100

[M]. Springer Science & Business Media, 2012.

[Cited within: 1]

J. Meinilä , P. Kyösti , T. Jämsä , et al.

WINNER II channel models

[C]// Radio Technologies and Concepts for IMT-Advanced,Wiley, 2009: 39-92.

[Cited within: 1]

J. Meinila , P. Kyosti , L. Hentila , et al.

D5.3: WINNER+ final.channel.models

Wireless World Initiative New Radio WINNER, 2010.

[Cited within: 1]

S. Jaeckel , L. Raschkowski , K. Börner , et al.

QuaDRiGa-Quasi deterministic radio channel generator,user manual and documentation

[R]. Fraunhofer Heinrich Hertz Institute, 2014.

[Cited within: 1]

A. Maltsev , V. Erceg , E. Perahia , et al.

Channel models for 60 GHz WLAN systems

[EB/OL]. IEEE doc, 2010: 802.11-09.

[Cited within: 1]

J. Karedal , F. Tufvesson , N. Czink , et al.

A geometrybased stochastic MIMO model for vehicle-to-vehicle communications

[J]. IEEE transactions on wireless communications, 2009, 8(7): 3646-3657.

[Cited within: 1]

J. Medbo , K. Börner , K. Haneda , et al.

Channel modelling for the fifth generation mobile communications

[C]// The 8th European Conference on Antennas and Propagation (EuCAP),The Hague,Netherlands, 2014: 219-223.

[Cited within: 1]

C. X. Wang , A. Ghazal , B. Ai , et al.

Channel measurements and models for high-speed train communication systems: a survey

[J]. IEEE communications surveys &tutorials, 2016, 18(2): 974-987.

[Cited within: 1]

J. Z. Wang , H. L. Zhu , N. J. Gomes .

Distributed antenna systems for mobile communications in high speed trains

[J]. IEEE journal on selected areas in communications, 2012, 30(4): 675-683.

[Cited within: 1]

B. H. Chen , Z. D. Zhong , B. Ai , et al.

Channel characteristics in high-speed railway: a survey of channel propagation properties

[J]. IEEE vehicular technology magazine, 2015, 10(2): 67-78.

[Cited within: 1]

M. Metis .

Wireless communications enablers for the Twentytwenty Information Society EU 7th Framework Programme project

[R]. ICT-317669-METIS.

[Cited within: 1]

M. Fallgren , B. Timus .

Future radio access scenarios,requirements and KPIs

[J]. METIS deliverable D, 2013, 1.

[Cited within: 1]

A. Osseiran , F. Boccardi , V. Braun , et al.

Scenarios for 5G mobile and wireless communications: the vision of the METIS project

[J]. IEEE communications magazine, 2014, 52(5): 26-35.

[Cited within: 1]

X. H. Wang , E. Anderson , P. Steenkiste , et al.

Improving the accuracy of environment-specific channel modeling

[J]. IEEE transactions on mobile computing, 2016, 15(4): 868-882.

[Cited within: 1]

X. H. Wang , E. Anderson , P. Steenkiste , et al.

Improving the accuracy of environment-specific vehicular channel modeling

[C]// The 7th ACM International Workshop on Wireless Network Testbeds,Experimental Evaluation and Characterization,Istanbul,Turkey, 2012: 43-50.

[Cited within: 1]

M. Boban , J. Barros , O. K. Tonguz .

Geometry-based vehicle-to-vehicle channel modeling for large-scale simulation

[J]. IEEE transactions on vehicular technology, 2014, 63(9): 4146-4164.

[Cited within: 1]

J. Medbo , H. Asplund , J. E. Berg , et al.

Directional channel characteristics in elevation and azimuth at an urban macrocell base station

[C]// The 6th European Conference on Antennas and Propagation (EUCAP),Prague,Czech Republic, 2012: 428-432.

[Cited within: 1]

H. H. Li , Y. Z. Li , S. D. Zhou , et al.

A novel method to obtain CSI based on Gaussian mixture model and expectation maximization

[C]// The 8th International Conference on Wireless Communications & Signal Processing (WCSP),Yangzhou,China, 2016: 1-5.

[Cited within: 1]

D. Tse , P. Viswanath .

Fundamentals of wireless communication

[M]. Cambridge: Cambridge University PressPress, 2005.

[Cited within: 1]

Y. Zhang , S. J. Liu , Y. H. Rui , et al.

Wideband wireless channel predictor relying on environment information

[C]// The 8th International Symposium on Antennas,Propagation and EM Theory (ISAPE),Kunming,China, 2008: 945-948.

[Cited within: 1]

Y. Zhang , S. J. Liu , Y. H. Rui , et al.

Channel prediction assisted by radio propagation environments information

[C]// The 4th IEEE International Conference on Circuits and Systems for Communications (ICCSC),Shanghai,China, 2008: 733-736.

[Cited within: 1]

L. Tong , B. M. Sadler , M. Dong .

Pilot-assisted wireless transmissions: general model,design criteria,and signal processing

[J]. IEEE signal processing magazine, 2004, 21(6): 12-25.

[Cited within: 1]

B. H. Fleury , P. Jourdan , A. Stucki .

High-resolution channel parameter estimation for MIMO applications using the SAGE algorithm

[C]// International Zurich Seminar on Broadband Communications AccessTransmission-Networking,Zurich,Switzerland, 2002: 30-30.

[Cited within: 2]

Z. Zivkovic , .

Improved adaptive Gaussian mixture model for background subtraction

[C]// The 17th International Conference on Pattern Recognition(ICPR),Cambridge,UK, 2004, 2: 28-31.

[Cited within: 1]

T. Toda , A. W. Black , K. Tokuda .

Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model

[J]. Speech communication, 2008, 50(3): 215-227.

[Cited within: 1]

T. K. Moon .

The expectation-maximization algorithm

[J]. IEEE signal processing magazine, 1996, 13(6): 47-60.

[Cited within: 1]

Y. Bengio .

Learning deep architectures for AI

[J]. Foundations and trends in Machine Learning, 2009, 2(1): 1-127.

[Cited within: 1]

WEBSTER M . Phishing[EB/OL]. . 2007.

URL     [Cited within: 1]

Y. LeCun , Y. Bengio .

Convolutional networks for images,speech,and time series

[C]// The handbook of brain theory and neural networks,MIT Press,Cambridge,USA, 1998: 255-258.

[Cited within: 1]

/