Abstract
In cellular communication systems with frequency reuse factor of one, user terminals (UT) at the celledge are prone to intercell interference. Joint processing is one of the coordinated multipoint transmission techniques proposed to mitigate this interference. In the case of centralized joint processing, the channel state information fed back by the users need to be available at the central coordination node for precoding. The precoding weights (with the user data) need to be available at the corresponding base stations to serve the UTs. These increase the backhaul traffic. In this article, partial joint processing (PJP) is considered as a general framework that allows reducing the amount of required feedback. However, it is difficult to achieve a corresponding reduction on the backhaul related to the precoding weights, when a linear zero forcing beamforming technique is used. In this work, particle swarm optimization is proposed as a tool to design the precoding weights under feedback and backhaul constraints related to PJP. The precoder obtained with the objective of weighted interference minimization allows some multiuser interference in the system, and it is shown to improve the sum rate by 66% compared to a conventional zero forcing approach, for those users experiencing low signal to interference plus noise ratio.
Keywords:
coordinated multipoint; joint processing; particle swarm optimization; precoding; stochastic optimization.1 Introduction
Future cellular communication systems tend to be spectrally efficient with a frequency reuse factor of one. The aggressive reuse of frequency resources causes interference between cells, especially at the celledge. Therefore, the user experience is affected and the performance of such systems is interference limited. To overcome this problem, coordinated multipoint (CoMP) transmission/reception is proposed [1]. Joint processing (JP) is one of the techniques that falls into the framework of CoMP transmission. In the downlink, JP involves the coordination of base stations (BSs) such that the interfering signals are treated as useful signals when transmitting to a user terminal (UT). Note that this technique was previously referred to as network coordination [2].
For JP, UTs need to feed back the channel state information (CSI) of their BSUT links. In centralized joint processing (CJP), the CSI is collected at a node in the network called central coordination node (CCN), to form an aggregated channel matrix [3,4]. The CCN can be treated as a logical node that can be implemented at a BS. Based on this aggregated channel matrix, the CCN obtains the precoding weights, consisting of the beamforming weights after power allocation. These precoding weights need to be available along with the user data at the corresponding BSs to control interference via JP. In this work, the backhaul traffic mainly comprises of transporting the CSI coefficients from the cooperating BSs to the CCN, the precoding weights from the CCN to the cooperating BSs and the user data. We restrict the definition of the backhaul load as transporting the precoded weights from the CCN to the cooperating BSs. The feedback load is the traffic due to the CSI forwarding from UTs to the BSs. These definitions are illustrated in Figure 1. Along with the user data, this traffic poses tremendous requirements on the network backhaul [46]. It also imposes delay constraints due to nonstationary channels, but the delay constraints are beyond the scope of this work.
Figure 1. An illustration of the feedback load comprising of the CSI coefficients from the UT to the BS. The backhaul load consists of the precoding weights from the CCN to the BS. The equivalence can be seen in the number of CSI coefficients, h_{1}, h_{2}, h_{3 }and the number of precoding weights w_{1}, w_{2}, w_{3 }for a given UT. The user data is assumed to be routed based on the nonzero precoding weights at the CCN.
One of the approaches to alleviate the complexity requirements in JP is to arrange the BSs in clusters [3]. The BSs involved in JP within a cluster control the intracluster interference, while the BSs belonging to neighboring clusters give rise to intercluster interference. In a static clustering approach the cooperating set of BSs does not change with time, but this can create unfairness for UTs on the cluster edge. Hence, dynamic clustering helps in maintaining fairness among UTs. An example of dynamic clustering could be a family of clusters operating in round robin fashion where each cell takes its turn to be at the cluster boundary. Clustering techniques can also be divided into usercentric or networkcentric depending on where the clustering decision is taking into account the UT determined channel conditions. Since CJP implies full cooperation, it requires extensive feedback and backhaul resources in the cooperative cluster. In order to bring JP close to realistic scenarios, one can further reduce the complexity for a given cluster through suboptimal approaches.
Several such approaches have been considered in the literature to reduce the requirements of CJP, such as limited feedback [7,8] and limited backhauling [5,6,9,10]. Partial joint processing (PJP) is a general framework aiming to reduce the complexity requirements of CJP, basically the feedback and backhaul load. In the particular PJP approach considered in this article, a CCN or the serving BS instructs the UTs to report the CSI of the links in the cluster of BSs whose channel gain fall within an active set threshold or window, relative to their best link (usually the serving BS) [7]. This is summarized in Algorithm 1. Note that a similar approach is used in [8]. PJP can be regarded as a usercentric clustering when it is implemented over a static cluster, since overlapping subclusters or active sets of BSs are dynamically formed. Note that CJP is a particular case of PJP when the threshold tends to infinity.
Algorithm 1: Active set thresholding for limited feedback based on [7]
1: Choose: threshold = 10 dB
2: for each UT do
3: Measure the channel gain from all BSs
4: bestLink = max{channel strength from all BSs}
5: if (bestLink  otherLink) ≤ threshold then
6: UT feed backs the CSI of otherLink
7: CCN marks this link as active
8: else
9: Feedback load reduction:
10: UT does not feed back the otherLink
11: CCN marks this link as inactive
12: end if
13: UT feeds back the bestLink
14: CCN marks this link as active
15: end for
In PJP, the CSI of the links reported by the UTs to the CCN are marked as active links and those not reported are marked as inactive. Based on these, the CCN forms an aggregated channel matrix for interference control, where the coefficients of the inactive links are set to zero. In this article, the CCN identifies the BSs that fall outside the threshold window for a given UT based on the links for which the UT has not reported CSI. It is assumed that the obtained CSI is error free. Protocol aspects of this communication need to be addressed in more detail in a real system implementation. As a result, the aggregated channel matrix is now sparse. Linear techniques such as zero forcing (ZF) can invert the aggregated channel matrix to remove interference, but these techniques fail to invert a sparse aggregated channel matrix and at the same time reduce the backhaul load, such that only the BSs in the active set of a UT receive the precoding weights [9].
The question thus arises, in the PJP framework, can the gains achieved with CSI feedback load reduction translate to an equivalent backhaul load reduction, in the sense that the number of CSI coefficients constituting the feedback load (assuming a single tap channel for simplicity) is the same as that of the precoding weights in the backhaul (Figure 1 illustrates this notion). Particle swarm optimization (PSO) is proposed in this article as a tool to obtain a solution that fits this requirement, since it can find the precoding weights without actually inverting the sparse aggregated channel matrix.
1.1 State of the art techniques
Precoding design for clustered scenarios under JP is a recent problem. In [11], a large network is divided into a number of disjoint clusters of BSs. Linear precoding is carried out within these clusters to suppress intracluster interference as well as intercluster interference. In the case of overlapping clusters, soft interference nulling (SIN) precoding technique is proposed in [12]. For SIN, the complete CSI is available at all BSs and the user data is made available only to the BSs in the coordination cluster. Hence, the BSs can jointly encode the message for transmission. Moreover, in [12], multiple spatial streams are allowed up to the total number of transmit antennas in the coordination clusters. As the exhaustive search for the best clustering combination has a very high complexity, two simple clustering algorithms are proposed in [12]. They are: (a) nearest bases clustering and (b) nearest interferers clustering. The SIN iterative precoder optimization algorithm does not remove the interference completely, but performs better than or equal to any linear interferencefree precoding scheme [[12], Proposition 1]. SIN precoding relaxes the restriction to have zero interference, due to that SIN precoding works even when the number of transmit antennas is less than the total number of receive antennas within a coordination cluster. It should be noted that SIN achieves backhaul reduction in terms of the precoded weights and user data being available at BS where needed, but it does not provide feedback load reduction.
For JP, as long as the aggregated channel matrix at the CCN is well conditioned for inversion, linear ZF beamforming (BF) techniques can be used for interference control. It has been shown in [9] that when using techniques that achieve CSI feedback reduction, such as active set thresholding in PJP, this reduction does not translate to an equivalent backhaul load reduction with the linear ZF BF. When calculating the ZF BF based on the sparse aggregated channel matrix, a link that has been defined as an inactive link may be mapped with a nonzero BF weight for that link. This causes unnecessary backhauling, since the UT has reported that link as inactive and that BS is already outside the active set of that UT. Instead, the BS could use this resource to serve another UT. An intuitive approach could be that the CCN resorts to nulling the BF weights where the links are expected to be inactive. This is a suboptimal solution. In [13], a partial ZF precoding design is proposed based on [14] to remove the interference in a PJP scenario. This solution performs better than the linear ZF BF with a weight nulling assumption, and works even for a sparse aggregated channel matrix at the CCN, but it does not achieve an equivalent backhaul load reduction. On the other hand, there is no linear technique in the literature that can invert the sparse aggregated channel matrix and preserve the zeros in the transposed version of the inverse, when the aggregated channel matrix is not diagonal or blockdiagonal.
To the best of our knowledge, the problem of backhaul load reduction equivalent to feedback load reduction has only been addressed in [9], where two solutions are proposed. One based on scheduling (medium access controlMAC layer approach) and a second one based on a ZF precoding PHY layer approach. The limitations of this approach are discussed in Section 2.2.
1.2 Contributions
The active set thresholding technique in PJP (limited feedback of CSI) is used to achieve the feedback load reduction and these gains need to be preserved with an equivalent backhaul load reduction (limited backhauling of precoding weights). To achieve this, a stochastic optimization algorithm such as PSO is proposed for precoding design. PSO has been shown to obtain the optimal linear precoding vector, aimed to maximize the system capacity in a multiusermultiple input multiple output (MUMIMO) system [15]. The main distinguishing factor of our article compared to [15] is that the PSO is used for designing the precoder under a multicell setting with PJP. PSO has also been proposed as a tool for a scheduling strategy in a MUMIMO system [16]. Recently, a multiobjective PSO has been proposed for accurate initialization of the channel estimates in a MIMOOFDM iterative receiver [17]. Drawing inspiration from [1517], and combining the state of the art PSO implementation with expendable parallel computing power at the CCN, a PSO based precoder should be feasible for the scenario under consideration.
In this article, two objectives are studied using PSO. They are:
(1) Weighted interference minimization: Minimize the interference for the UTs and improve the UT experiencing the minimum signal to interference plus noise ratio (SINR).
(2) Sum rate maximization.
In addition, to fairly compare the linear ZFbased precoder and the proposed PSObased precoder, the use of perturbation theory and Gershgorin's discs is introduced. These discs can be used to obtain a quick graphical snapshot of the intracluster interference remaining in the system. The sum rate bounds under a constrained backhaul and imperfect channel knowledge are important [18] and it is part of our future work.
The article is organized as follows. The system model and the limitations in the state
of the art linear solutions are discussed in Section 2. The PSO as a tool for precoder
design is presented in Section 3. In this section, the objective function, the termination
criteria, the convergence of PSO and the complexity in terms of the big
Notation: The boldface uppercase letters, boldface lowercase letters and italics such as X, x and x denote matrices, vectors and scalars, respectively. The ℂ^{m×n }is a complex valued matrix of size m×n. The (·)^{H }is the conjugate transpose of a matrix.  · _{F }is the Frobenius Norm, diag(A) and OffDiag (A) are the diagonal and offdiagonal elements of the matrix A. Block diagonalizing the matrices A and B is denoted as blockdiag(A, B). The ith row and the jth column of a matrix A is represented as A(i, j). To access all the elements of the ith row of a matrix A is A(i, :) and for the jth column is A(:, j). vec(A) is the vector of stacked columns of matrix A. ℜ{A(i, j)} and ℑ{A(i, j)} are the real part and the imaginary parts of A(i, j), respectively. H and
2 System model
Consider the downlink of a static cluster of K BSs with N_{T }antennas serving M single antenna UTs [7]. In this model, the intracluster interference caused due to the transmission to the UTs located at the cluster center is considered, as shown in Figure 2. For simplicity, the intercluster interference is assumed to be negligible. Assuming CJP between BSs in the cluster, the discrete time signal received at M UTs, is y ∈ ℂ^{M×1 }is
Figure 2. The cluster layout. The hexagon in the middle denotes the cluster area under consideration where the UTs are located at the cluster center.
The aggregated channel matrix available in the CCN is
In the case of ZF BF, the constraint K · N_{T }≥ M needs to be satisfied to maintain orthogonality between UTs [19]. The matrix W is then obtained by taking the MoorePenrose pseudoinverse of H as
Each BS is constrained to a maximum transmit power, P_{max}. The suboptimal power allocation based on [20] is performed for ZF under perBS power constraints, where at least one of the BSs is transmitting at maximum power, and it is defined as
where
and the sum rate per cell in terms of bits per second per Hertz per cell (bps/Hz/cell) is given as
2.1 Linear beamforming
As stated in Section 1, the link for which the CSI is reported to the CCN is marked as an active link and the unreported CSI is marked as an inactive link. These active and inactive links can be represented with a binary matrix of size M × K. The (m, k)th element in this matrix corresponds to the link between the mth UT and the kth BS. An active link is represented with '1' and an inactive link is represented with '0'.
In Equation (2), the linear ZF BF completely removes the interference by inverting
the aggregated channel matrix H. With small active set thresholds, there are few active links, forming a sparse aggregated
channel matrix
In this article, the following ZF scenarios are considered, where the ZF is performed using the pseudoinverse as in Equation (2) on the aggregated channel matrix at the CCN. The main focus is on the ZF with limited feedback (LFB) and limited backhauling (LBH), where the gains of feedback load reduction need to be preserved in the backhaul load reduction. This is denoted as ZF:LFB + LBH. The LFB is achieved based on the active set thresholding technique. The LBH with ZF is achieved with an intuitive approach of nulling of the BF coefficients based on the inactive links in the binary matrix. When the UT is allowed to feed back all the CSI (full feedback, FFB) and allowing full backhauling (FBH), it is represented as ZF:FFB + FBH. This scenario is considered to show the upper bound of the ZF technique, as in the case of CJP. The scenario ZF with FFB and LBH is considered to have a similar configuration as that of the SIN precoding technique [12]. This is denoted as ZF:FFB + LBH. Finally, the scenario ZF with LFB and FBH is considered, similar to that considered in [9], where the ZF is allowed to have the precoded weights at BSs where it is not desired and allowing FBH. This is represented as ZF:LFB + FBH. It should be noted that this approach does achieve some backhaul reduction, but not necessarily equivalent to the feedback load.
2.2 Limitations of the state of the art
The following subsections capture the limitations with the state of the art solutions.
2.2.1 The invertibility of the aggregated channel matrix
To maintain the orthogonality between the UTs, as highlighted earlier, the condition K · N_{T }≥ M needs to be satisfied. Due to this, the number of columns of the matrix H is always greater or equal to the number of rows, and the only way to invert the aggregated channel matrix is by using the right inverse as shown in Equation (2). The invertibility of the linear ZF BF is limited by the ability to invert (HH^{H})^{1 }or in other words, the rank of HH^{H }should be equal to the number of UTs, whose channels are linearly independent.
In the PJP framework, the active set threshold can be increased such that the UTs can feed back the CSI of any additional BSs that fall within this window, thereby increasing the chances of inverting the aggregated channel matrix as proposed in [13]. The worst case could be that the UTs would need to feed back the complete CSI from all the BSs like in the case of CJP. The CCN can now invert the aggregated channel matrix to obtain the BF weights, but at the expense of increasing the feedback load.
2.2.2 Required nulls in beamformer
As stated before, to the best of our knowledge, to overcome the invertibility of the aggregated channel matrix and the required nulls in the BF, the MAC layer and the PHY layer approaches are proposed in [9]. These approaches are analyzed for the remaining part of this section.
In the scheduling MAC layer approach, BS subgroups are formed such that the transmission to the UTs in each time slot is disjoint, where each BS is transmitting in only one subgroup. These disjoint sets give rise to a sparse aggregated channel matrix at the CCN, which presents a blockdiagonal form. Note that the scheduling approach can be mapped to a disjoint clustering solution. This approach solves the problem of equivalent backhaul load reduction, as the inverse of a block diagonal matrix is block diagonal itself, thereby retaining the zeros or nulls in the BF weights where needed. In a given time slot, if the collocated UTs prefer services from the same set of BSs, then the MAC layer approach can only serve the UTs in a time division multiplexing fashion, as disjoint BS sets need to be selected for transmission. To guarantee fairness, such UTs will have to wait for a long time to be served.
The proposed PHY layer ZF precoding solution does not require any specific constraints on scheduling [9], and it allows the formation of overlapping clusters. The interference is reduced by formulating a constrained least squares optimization problem, whose solution is showed to be a pseudoinverse [9]. The closedform solution to find the nonzero BF weights as obtained in [[9], Eq. (29)] is
where
To illustrate the limitations in the PHY layer approach in [9], consider single antenna BSs serving single antenna UTs with the aggregated channel
matrix at the CCN as shown in Table 1. The first step is to build a block diagonal matrix as
Table 1. An example of a sparse aggregated channel matrix giving rise to an overdetermined system when PHY layer precoding is applied
The PHY layer solution does not comment on the fact that there is no PHY layer solution
without scheduling the UTs, as the invertible part of the pseudoinverse in
Due to the limitations in this closedform solution in Equation (6), a proper comparison of the proposed PSO with this PHY layer solution is generally not possible. Hence, here the PHY layer solution of [9] is not considered in the simulations. However, an interested reader can refer to [21] where the comparison is performed when [9] is feasible. In the subsequent section, PSO is presented as a tool for precoder design for backhaul load reduction equivalent to the feedback load reduction in the PJP framework.
3 PSO for precoding in the PJP framework
The PSO was inspired from the movement of a swarm, such as a shoal of fish, a flock of birds, etc, to find food or to escape from enemies, by splitting up into groups. There is no apparent leader of the swarm other than the social interactions between the bird like objects (or boids). The coherent movement of these boids is modeled based on their social interactions with their neighbors. The algorithm simulating these social aspects was simplified in [22] and it was found to perform optimization. In this article, a basic PSO algorithm [23] with inertia weight and velocity restriction is implemented and it is capable of finding a stable solution based on a given objective function.
Classical optimization methods are especially preferred when the optimization problem is known to be convex but this is not the case here. Numerical methods such as Newton's method are not feasible as the objective function is nondifferentiable. Other classical techniques could fail but PSO would always find an equilibrium/stable solution. PSO was chosen over other evolutionary algorithms, as it requires very few parameters to configure, it is easier to understand with computationally lesser bookkeeping and it fits well for reducing the backhaul load. In [23], PSO is viewed as a paradigm within the field of swarm intelligence and the performance measures of basic PSO are highlighted. This reference also provides detailed differences between PSO and other evolutionary algorithms.
In this article, each bird in a swarm carries the real and imaginary parts of the
nonzero elements of the BF matrix, i.e., the ith member of the swarm is the ith particle that carries all the (n = 2 · K · N_{T }· M) BF coefficients. The '2' is due to PSO treating the real and the imaginary part
of the complex BF coefficients as another dimension to the search space. Hence, the
particle having the best n values needs to be found for a given objective function. For example, an infinite
threshold would yield n = 2 · K · N_{T }· M nonzero CSI coefficients in the aggregated channel matrix of size [M × K · N_{T}]. With an active set threshold of 0 dB then only the best link (or reference link)
would be fed back by each UT yielding n = 2 · 1 · N_{T }· M. The real and the imaginary parts of the nonzero BF matrix,
A given objective function is evaluated for every particle i carrying the BF coefficients, and it is demapped to form the BF matrix as
The variables p and q are random numbers drawn from a uniform distribution in the interval [0, 1]. The terms involving c_{1 }and c_{2 }are called the cognitive component and the social component, respectively. The cognitive component tells how much a given particle should rely on itself or believe in its previous memory, while the social component tells how much a given particle should rely on its neighbors. The cognitive and social constant factors, c_{1 }and c_{2}, are equal to 2, as highlighted in [22]. An inertia weight, w, is used to bias the current velocity based on its previous value, such that when the inertia weight is initially being greater than 1 the particles are biased to explore the search space. When the inertia weight decays to a value less than 1, the cognitive and social components are given more attention [24]. The decaying of the inertia weight is governed by a constant decay factor β, such that w ← w · β.
The pseudocode of PSO described above is summarized in Algorithm 2.
Algorithm 2: Pseudocode for obtaining the BF via PSO. Steps 3 to 5 are only mentioned for illustration and can be avoided prior to initialization
1: Initialization:
2: Determine the number of nonzero coefficients n needed in the BF matrix,
3: Map the BF to the particle:
4:
5:
6: Stochastically initialize particles with BF coefficients:
7:
8: x_{min }= x_{max}
9: Position: X(i, j) = x_{min }+ r · (x_{max } x_{min})
10: Velocity:
11: while Termination Criterion do
12: for the ith particle in the swarm do
13: Demap the variables in a particle to form the BF matrix
14:
15: Evaluate the objective function f(X(i, :))
16: Store:
17: if f(X(i,:)) < f^{pb}(X(i,:)) then
18: Particles' Best: X^{pb}(i,:)←X(i,:)
19: end if
20: if f(X(i,:)) < f^{sb}(X(i,:)) then
21: Swarm's Best: x^{sb }← X(i,: )
22:
23: end if
24: end for
25: for Each particle in the swarm with BF coefficients do
26: Update:
27: Velocity:
28: Restrict velocity: V(i, j) < v_{max}
29: Position: X(i, j) ← X(i, j) + V(i, j) · Δt
30: end for
31: w ← w · β
32: end while
33: return BF Weight Matrix,
3.1 Objective function
The particle with the best BF coefficients is demapped to obtain the BF matrix,
In this article, two different objective functions are considered for the PSO to optimize.
3.1.1 Weighted interference minimization
Based on our experience, choosing a single direct objective function of minimizing only the interference skews the PSO algorithm to prefer only the good SINR UTs and to leave out the weak SINR UTs. This gives rise to power savings at the BS, thereby lowering the sum rate of the UTs. One can choose to maximize the weak SINR UTs but then the total interference is not taken into account. Hence, the objective function should not only minimize the interference but also improve the SINR of the weakest UT (minSINRuser). We call this objective function weighted interference minimization, where the interference is minimized with the weight of the SINR of the weakest UT in each iteration. Note that the weakest SINR UT can change in every iteration. Thus, a multiobjective function evaluated for the ith particle in every iteration is defined as
The goal of every particle is to minimize this multiobjective function iteratively. Finally, the swarm's best particle will contain the best BF that has managed to minimize Equation (9).
3.1.2 Sum rate maximization
The PSO presented in Algorithm 2 involves minimization of the objective function. Hence, to maximize the sum rate, the objective function is written as f(X(i,:)): = R_{tot}. This means that prior to evaluating the objective function, the sum rate per cell as in Equation (5) needs to be calculated for every iteration.
3.2 Termination criteria
In [23], various stopping conditions are discussed. A few of them are listed here for completeness. The algorithms can be terminated when at least one of the following conditions is triggered:
(1) Maximum number of iterations has been exceeded.
(2) A solution fulfilling a target value is found.
(3) No improvement is observed over a number of iterations.
(4) Normalized swarm radius is close to zero.
In practice, any one of the above mentioned criteria can be used for termination. In this article, the third criterion is used for termination.
3.3 Convergence
With a basic PSO, the notion of convergence means that the swarm has moved towards an equilibrium state [23]. The Lemma 14.2 in [23] shows that the basic PSO does not satisfy the convergence condition for global search. In our article, a basic PSO with basic variations such as velocity restriction and inertia weight has been used. Proving the optimality conditions of the PSO is not easy, but what can be said is that a stable solution can be achieved. Hence, suitable variations of the PSO algorithm need to be considered in future work, such as Random Particle PSO or Multistart PSO, since they satisfy the convergence condition for global search and can be considered for global optimization [23].
3.4 Computation complexity analysis
The Big
where c refers to the number of iterations in the while loop, which depends on the convergence of the algorithm. In this article, it was observed that the algorithm converges within 100 iterations with no further improvements for the case of LFB and LBH.
• Block1: The initialization of particles carrying the BF coefficients from steps 6
to 10, has a computational complexity of
• Block2: From steps 12 to 24, the complexity is
Objective function: Weighted interference minimization
 The complexity of HW is
Objective function: Sum rate maximization
 The calculation of SINR and consequently the sum rate per cell yields
Therefore, considering the worst case objective function, the complexity of Block2
is
• Block3: From steps 25 to 30, the time and space complexity can only grow with the
number of BF coefficients. Hence, the computational complexity is
Finally, the overall complexity of the PSO is
In this article, we consider M single antenna UTs and K BSs with N_{T }antennas each. As shown in Algorithm 2, the number of BF coefficients carried by a
particle is n = 2·M · K · N_{T}. Therefore, the complexity of the PSO is
4 Analysis of interference using Gershgorin's discs
In the case of ZF BF, the feasibility of the solution is determined by the ability to invert (HH^{H})^{1}. In [25,26], it is shown that any approach to improve the channel inversion must aim to reduce the effect of the largest eigen value. Another metric that has been used in the framework of ZF in a singlecell setup is the Frobenius norm of the channel H, since it is proportionally related to the link level performance as shown in [25]. Their proposed network coordinated BF algorithm combines both metrics such that the mean of the largest eigen value of (HH^{H})^{1 }should be small and the mean of the Frobenius norm of H should be large, so that SINR of the UT is large and the bit error rate is improved, respectively.
In the case of PSO, analyzing the properties of the obtained precoder via (HH^{H})^{1 }is not meaningful. To evaluate the performance of a PSObased precoder,
In the framework of perturbation theory [27], these offdiagonal elements can be seen as a perturbation over the diagonal elements
of
Applying Gershgorin's circle theorem, the matrix
where the right hand side of the inequality is the radius of the ith disc.
5 Simulation results
Consider the cluster layout in Figure 2, where K = 3 BSs with N_{T }= 3 antennas each, are serving M = 6 single antenna UTs. The UTs are uniformly dropped at the cluster center, along
an ellipse with semimajor and semiminor axis of length
where G is the gain of the antenna at the BS and C is the correlation matrix of size N_{T }×N_{T}. The simulation parameters are summarized in Table 2. Note that in order to compare PSO and ZF, only the cases providing invertibility of the aggregated channel matrix are considered. As the focus is at the cluster center along an ellipse as defined earlier, the ZF approach fails to invert only 0.22% of the time. Nevertheless, the probability for failure to invert increases as the UTs move closer to a BS as shown in [13] with a realistic WINNER II channel model (scenario B1, urban microcell, nonline of sight). But, PSO would still be able to find a solution when ZF fails. The parameters governing PSO are summarized in Table 3.
Various configurations of the ZF and PSO precoders are considered for comparison. They are summarized in Table 4. A simple power allocation is performed as per Equation (3). In case of ZF, the power constraint is always applied after the pseudoinverse, and for the scenarios involving PSO, the power constraint is applied in every iteration or after convergence, (refer to Section 3.1 for a more detailed explanation). To obtain a better equilibrium, the solution obtained from PSO with FFB and LBH is fed to one of the particles in the PSO with FFB and FBH during the stochastic initialization stage, for PwrAdj and NoPwrAdj cases, respectively. The scenario PSO:FFB + LBH is considered to simulate the same environment as that of SIN precoding and to compare the same with the corresponding ZF scenario. For the cases of LFB or LBH, an active set threshold of 10 dB was pessimistically considered. This value was decided based on a recent study [29], in which it was found that it is difficult to jointly estimate the channels for a UT with an active set threshold greater than 15 dB at the celledge. Also in [[29], Figure 4.19], the 10 dB threshold defines a cooperation area that is more focused on the cluster center, while a 15 dB threshold considers the cluster center and celledges as well.
Table 4. Various precoding configurations
Figure 3 shows the rate at which the average number of CSI coefficients are fed back (LFB) and the rate at which the average number of precoding weights are backhauled (LBH) for various active set thresholds. In particular, for an active set threshold of 10 dB, the feedback rate, f_{r }due to the CSI coefficients of all the M = 6 UTs is 587.3 kbps, which is calculated as, assuming that every complex coefficient takes 16 bits for quantization and a scheduling interval of 1 ms (LTE). Likewise, following a similar approach, the backhauling rate, b_{r}, due to precoding weights with PSO is 587.3 kbps. Hence, the feedback load reduction is equivalent to the backhaul load reduction. In case of a ZF approach, precoders that show zeros for nulls in the beamformer have a higher rate of 859.4 kbps, thereby increasing the backhaul load. Relaxing the null constraint for the ZF approach by treating a threshold of less than 20 dB as a null in the BF, still yields a higher backhaul rate of 759.3 kbps. The reduction in the backhaul load in terms of precoding weights also translates to the reduction in the user data distribution in the backhaul, as the user data can be selectively routed to a given BS based on the nonzero precoding weights. It should be reiterated here that the ZF approach could have nulled the weights when the BSs required them and thereby reducing the sum rate. This is not accounted in this figure. Hence, for a given active set threshold, PSO achieves the exact bound for the backhaul load being equivalent to the feedback load. Note that in a wideband system, the CSI would be estimated and fed back based on the pilot positions. The estimated CSI would be interpolated for a group of subcarriers, as they are smaller than the coherence bandwidth of the channel and thus this group of subcarriers would experience flat fading. The precoding weights obtained at the CCN would be based on the estimated CSI and could be applied over this group of subcarriers. Hence, every CSI coefficient fed back by the UT still would map to a corresponding precoding weight. However, with a ZF approach, the user data being routed at the CCN, as shown in Figure 1, would cause a substantial and unnecessary increase in the backhaul. This could be avoided with the proposed PSO. It should be noted that in Figure 3 the backhaul rate, b_{r}, (due to precoding weights) does not include the user data rate. The user data rate would be several orders of magnitude larger than the feedback rate, and could be proportionally reduced with selective routing as described above.
Figure 3. Rate of the average number of CSI feedback coefficients and the average number of precoding weights in the backhaul. These need to be transmitted for various active set thresholds for all the 6 UTs is shown in the above figure. User data is routed at the CCN based on the nonzero precoding weights which also translates to the reduction in the user data paths in the backhaul.
5.1 Objective function: weighted interference minimization
Particle swarm optimization with LFB and LBH performs better than ZF under comparable configurations. The cumulative distribution function (CDF) of the sum rate per cell is shown in Figure 4. The PSO with LFB and LBH performs better than the ZF with LFB and LBH by 66.53% on average. PSO with LFB and LBH also performs better than ZF with FFB and LBH by 43.73% on average. The ZF with FFB and FBH performs better than PSO with FFB and FBH with PwrAdj in every iteration, but PSO with NoPwrAdj shows the best average sum rate compared to the other scenarios considered. This is primarily due to the fact that PSO with NoPwrAdj effectively uses the BS peak power constraint. The ZF with LFB and FBH (without backhaul load reduction) performs better than the ZF with FFB and LBH (with backhaul load reduction). This is similar to the results observed in [9], since the signals received by the UT from BSs outside the active set are seen as desired signals and thus help the UTs to accumulate more energy, but it leads to unnecessary backhauling. Due to this minor gain and the undesired additional backhauling, this scenario, ZF with LFB and FBH, is not considered in the following plots.
Figure 4. CDF of the sum rate. PSO with objective function: weighted interference minimization. The value in the legend denotes the mean value μ in a given CDF.
Alternately, PSO with the objective of maximizing the minimum SINR of the UT was simulated. In the case of LFB and LBH, a 2.1% relative increase in the average sum rate per cell was observed when compared to weighted interference minimization but at the cost of 7.7% relative increase in BSs power consumption and 45% relative increase in interference. As expected interference is greatly affected, hence, weighted interference minimization is preferred.
PSO utilizes the available transmit power constraint of P_{max }per BS more effectively, and at the same time, it improves the weakest SINR UT. The CDF of the SINR of any UT for various precoding algorithms in any channel realization is shown in Figure 5 with reasonable improvement in the SINR of the weakest UT (the lower part of the CDF). There is an improvement of 2.97% in the average SINR of PSO compared to ZF, under the same conditions of LFB and LBH. We define the SINR difference as [ΔSINR]_{dB }= [SINR_{m}]_{dB}[SINR_{m'}]_{dB}, where UT m, m ≠ m', experiences the best SINR while UT m' experiences the best SINR while UT m' experience the worst SINR in a given channel realization. The CDF of this SINR difference is shown in Figure 6. With this objective function, where the worst SINR UT is taken care explicitly, the PSO has a much lower variance compared to the ZF approach. As expected, the ZF approach with FFB and FBH has all the UTs with equal SINR, hence the difference is zero. It is interesting to note that PSO with FFB and LBH with PwrAdj and NoPwrAdj are nearly 15 dB apart in the SINR difference between the best and the worst UT. This is because in the case of NoPwrAdj, applying Equation (3) disfigures the BF weights obtained from the PSO after convergence.
Figure 5. CDF of the SINR of any UT for various precoders in any channel realization. PSO with objective function: weighted interference minimization.
Figure 6. CDF of the SINR difference, [ΔSINR]_{dB }= [SINR_{m}]_{dB } [SINR_{m'}]_{dB }between the best and the worst UTs in a given channel realization. PSO with objective function: weighted interference minimization.
The CDF of the average transmitted power at the BS is shown in Figure 7. The maximum BS transmit power is 0.0603 W, corresponding to a celledge SNR of 15 dB. In fact, the PSO keeps the BS power amplifiers on at a higher power, most of the time, which is a desired property for amplifier efficiency. The power consumption is reduced beforehand with the limited feedback and limited backhauling. PSO uses BS transmit power more effectively when there is a null constraint on the BF (LBH).
Figure 7. CDFs of the average BS transmit power, max = 0.0603 W(17.8 dBm), with celledge SNR of 15 dB. PSO with objective function: weighted interference minimization. The value in the legend denotes the mean value μ in a given CDF.
It can be seen in Figure 8 that the PSO allows interference compared to the ZF scenarios. The sum rate is improved even when some interference is remaining in the system. This is similar to the SIN technique in [12], but the SIN technique requires full CSI at the CCN. It is also observed that ZF with FFB and FBH completely removes the interference, but this scheme does not use the available BS transmit power effectively as shown in Figure 7. In the case of PSO, as observed in Figure 8, if LBH is preferred then LFB should also be preferred, as PSO with LFB and FFB under PwrAdj shows the same residual interference in the system.
Figure 8. The cropped CDF of the actual interference due to the various precoders. ZF with FFB and FBH completely removes the interference, hence the brown curve is on the yaxis. PSO with objective function: weighted interference minimization.
The convergence of the PSO algorithm when evaluating the objective function of weighted interference minimization for four randomly chosen aggregated channel matrix realizations is shown in Figure 9 for various precoder configurations. This objective function converges in less than 100 iterations when PSO is applied with LFB and LBH. It can be observed that the number of iterations to find a stable solution is comparatively fast when the number of BF coefficients is small. The convergence pattern of FBH is not examined here. This is because the PSO with FFB and FBH has one of the particles fed with the corresponding solution of PSO with FFB and LBH, during the stochastic initialization phase. This was done to show that the PSO implemented in this article only finds a stable equilibrium solution and not the global optimum, as increasing the dimensionality of the problem makes it harder for the PSO, i.e., PSO with unconstrained backhauling, FBH, yielded a slightly poor solution compared to the PSO with constrained backhauling (LBH), when the objective function was sum rate maximization. To unify our PSO proposal, both objective functions followed the same procedure. This is one of the main reasons why the convergence curves of PSO with FBH remain relatively flat.
Figure 9. Convergence of PSO for various configurations. PSO with objective function: weighted interference minimization.
Based on the analysis in Section 3.4 and on the prior experience, the number of BF coefficients carried by a particle decreases with the sparsity of the aggregated channel matrix. With LBH, the PSO converges faster than the case when there is FBH. Reference to Figure 9 could be unfair, due to the reason cited earlier that the solution of PSO with LBH is fed to one of the particles in the case of FBH. If this is not performed, then the faster convergence of the PSO is observed (not shown here).
5.2 Objective function: sum rate maximization
When the objective of the PSO is to maximize the sum rate, the maximization is indirectly related to the particles in the PSO carrying the BF weights via the logarithm. This objective is very sensitive to the power adjustment performed after the PSO algorithm has converged. The CDF of the SINR of any UT is shown in Figure 10. It can be observed that the ambition of improving only the sum rate of the system penalizes the weak SINR UTs.
Figure 10. CDF of the SINR of any UT for various precoders in any channel realization. PSO with objective function: sum rate maximization.
5.3 Gershgorin's circles
In the complex plane, Figure 11 shows the circumference of the Gershgorin's discs for various precoders, with the
objective of the PSO being weighted interference minimization. This figure is plotted
for a given reference SINR value of the PSO with FFB and FBH. The receiver noise is
assumed to be uniform across all the UTs, and SINR is plotted instead of SIR. The
green '+' refers to the elements in the diagonal of the matrix
Figure 11. Gershgorin discs of
It is interesting to note that for the PSO with LFB and LBH, the actual eigen values map closely to the estimated Gershgorin's eigen values, unlike the ZF with LFB and LBH. From an interference point of view, having concentric circles helps containing the interference within the largest circle. ZF with LFB and LBH shows this attribute.
6 Conclusions
In this work, a particle swarm stochastic optimization algorithm has been proposed in a partial JP framework to design the precoding weights for efficient backhauling, achieving a backhaul reduction proportional to the reduction in the CSI feedback. In this context, two objective functions have been considered, a weighted interference minimization and a sum rate maximization. In the proposed weighted interference minimization, the SINR of the weakest UT is iteratively improved, in addition to the interference minimization. With the limited feedback and limited backhaul constraints, and the weighted interference minimization as the objective function, the average sum rate per cell of the UTs is improved by 66.53% with respect to a ZF precoder. The particle swarm based precoder allows some multiuser interference to remain in the system, still improving the sum rate, and it uses the BS transmit power more effectively.
With recent developments in swarm intelligence, the complexity and the feasibility can be improved to achieve a faster and a more robust particle swarm algorithm. There is potential for improving the PSO algorithm with capabilities to perform global search, such as random PSO, which should improve the already promising results presented in this article.
Abbreviations
BF: beamformer (ing); bps: bits per second; BS(s): base station(s); CCN: central coordination node; CDF: cumulative distribution function; CJP: centralized joint processing; CoMP: Coordinate MultiPoint (transmission); CSI: channel state information; FBH: Full BackHauling; FFB: Full FeedBack; JP: joint processing; LBH: Limited BackHauling; LFB: Limited FeedBack; MAC: medium access control; MUMIMO: MultiUser Multiple Input Multiple Output; NoPwrAdj: no power adjustment; PJP: partial joint processing; PHY: physical; PwrAdj: power adjustment; PSO: particle swarm optimization; SIR: signal to interference ratio; SINR: signal to interference plus noise ratio; SNR: signal to noise ratio; SIN: soft interference nulling; UT(s): user terminal(s); ZF: zero forcing.
Competing interests
The authors declare that they have no competing interests.
Acknowledgements
The authors would like to thank the reviewers for their critical comments that greatly improved the article. This study has been partly supported by the Swedish Research Council, within the project 62120094555 Dynamic Multipoint Wireless Transmission. This study is also been supported by The Swedish Agency for Innovation Systems (VINNOVA) and by the EU FP7 project INFSOICT247223 ARTIST4G. C. Botella's work has been supported by the Spanish MEC Grants CONSOLIDERINGENIO 2010 CSD200800010 "COMONSENS" and COSIMA TEC201019545C0401. The authors would also like to acknowledge the members of the VR meetings, in particular Jingya Li and Agisilaos Papadogiannis for the valuable discussions. Thanks to Bhavishya Goel for the fruitful discussions on complexity. The computations were performed on C^{3}SE computing resources.
References

3GPP TR 36.814900, 3rd Generation Partnership Project; Technical specification group radio access network; Further advancements for EUTRA physical layer aspects (Release 9)

MK Karakayali, GJ Foschini, RA Valenzuela, Network coordination for spectrally efficient communications in cellular systems. IEEE Wirel Commun 13(4), 56–61 (2006). Publisher Full Text

D Gesbert, S Hanly, H Huang, SS Shitz, O Simeone, Wei Yu, Multicell MIMO cooperative networks: a new look at interference. IEEE J Sel Areas Commun 28(9), 1380–1408 (2010)

A Papadogiannis, E Hardouin, D Gesbert, Decentralising multicell cooperative processing: a novel robust framework. EURASIP J Wirel Commun Netw, 1–10 (2009) Article ID 890685

P Marsch, G Fettweis, A framework for optimizing the downlink of distributed antenna systems under a constrained backhaul. Proc European Wireless Conference, Paris, France, 1–5 (2007)

P Marsch, G Fettweis, On multicell cooperative transmission in backhaulconstrained cellular systems. Ann Telecommun 63(5), 253–269 (2008). Publisher Full Text

C Botella, T Svensson, X Xu, H Zhang, On the performance of joint processing schemes over the cluster area. Proc IEEE Vehicular Technology Conference, Taipei, Taiwan, 1–5 (2010)

A Papadogiannis, H Bang, D Gesbert, E Hardouin, Downlink overhead reduction for multicell cooperative processing enabled wireless networks. IEEE Personal, Indoor and Mobile Radio Communications, Cannes, France, 1–5 (2008)

A Papadogiannis, HJ Bang, D Gesbert, E Hardouin, Efficient selective feedback design for multicell cooperative networks. IEEE Trans Veh Technol 60(1), 196–205 (2011)

F Boccardi, H Huang, A Alexiou, Network MIMO with reduced backhaul requirements by MAC coordination. Proc IEEE Asilomar Conference on Signals, Systems and Computers, Pacific Grove, 1125–1129 (2008)

J Zhang, R Chen, JG Andrews, A Ghosh, RW Heath, Networked MIMO with clustered linear precoding. IEEE Trans Wirel Commun 8(4), 1910–1921 (2009)

CTK Ng, H Huang, Linear precoding in cooperative MIMO cellular networks with limited coordination clusters. IEEE J Sel Areas Commun 28(9), 1446–1454 (2010)

TR Lakshmana, C Botella, T Svensson, X Xu, J Li, X Chen, Partial joint processing for frequency selective channels. Proc IEEE Vehicular Technology Conference, Ottawa, Canada, 1–5 (2010)

X Wei, T Weber, A Kuhne, A Klein, Joint transmission with imperfect partial channel state information. Proc IEEE Vehicular Technology Conference, Barcelona, Spain, 1–5 (2009)

S Fang, G Wu, SQ Li, Optimal multiuser MIMO linear precoding with LMMSE receiver. EURASIP J Wirel Commun Netw, 1–10 (2009) Article ID 197682

Y Hei, X Li, K Yi, H Yang, Novel scheduling strategy for downlink multiuser MIMO system: particle swarm optimization. Sci China F 52(12), 2279–2289 (2009). Publisher Full Text

C Knievel, PA Hoeher, A Tyrrell, G Auer, Particle swarm enhanced graphbased channel estimation for MIMOOFDM. Proc IEEE Vehicular Technology Conference, Budapest, Hungary, 1–5 (2011)

P Marsch, G Fettweis, On downlink network MIMO under a constrained backhaul and imperfect channel knowledge. Global Telecommunications Conference, California, USA, 1–6 (2009)

Q Spencer, A Swindlehurst, M Haardt, Zeroforcing methods for downlink spatial multiplexing in multiuser MIMO channels. IEEE Trans Signal Process 52(2), 461–471 (2004). Publisher Full Text

H Zhang, H Dai, Cochannel interference mitigation and cooperative processing in downlink multicell multiuser MIMO networks. EURASIP J Wirel Commun Netw 2, 222–235 (2004)

TR Lakshmana, C Botella, T Svensson, Partial Joint Processing with Efficient Backhauling in Coordinated MultiPoint Networks. Proc IEEE Vehicular Technology Conference, Yokohama, Japan, 1–5 (2012)

J Kennedy, RC Eberhart, Particle swarm optimization. Proc IEEE International Conference on Neural Networks, Perth, Australia, 1942–1948 (1995)

AP Engelbrecht, Fundamentals of Computational Swarm Intelligence John Wiley, 171–172

YH Shi, RC Eberhart, Parameter selection in particle swarm optimization. The 7th Annual Conference on Evolutionary Programming, San Diego, USA, 591–601 (1998)

CB Chae, SH Kim, RW Heath, Network coordinated beamforming for cellboundary users: linear and nonlinear approaches. IEEE J Sel Top Signal Process 3(6), 1094–1105 (2009)

CB Peel, BM Hochwald, AL Swindlehurst, A vectorperturbation technique for nearcapacity multiantenna multiuser communicationpart I: channel inversion and regularization. IEEE Trans Commun 53(1), 195–202 (2005). Publisher Full Text

RA Horn, CR Johnson, Matrix Analysis (Cambridge University Press, Cambridge, 1985), pp. 344–347

3GPP TR 36.942a20, 3rd Generation Partnership Project; Technical specification group radio access network; Evolved universal terrestrial radio access; Radio frequency system scenarios (Release 10)

ARTIST4G D1.2, Innovative advanced signal processing algorithms for interference avoidance, ARTIST4G technical deliverable, 84 [https://ictartist4g.eu/projet/workpackages/wp1/documents/d1.2/d1.2.pdf] webcite. Accessed 17 June 2010