Abstract
In this article, we address the resource allocation problem for the downlink of a large network multiple input multiple output orthogonal frequency division multiplexing system with 3sector base stations. The system is statically divided into a number of disjoint clusters of sectors. A twostep resource allocation scheme is proposed involving the intercluster and the intracluster levels. As a first step or intercluster level, two cooperative frequency reuse approaches are designed to mitigate the intercluster interference. A user partition method is proposed to divide the users of each cluster into clusteredge and clustercenter users. To balance the celledge and the cellaverage performance, a fairness jug function is introduced to determine the frequency partition of the cooperative frequency reuse approaches. Then, as a second step or intracluster level, a utilitybased joint scheduling and power allocation algorithm is proposed for each cluster, to maximize the sum utility of all users in the cluster under persector power constraints. Zeroforcing joint transmission is used across multiple sectors within the same cluster. Simulation results show that the proposed scheme can efficiently reduce the intercluster interference and provide considerable performance improvement in terms of both the celledge and cellaverage user data rate. The proposed twostep resource allocation scheme can be implemented independently in each cluster without intercluster information exchange, which is an attractive property for practical systems, since it reduces both the network signaling overhead and the computational complexity.
Keywords:
OFDMA; network MIMO; frequency reuse; resource allocation; interference coordination1 Introduction
Driven by the demands to support data applications at higher throughput and spectral efficiency, orthogonal frequency division multiplexing (OFDM) based multiple access is being considered as a promising technique for the future wireless networks. OFDMA has been adopted as the downlink access technology of 3rd generation partnership project (3GPP) long term evolution (LTE) and LTEadvanced standards [1]. Based on the OFDM technique, OFDMA inherits the immunity to intracell interference. However, the intercell interference is still a major issue. In fact, a frequency reuse factor being equal to one causes serious intercell interference to users in the celledge areas, leading to poor celledge performance. Viable intercell interference mitigation approaches are reviewed in [2], including the use of power control, fractional frequency reuse, opportunistic spectrum access, intra and intercell interference cancellation, and multiple input multiple output (MIMO) techniques.
Recently, coordinated multipoint transmission/reception (CoMP) has been proposed in 3GPP LTEAdvanced as a key technique to increase the system spectrum efficiency as well as the celledge performance [1]. In the case of CoMP joint transmission, both data and channel state information (CSI) of the users in CoMP mode can be shared by coordinated multiple cells, which can act as a single and distributed antenna array. Data to a user can be simultaneously transmitted from the coordinated cells to improve the received signal quality. Hence, the intercell interference is reduced by exploiting the signals transmitted from other cells to assist the transmission rather than treating them as interference. Notice that this technique is also referred as network coordination or network MIMO [3].
In a global network MIMO system, without any feedback, backhaul and synchronization constraints, the intercell interference can be completely eliminated. However, from a practical point of view, a major setback of global coordination is the large amount of feedback needed from the users and the large signaling overhead required for the intercell information exchange. An interesting tradeoff between the system performance and the required amount of CSI feedback and backhaul exchange has been pointed out [49]. This tradeoff is one of the reasons for restricting the use of network MIMO techniques to a limited number of cells or areas of the system. The system is typically divided into clusters of cells, and the joint transmission is implemented within the cells included in each cluster. The cluster formation can be static [1014], or dynamic [1517]. The static cluster formation specifies a predefined set of clusters of cells which do not change in time, whereas the dynamic clustering approaches form the clusters based on the varying channel conditions that users experience to different cells. Note that a coordinated cluster also causes intercluster interference to the users in the neighboring clusters, especially to users in the clusteredge area. Therefore, the design of efficient intercluster interference coordination strategies and radio resource management algorithms is of great interest in the field of clustered network MIMO systems.
Previous studies about resource allocation in network MIMO OFDMA systems have mainly focused on the global network MIMO case [1820], or on the single cluster case without considering intercluster interference [2123]. Recently, research has shifted towards the limited coordination case with different cluster formation models, that is, resource allocation with dynamic clustering [24,25] or resource allocation with static clustering [14]. In [24], an intercluster interference canceler performing linear processing on the downlink transmission signals is proposed for multiuser MIMO distributed antenna systems. However, a central unit (CU) is needed to collect the global CSI of all users in the system and to calculate the transmission weight vectors for each cluster. Notice that this centralized framework requires an enormous amount of feedback and backhaul overhead. In [25], a centralized multicell network scheduling algorithm is proposed to minimize the intercluster interference by performing clustering from the user's point of view, which also needs a CU for global network scheduling. For a system with a large number of cells and a large number of users, a high computational burden will be caused in the CU. In [14], the authors instead consider a more realistic system model, where the network is divided into a number of disjoint static clusters, and limited intercluster coordination is used to precancel interference for the users at the edge of neighboring clusters. In this approach, perfect CSI is available at the cluster side for both the cluster users and edge users in the neighboring clusters. Hence, each cluster can help the edge users in the neighboring clusters by taking these users into account when designing the precoding matrices. However, with a large number of users in each cluster, in order to serve a clusteredge user, all the neighboring clusters need to provide a given number of degrees of freedom by dropping some scheduled users of their own, which leaves fewer degrees of freedom to serve their own users. In addition, a joint scheduling across clusters is needed for the whole network, which requires intercluster communication and increases the complexity of the resource allocation design.
Fractional frequency reuse is a promising technique for intercell interference mitigation. Instead of using spatial degrees of freedom to suppress the intercell interference, it restricts the available frequency resources of different cells through a predefined frequency reuse rule or through appropriate power control. In [26], the division of frequency resources is investigated for the uplink of a linear network MIMO system. Since the system is considered to be uniformly clustered in a linear grid, the intercluster interference can be completely eliminated with simple reuse strategies, e.g., half of the available frequency resources are assigned to each cluster with different resources assigned to adjacent clusters. In [27], the authors consider a more realistic scenario and employ appropriate power control in frequency such that adjacent clusters generate different interference levels in different subchannels. However, the power control problem is formulated in a centralized way such that a CU is needed for the network to solve the optimization problem for all the clusters. In [28,29], two frequency reuse schemes were proposed for a multicell OFDMA system, supporting noncoherent joint transmission to celledge users by usercentric dynamic clustering. Due to the usercentric nature of the clustering, these approaches also require a joint scheduling across cells for the whole network.
In this article, we address the resource allocation problem for the downlink of a clustered network MIMO OFDMA system with 3sector base stations (BSs). Each sector has one directional antenna, and it is associated with a directional cell area. The whole system is statically divided into disjoint clusters of sectors. Due to practical issues (e.g., synchronization constraints, feedback constraints, backhaul network constraints and the system complexity), intercluster information exchange may not be feasible in realistic cellular systems. Targeting practical scenarios, radio resource allocation is independently performed in each cluster without intercluster communication. Zeroforcing beamforming is considered as the coherent joint transmission scheme within each cluster, which allows multiple users to share the same subchannel in each time slot by choosing proper beamforming weights. A twostep resource allocation scheme is proposed, which involves both intercluster and intracluster levels:
• As a first step or intercluster level of resource allocation, two novel cooperative frequency reuse approaches (CFR1 and CFR2) are proposed to mitigate the intercluster interference. A user partition method based on the long term channel gain is introduced to divide the users of each cluster into clusteredge users (CEU) and clustercenter users (CCU). Frequency subchannels in each cluster are separated into two orthogonal sets, that is, clusteredge and clustercenter frequency sets. The intercluster interference is reduced by mapping different groups of CEU to different subchannels of the clusteredge frequency set in a cooperative way. We also show that the frequency partition (the size of the clustercenter frequency set) and the user partition threshold are the parameters that can be optimized to balance the celledge and cellaverage performance.
• As a second step or intracluster level of resource allocation, a suboptimal utilitybased joint scheduling and power allocation algorithm is proposed for each cluster with a low complexity. Assume that perfect CSI is available at the cluster side for the users within this cluster. The algorithm jointly determines the set of users scheduled on each subchannel, and the power allocation across subchannels. The objective is to maximize the sum utility of all users in the cluster subject to persector power constraints.
The main contributions of our scheme are listed as follows:
• Frequency reuse approach performed in the first step (intercluster level) can effectively reduce the intercluster interference for CEU. Moreover, the user partition and the frequency partition are performed at the first step. In this way, only a subset of users is mapped to each subchannel, leading to a significant reduction of both the feedback requirements and the computational complexity in the second step of the proposed scheme.
• The proposed twostep resource allocation scheme can be implemented in different time scales, i.e., the intercluster interference mitigation would be more static than the intracluster scheduling and power allocation performed in the second step. Moreover, radio resource allocation is independently performed in each cluster without intercluster information exchange. Therefore, no intercluster coordination links are needed, which is an attractive property for the deployment of practical systems.
The proposed resource allocation scheme is compared with the universal frequency reuse (UFR) scheme and the intercluster interference precancellation (IPC) strategy proposed in [14]. Simulation results demonstrate that a significant improvement on both the celledge and the cellaverage performance can be obtained by the proposed scheme, with a much lower computational complexity.
The rest of the article is organized as follows: Section 2 describes the system model and introduces the problem formulation for the downlink of a clustered network MIMO OFDMA system. In Section 3, two cooperative frequency reuse approaches are proposed for mitigating intercluster interference. Then, for the intracluster level, a joint scheduling and power allocation algorithm is proposed for each cluster in Section 4. Section 5 presents the system level simulation results. Conclusions and future work are drawn in Section 6.
Notations: · denotes the cardinality of a set. ∥·∥ denotes the Euclidean norm of a vector or absolute value of a scalar. (·)^{T }and (·)^{* }denote the transpose operation and conjugate transpose operation, respectively.
2 System model and problem formulation
Consider the downlink of a network MIMO OFDMA system with multiple 3sector BSs. Each sector has one directional antenna, and it is associated with a directional cell area. The three antennas of each BS are located at the same site. Each user is equipped with one receive antenna and assigned to a serving sector that is selected based on longterm channel gain, i.e., pathloss and shadow fading. The whole system is statically divided into a number of disjoint clusters, where each cluster consists of three neighboring sectors belonging to different BSs. An example of a coordinated cluster is illustrated in Figure 1.
Figure 1. Example of a coordinated cluster of three neighboring sectors.
Assume that all the clusters have the same number of sectors B. Each sector has the same N subchannels, i.e., there are N subchannels available for joint transmission per cluster. Every cluster is working independently in the system. Without loss of generality, we consider that a given cluster is denoted by c with K^{c }users. Each of the sectors in cluster c is denoted by (c, b), where b ∈ {1,...,B}. Concentrating on an arbitrary time slot, the sectors within cluster c provide joint transmission for the scheduled users (denoted by S(c,n)) on each subchannel n based on the available CSI. The received signal of the scheduled user k on subchannel n in cluster c is given as
where denotes the complex channel response between sector (c, b) and user k on subchannel n, consisting of path loss, shadow fading, and smallscale fading. is the beamforming weight for user k on subchannel n with respect to sector (c, b). denotes the data symbol for user k on subchannel n, which is transmitted from all the sectors inside cluster c. z_{k,n }is the additive white Gaussian noise at user k on subchannel n with zero mean and variance σ^{2}.
Let and denote the channel vector and the beamforming vector from all sectors in cluster c to user k on subchannel n, respectively. The maximum number of users that can be supported on subchannel n in cluster c is bounded by the total number of transmit antennas of cluster c, i.e., S(c,n) ≤ B. In this article, zeroforcing joint transmission is used to eliminate the intracluster interference. The beamforming matrix is defined as the pseudoinverse of the channel matrix. Thus, we have
Then, the received signal becomes
Denote as the symbol power allocated to user k on subchannel n across the B sectors in cluster c. Then, the signal to interference plus noise ratio (SINR) of user k on subchannel n is derived as
Finally, based on the Shannon theorem, the achievable transmission rate of user k on subchannel n can be expressed as
where W is the bandwidth of each subchannel, and β is the SINR gap, which is a constant related to the target bit error rate (BER) given as β = 1.5/ln(BER) using MQAM modulation [30]. Then, the instantaneous data rate for user k at a given time slot becomes
The transmit power of sector (c, b) on subchannel n is given by
and the total transmit power of sector (c, b) is
In this article, the maximum available transmit power at each sector is restricted to a value, that is, for sector (c,b).
Targeting practical scenarios, radio resource allocation is independently performed in each cluster. The objective is to maximize the sum utility of all users in the cluster under persector power constraints. For any given time slot, the coordinated sectors within each cluster can jointly determine (1) the set of users scheduled on each subchannel, and (2) the symbol power allocated to each scheduled user.
Concentrate on one arbitrary cluster c with K^{c }users. Let U_{k }(·) denote the utility function of user k, which is assumed to be continuously differentiable, nondecreasing and concave to balance the efficiency and fairness of the system performance. Let denote the K^{c }× N sized symbol power allocation matrix in a scheduling interval, and S^{c }= [S(c, n)] denote the selected user sets on each subchannel. Then, the objective function of maximizing the sum utility of all users under persector power constraints can be formulated as
The average data rate of user k at time slot t is updated using an exponentially lowpass time window as [31]
where ρ = (T_{s}/T_{w}), T_{s }is the slot length, and T_{w }is the length of the window. R_{k }(t) is the instantaneous data rate for user k at a time slot t and can be derived by (6). Then, based on the firstorder Taylor expansion, the objective function in (9) can be rewritten as
which can be interpreted as maximizing the weighted sum rate, as is fixed at time slot t. From now on, μ_{k }is used to represent . It should be pointed out that the first order Taylor expansion approximation is suboptimal. However, this approximation relaxes the original complex optimization problem to a weighted sum rate maximization problem, which greatly simplifies the algorithm design. The weights are adaptively controlled by the marginal utility with respect to the current average rates. Specifically, as has been analyzed in [31], if the utility function is defined as a natural logarithm of the user's average data rate at the current time slot, the objective becomes to maintain proportional fairness among users. Therefore, the utilitybased algorithm presented by the first order Taylor expansion approximation can be treated as a general framework for allocating multiuser shared resources.
Substituting (6) and (8) into (11), the optimization problem under persector power constraints can be expressed as
Note that the overall resource allocation problem is a nonconvex combinatorial optimization problem. In addition, to solve the overall optimization problem, a significant amount of CSI feedback and information exchange between clusters is needed. Thus, computing its optimal solution within one step would require global network coordination, which is not realistic for implementation in real systems. Therefore, we propose a twostep resource allocation scheme to get closer to a practical implementation. In the following, we focus on a system with B = 3. The proposed scheme can be easily extended to the B > 3 case, e.g., multiple neighboring clusters can be grouped together to form a new bigger cluster.
3 Intercluster interference mitigation
As shown in Section 2, the intracluster interference can be completely eliminated by joint transmission as long as S(c, n) ≤ B. However, since the neighboring clusters are also using the same N subchannels, a cluster of coordinated sectors still causes intercluster interference to the users in the neighboring clusters, especially to the users in the clusteredge area.
In this section, based on the idea of static fractional frequency reuse, two cooperative frequency reuse approaches are proposed to mitigate the intercluster interference. These frequency reuse approaches will be considered as the first step or intercluster level of the proposed resource allocation scheme.
3.1 Cooperative frequency reuse Scheme 1
From a clusterspecific point of view, users in each cluster can be divided into two classes, that is, CEU and CCU. In [14], the authors propose a user partition method based on user locations, and determine an intercluster coordination area by a predefined coordination distance. However, this distance parameter based user partition could be a hard decision for a real implementation. In addition, the effect of shadow fading on the users is ignored. In [28,29], the user partition is instead based on the long term channel gain, which is more suitable for a practical use. Since the clusters are overlapping in [28,29], the partition is performed from a cellspecific point of view, where users in each cell are divided into cellcenter users and celledge users. In this article, we propose a clusterspecific user partition approach based on the long term channel gain, which is defined as follows.
Definition: User partition threshold, Δl, is the threshold used for classifying CEU and CCU. User k in sector (c, b) estimates and feeds back to its serving sector the long term channel gains from its serving sector and from four candidate neighboring sectors, that is, the two neighboring sectors within the same cluster c, and the other two neighboring sectors within the BS where its serving sector belongs to. In the example of Figure 1, the measurement set for the UE consists of its serving sector, the two neighboring sectors belonging to BS1 and the two neighboring sectors belonging to its coordinated cluster (the shadowed area). After obtaining these values, cluster c finds out the weakest long term channel gain within the cluster (denoted by in dB) and the strongest long term channel gain from the two candidate neighboring sectors outside the cluster (denoted by in dB). Note that reflects the weakest link within the cluster, which is the dominant link that affects the performance gain provided by intracluster zeroforcing joint transmission [32]. reflects the strongest interference link outside the cluster. If , intercluster interference would compromise the intracluster joint transmission gain, i.e., intercluster interference would be a big challenge for user k. Hence, user k is considered as a CEU if ; otherwise, it is regarded as a CCU. The threshold value can be predefined by each cluster or by the network (as employed in the handoff algorithm for practical wireless networks), and it can be a parameter to optimize according to the network design objective. Note that the measurements required from the users are based on the long term channel gain, which can be obtained from the ones used for the handoff process [33]. Hence, there is no measurements and feedback increase from the users by using this user partition method. One approach to further reduce the feedback would be to obtain and at the user side. Hence, these values or could be instead fed back for user partition.
The N frequency subchannels are divided into two orthogonal sets, G and F, where F is further divided into three orthogonal subsets, marked by f_{i}, with i = {1, 2, 3}. Subchannels in set G are used for CCU with frequency reuse factor of one for each cluster, while subchannels in set F are used for CEU with frequency reuse factor of for each cluster. This cooperative frequency reuse scheme, named as CFR1, is shown in Figure 2. Note that neighboring clusters adopt orthogonal subchannels for the CEU. Hence, the intercluster interference can be significantly reduced. The frequency partition (the size of the clustercenter frequency set G) can be a parameter to optimize, and it is treated by system level simulation in Section 5.
Figure 2. Frequency reuse rule for CFR1.
3.2 Cooperative frequency reuse Scheme 2
In CFR1, the frequency reuse factor for CEU is , that is, only one third of the subchannels in set F is available for CEU in each cluster. In this subsection, we propose a second cooperative frequency reuse scheme, named as CFR2, where the frequency reuse factor for CEU is in each cluster.
Assume that every three neighboring clusters are grouped together and respectively marked as Clusters 1, 2 and 3 (see Figure 3). Given the marker of each cluster (Cluster 1, 2 or 3), CEU in each cluster are further divided into two types according to their dominant interference clusters. The dominant interference cluster of user k is defined as the one that the neighboring sector with the strongest long term channel gain belongs to. The subchannels in set F are further divided into three orthogonal subsets, marked by f_{i}, with i = {1,2, 3}. Then, frequency subset f_{i }is assigned for the CEU whose dominant interference cluster is cluster i. Based on the above defined frequency reuse rule, two subsets of F are available for CEU within each cluster, where different type of CEU use the subchannels belonging to different subsets. As illustrated in Figure 3, the subchannels used for CEU in neighboring clusters in CFR2 are not orthogonal any more. However, for each CEU, the intercluster interference coming from its dominant interference cluster can be eliminated. In the example of Figure 3, the subchannels belonging to subset f_{2 }can be used for CEU1, when the dominant interference cluster of CEU1 is Cluster 2. Since f_{2 }is not available in Cluster 2 according to the frequency reuse rule of CFR2, there will be no intercluster interference introduced by Cluster 2 for CEU1.
Figure 3. Frequency reuse rule for CFR2.
4 Joint scheduling and power allocation
In the intercluster interference mitigation or first step, users in each cluster are divided into two groups (CEU/CCU) and mapped to different frequency sets. In this section, a utilitybased joint scheduling and power allocation algorithm is proposed for each cluster to solve (12), which is considered as the intracluster level or the second step of the proposed resource allocation scheme.
In realistic cellular systems, intercluster information exchange may not be feasible due to practical issues, e.g., synchronization constraints, feedback constraints, backhaul network constraints and the system complexity. Targeting practical scenarios, joint scheduling and power allocation is proposed to be independently performed in each cluster without intercluster communication. Assume that each cluster only has perfect CSI of the users inside this cluster, and the user data are shared by these sectors errorfree and without delay. With joint transmission among the sectors within the cluster, the intracluster interference can be completely eliminated as long as S(c,n) ≤ B. Note that the intercluster interference is reduced by frequency reuse in the first step of the resource allocation scheme. In order to avoid intercluster information exchange and the interdependency issues among clusters, the remaining intercluster interference is not considered in this section in the SINR expression, i.e., the signal to noise ratio (SNR) of user k on subchannel n in cluster c based on the zero forcing joint transmission given in (4) is derived as . Without loss of generality, we suppress the cluster index, and concentrate on one arbitrary cluster. b is used to represent (c, b) to denote each of the sectors in cluster c. The optimization problem (12) is then given by
where γ_{k,n }= p_{k,n}/σ^{2}, and P_{max,b }is the maximum transmit power of sector b. Note that the system performance could be slightly improved by redesigning a joint scheduling and power allocation algorithm where the remaining intercluster interference is taken into account. However, this would require an exchange of information between clusters. Moreover, the interdependency issues among clusters due to considering remaining interference would result in a system overall optimization problem, which is intractable and needs global coordination.
Equation (12) is a nonconvex combinatorial optimization problem, thus, computing its globally optimal solution may not be feasible in practice. Here, we propose a suboptimal low complexity joint scheduling and power control algorithm for practical implementations. Assume P_{max,b }= P_{max }for all sectors, and that the total transmit power P_{max }in each sector is equally preallocated to all the available subchannels. Then, the persector power constraints are reduced to persubchannel power constraints with a constraint value P_{max}/N, and the joint scheduling and power allocation problems are decoupled on each subchannel. On each subchannel n, equal user power allocation [3] is adopted for the users scheduled in set S(n), that is, each selected user is allocated with the same symbol power given by
It should be pointed out that equal user power allocation [3] is suboptimal, since it typically results in only one sector meeting the maximum transmitted power preallocated to subchannel n, and the remaining B  1 sectors transmit below the P_{max}/N value. Actually, for each subchannel n, the relaxed power allocation problem is a convex problem for the scheduled users in set S(n). Similar to reference [19], the optimal solution can be obtained based on waterfilling distribution via standard optimization techniques. However, as mentioned in [19], the computational complexity for obtaining the optimal value is still high. For simplicity, equal user power allocation is adopted in this article.
Since we consider only SNR, i.e., γ_{k,n }= p_{k,n}/σ^{2}. Thus,
Then, the sum utility of the user set S(n) on subchannel n can be calculated by
Hence, under the above assumptions, the suboptimal solution becomes an exhaustive search. For each subchannel n, the coordinated sectors search all possible user sets S(n) in the cluster. The chosen user set S*(n) will be the one that achieves the highest sum utility on subchannel n. Since the maximum number of users that can be supported on a subchannel is bounded by the total number of transmit antennas of the cluster, i.e., S(n) ≤ B, the number of feasible selected user sets for each subchannel is K^{B}, with K = K^{c }denotes the number of users in cluster c. Therefore, the complexity is O (N × K^{B}), which is prohibitively high. However, after the user partition and frequency partition at the first step shown in Section 3, only a subset of users is mapped to each subchannel. Therefore, the number of the feasible user sets for exhaustive search on each subchannel is reduced. The complexity is then reduced to , where K_{i }is the number of CEU that are mapped to the frequency subset f_{i}, and K_{g }is the number of CCU that are mapped to the frequency subset G, with and . The proposed twostep resource allocation algorithm is summarized in Algorithm 1.
5 Simulation results
In this section, the performance of the proposed scheme is studied by system level simulations. We consider a network MIMO OFDMA system with 19 clusters. A wraparound technique is adopted to avoid the boundary effect, which causes the clusters in the boundary of the cellular network to receive less interference. Each cluster consists of B = 3 neighboring sectors with one transmit antenna each. The antenna gain pattern measured in dB is given based on [1]
where φ is the angle that the user forms with the sector boresight φ ∈ [180°, 180°]. φ_{3dB }= 70° is the angle associated with the half power beamwidth and A_{m }= 20 dB is the maximum attenuation for the sidelobe. The persector power constraint, P_{max}, is 43dBm. The cell radius is 500 m. A typical urban multipath channel model [34] is used with path loss L(d) = 128.1 + 37.6log_{10}(d) in dB, with d in km. The number of subchannels, N, is 50, with each subchannel consists of 12 contiguous subcarriers. The subcarrier spacing is 15 kHz. K = 27 users are uniformly dropped in each cluster with one receive antenna for each user. A fullbuffer traffic model is assumed for each user. The natural logarithm function is used as the users' utility function, U_{k}(·). The throughput filter window length, T_{w}, is set to 100 time slots. The user partition threshold, Δl, is set to 2dB, which will be shown as a proper choice to balance the celledge and cellaverage data rate. This value of Δl results in 16 CEU and 11 CCU in average for each cluster (over 100 different user locations). The average number of CCU and CEU per cluster (K_{i }= 16, K_{g }= 11) will be used for calculating the complexity of the proposed Schemes 1 and 2, as shown in Table 1. Let Scheme1 denote the proposed resource scheme adopting CFR1 as the first step, while Scheme2 denotes the proposed resource allocation scheme using CFR2 as the first step. The simulation parameters are summarized in Table 2.
5.1 Frequency partition and user partition
In this article, we consider the following performance metrics defined in 3GPP [1]:
• Throughput cumulative distribution function (CDF) or user average data rate CDF, which is the CDF of the average data rate including all the users in the system.
• Cellaverage user data rate, which is the 50% point of the user average data rate CDF, denoted by R_{ave}.
• Celledge user data rate, which is the 5% point of the user average data rate CDF, denoted by R_{edge}.
First, we consider the effect of frequency partition on the performance of Schemes 1 and 2. Assume the set of subchannels used for CEU, denoted by F, is equally divided into three subsets, that is f_{1} = f_{2} = f_{3} = F/3, with F + G = 50. The subchannels in set G are used for CCU.
Considering different sizes of the clustercenter frequency set G, Figure 4 shows the CDF of the user average data rate including all CEU in the system and the CDF curves including all CCU in the system for Scheme1. While the corresponding CDF curves for Scheme2 are plotted in Figure 5. It can be seen that there is a tradeoff when choosing G. Actually, if G is large, more subchannels will be allocated to CCU, leading to CCU's performance increasing with CEU's performance decreasing for both of these two schemes. In Scheme1, the intercluster interference is significantly reduced for CEU due to the use of CFR1. The clustercenter areas are still interference limited, since the frequency reuse factor is one for the these areas. Hence, we can observe from Figure 4 that CEU achieve higher date rates compared with CCU in Scheme1 in most cases, e.g., when G<29. In Scheme2, the intercluster interference is handled by CFR2, and CEU still suffer the interference from some neighboring clusters. Therefore, the CEU's data rates are lower compared with that of CCU in Scheme2, as shown in Figure 5.
Figure 4. CDF of user average data rate for CEU and CCU in Scheme1.
Figure 5. CDF of user average data rate for CEU and CCU in Scheme2.
In order to have a further understanding of the two proposed schemes with respect to the system performance metrics R_{edge }and R_{ave}, the user average data rate CDF curves including all users in the system with respect to different values of G are plotted in Figures 6 and 7 for Schemes 1 and 2, respectively. Notice that R_{edge }is defined based on the user average data rate (the 5% point of the user average data rate CDF including all the users in the system) according to 3GPP [1], instead of the user category (CEU). Hence, R_{edge }is not equivalent to the average date rate of CEU. For example, as shown in Figure 4, the users with lower average data rate actually come from CCU in Scheme1. While for Scheme2, the users with lower average data rate come from CEU as shown in Figure 5. The average date rate of CCU increases as the number of frequency bands allocated to the CCU (G) increases. Hence, it is reasonable to see from Figures 6 and 7 that R_{edge }increases as G increases in Scheme1, while R_{edge }decreases as G increases in Scheme2.
Figure 6. CDF of user average data rate in Scheme1.
Figure 7. CDF of user average data rate in Scheme2.
In Figure 6, there are plateau regions at CDF values of 0.4 and 0.93. The plateau region at 0.4 is actually the point separating the CCU and CEU in the Scheme1, while the plateau region at the CDF value around 0.93 comes from the CEU (see Figure 4). Based on the user partition method proposed in this article, the CEU group consists of both the users that are close to the neighboring clusters and have poor channel qualities (sectoredge users), and the users that are close to their serving sector and have good channel qualities (sectorcenter users). The CDF value around 0.93 is actually the point separating the sectoredge users and the sectorcenter users within the CEU group.
The above results are obtained by setting the user partition threshold, Δl, to be 2dB. Besides of frequency partition, the user partition would also affect the value of R_{edge }and R_{ave}. In order to balance the celledge and cellaverage performance, a utility function is defined in [14] to evaluate the effect of coordination distance on both R_{edge }and R_{ave}. In this article, we map the distance parameter to the size of the clustercenter frequency set G and the user partition threshold Δl. Then, a fairness jug function with respect to the value of G and Δl is defined as
where α ∈ [0,1] is a fairness factor reflecting the design objective. When the aim is to improve the celledge date rate, we choose a larger value of α. If the objective is the sum rate, a smaller value of α is picked. As an example, α = 2/3 is selected in our simulation, which means we target the celledge performance. Recall that a user k is considered as a CEU if ; otherwise, it is a CCU. Therefore, the number of CEU decreases as the value of Δl increases. For example, in our simulations, the percentage of CEU is around 22.2% with Δl = 6dB, and 66.7% with Δl = 6dB.
The values of Fairness Jug Index with respect to different G and Δl are shown in Figures 8 and 9 for Schemes 1 and 2, respectively. For each scheme, the user average data rate CDF including all users in the system is first plotted with respect to each pair of (G,Δl). Then, R_{edge }(G,Δl) and R_{ave }(G,Δl) can be obtained from the corresponding CDF curve, according to the definition in 3GPP. The Fairness Jug Index, J(G,Δl), is finally derived by substituting R_{edge }(G, Δl) and R_{ave}(G,Δl) into (18). It can be seen that the maximum values for both Schemes 1 and 2 are achieved when Δl = 2dB. While the optimal frequency partition is achieved with G = 29 and F = 7 for Scheme1, G = 17 and F = 11 for Scheme2. Compared with Scheme1, Scheme2 needs more subchannels for CEU. As has been mentioned above, Scheme1 performs CFR1 for intercluster interference mitigation, where neighboring clusters use orthogonal subchannels for CEU. Hence, the intercluster interference is significantly reduced in Scheme1, leading to a large performance improvement for CEU (see Figure 4). In Scheme2, CEU still suffer the interference from some neighboring clusters. Hence, more clusteredge subchannels are needed in Scheme2 to increase the user data rate of CEU (see Figure 5).
Figure 8. Fairness Jug Index in Scheme1 with different G and Δl.
Figure 9. Fairness Jug Index in Scheme2 with different G and Δl.
5.2 Performance analysis
In this simulation, based on the results from Figures 8 and 9, we choose G = 29 for Scheme1 and G = 17 for Scheme2. Δl is set to be 2dB. Besides the proposed Schemes 1 and 2, the following two schemes are considered as reference schemes for performance comparison.
• UFR, where all subchannels are available for each cluster and the proposed joint scheduling and power allocation is adopted independently within each cluster irrespective of the user category (CEU/CCU).
• IPC, where each cluster helps the neighboring CEU by taking these users into account when designing precoding matrices. In this scheme, the twostep joint scheduling approach described in [14] is adopted as follows.
 In a first step (intracluster level), each cluster performs joint scheduling and power allocation within its own cluster (the same approach as in the UFR scheme).
 In a second step (intercluster level), IPC is independently performed on each subchannel: CEU scheduled on the corresponding subchannel inform the neighboring helper clusters. Then, each cluster deals with the requests from CEU in the neighboring clusters in a sequential way, and it selects to always help those CEU by randomly dropping some of its own users scheduled on the same subchannel. After this rescheduling process, each cluster redesigns the transmit power for the scheduled users using (14).
In Figure 10, the CDF of the average SINR of all users in the system is evaluated for the two proposed schemes. For each user, the SINR is calculated per trial, and then averaged over 100 trials. In each trial, the SINR of each user is obtained by averaging the SINR values over all subchannels that are assigned to it, considering the remaining interference power from one ring of six neighboring clusters. Note that the wraparound technique is adopted to avoid the boundary effect for the users in the boundary clusters (the clusters in the boundary of the cellular deployment). Compared with the UFR Scheme, it is shown that the average SINR performance of the proposed schemes is significantly improved due to the intercluster interference reduction. Similar to the analysis for Figure 6, the plateau region at 0.4 for Scheme1 in this figure is actually the point separating the SINR values of CCU and CEU. Due to the use of CFR1 in Scheme1, the intercluster interference reduction of CEU is quite significant, resulting in a better SINR performance for CEU compared with that of CCU in Scheme1.
Figure 10. CDF of user average SINR for different schemes.
Figures 11 and 12 show the CDF of the user average data rate and the CDF of the user average utility for the considered resource allocation schemes. Table 1 gives the performance comparison of the different schemes in terms of Fairness Jug Index, computational complexity and intercluster communication requirements. We can see that:
• Although intercluster interference is treated in the IPC scheme, its performance is even worse than the UFR scheme, which does not take intercluster interference into account. The poor performance of the IPC scheme was caused by the following two main reasons.
 Limited degrees of freedom for each cluster to help the neighboring CEU while serving its own users. In the IPC scheme, in order to help a neighboring CEU by interference precancellation, the cluster needs to provide a certain number of degrees of freedom by dropping some scheduled users of its own, which leaves fewer degrees of freedom to serve its own users. As explained in Lemma 2 of [14], for a cluster with B coordinated sectors and k_{e }neighboring CEU to help, the maximum number of users that can be supported simultaneously by joint transmission in this cluster is bounded by , where N_{t }denotes the number of transmit antennas per sector and N_{r }denotes the number of receive antennas per user. Therefore, to help a neighboring CEU, the total number of users that the IPC scheme can support is reduced. In our system model, N_{t }= N_{r }= 1. Hence, the maximum number of users that can be served within a cluster is determined by the cluster size B. With a larger B, there would be spare degrees of freedom left for each cluster to help neighboring CEU. However, note that due to path loss, the CEU do not benefit from far away sectors' transmission. As shown in [14], there is a diminishing gain with the increase of the cluster size. Therefore, the performance of Schemes 1 and 2 with respect to the IPC scheme might improve for a larger B.
 A smart global scheduler is required for jointly scheduling users across multiple clusters. Note that the twostep joint scheduling approach in [14] aggressively protects the neighboring CEU by randomly dropping some scheduled users of its own, which leaves fewer degrees of freedom to serve its own users. Since the dropped users in each cluster are randomly picked out, some of its own scheduled CEU might also be dropped out in Step 2, resulting in the performance degradation of CEU. In addition, for each cluster, the neighboring CEU come from six neighboring clusters. Hence, the chance that a cluster receives requests from neighboring CEU is very high. In an extreme case for a subchannel, where the number of scheduled neighboring CEU of a cluster is large (larger or equal to B), the twostep joint scheduling strategy would force the cluster to drop all its scheduled users on this subchannel to aggressively protect the neighboring CEU, leading to both CEU and CCU performance degradation. Hence, in order to improve the performance of IPC scheme, a smart global scheduler is required for joint user scheduling across multiple clusters. However, this global optimization requires intercluster communication and increases the complexity of the resource allocation design.
• In Figure 11, compared with UFR, the celledge user data rate of the proposed Scheme2 is improved by 20%, while the cellaverage user data rate in the Scheme2 is improved by 5%. The proposed Scheme1 achieves a much more significant performance improvement compared to the UFR scheme, with about 180% increase of celledge user data rate and 90% increase of cellaverage user data rate. Note that although the average data rate and the average utility performance of Scheme2 is close to that of UFR scheme, the complexity of Scheme2 is much lower compared with UFR scheme, which can be observed in Table 1.
• As shown in Table 1, the computational complexity of the proposed two schemes is much lower than the UFR scheme and the IPC scheme, with Scheme2 achieving the lowest complexity. Since the intercluster interference mitigation approaches (CFR1 and CFR2) are adopted in the first step of Schemes 1 and 2, only a subset of users is mapped to each subchannel. Hence, the number of feasible user sets for exhaustive search in the second step for Schemes 1 and 2 is significantly reduced. With Δl = 2dB, the average numbers of CEU and CCU per cluster are K_{i }= 16 and K_{g }= 11 respectively in our simulation. As explained in Section 4, the complexity per cluster for Schemes 1 and 2 is . The number of the feasible selected user sets for each subchannel in the UFR scheme is K^{B}. Therefore, the complexity for the UFR scheme is N × K^{B}. In the IPC scheme, each cluster performs joint scheduling and power allocation within its own cluster based on UFR in a first step, then each cluster needs to perform one more time user selection according the requests from CEU in the neighboring clusters. Hence, the complexity for IPC is N × (K^{B }+ 1 ). Note that the complexity of the four schemes considered in this article increases exponentially with the cluster size B.
Figure 11. CDF of user average data rate for different schemes.
Figure 12. CDF of user average utility for different schemes.
6 Conclusions
The resource allocation problem has been considered for the downlink of a clustered network MIMO OFDMA system. A twostep resource allocation scheme with intercluster interference mitigation and intracluster joint scheduling and power allocation has been presented. In particular, the main task of managing the intercluster interference is accomplished by two cooperative frequency reuse approaches at the first step of the proposed resource allocation scheme. A user partition method based on the long term channel gain is introduced to divide the users of each cluster into clusteredge and clustercenter users. Frequency subchannels in each cluster are separated into clusteredge and clustercenter frequency sets. The intercluster interference is reduced by mapping different groups of clusteredge users to different subchannels of the clusteredge frequency set in a cooperative way. We have shown that there is a tradeoff between the celledge and cellaverage performance while choosing the frequency partition and the user partition, i.e., the size of the clustercenter frequency set and the user partition threshold. As the second step, a suboptimal utilitybased joint scheduling and power allocation algorithm is proposed for each cluster as the intracluster level of resource allocation. The objective is to maximize the sum utility of all users within the cluster under persector power constraints. Note that in realistic scenarios, the twostep approach can be implemented in different time scales, i.e., the intercluster interference mitigation would be more static than the intracluster scheduling and power allocation performed in the second step. It has been demonstrated by simulation results that our proposed resource allocation scheme can provide a considerable performance improvement in terms of the celledge user data rate, the cellaverage user data rate and the user average utility. In addition, the proposed twostep resource allocation scheme can be implemented independently in each cluster without intercluster communication, which is an attractive property for practical systems, since it reduces both the network signaling overhead and the computational complexity.
In this article, we have assumed that each sector has one transmit antenna and each user has one receive antenna. Zeroforcing beamforming is used as the network MIMO joint transmission scheme. In future work, multiple antennas at both the sector side and the user side will be investigated. Frequency reuse combined with interference precancellation techniques will be studied for managing the intercluster interference. More efficient joint scheduling and power control algorithms with advanced multiantenna joint transmission methods will be considered. The results in this article assume that perfect CSI is available at the cluster side for the users within the cluster. Investigation of imperfect CSI is of practical importance and it is a topic of our future work.
Abbreviations
3GPP: 3rd generation partnership project; BER: bit error rate; BSs: base stations; CCU: clustercenter users; CDF: cumulative distribution function; CEU: clusteredge users; CFR1: cooperative frequency reuse Scheme 1; CFR2: cooperative frequency reuse Scheme 2; CoMP: coordinated multipoint transmission/reception; CSI: channel state information; CU: central unit; IPC: intercluster interference precancellation; LTE: long term evolution; MIMO: multiple input multiple output; OFDM: orthogonal frequency division multiplexing; OFDMA: orthogonal frequency division multiple access; Scheme1: resource allocation Scheme 1; Scheme2: resource allocation Scheme 2; SINR: signal to interference plus noise ratio; SNR: signal to noise ratio; UFR: universal frequency reuse.
Competing interests
The authors declare that they have no competing interests.
Algorithm 1. Proposed twostep resource allocation scheme
Step 1 Intercluster interference mitigation
1: In each cluster, divide the users into CCU and CEU.
2: Map CCU to the subchannels in frequency set G.
3: Map CEU to the subchannels in frequency subset f_{i }according to the predefined cooperative frequency reuse scheme (CFR1 or CFR2).
Step 2 Joint scheduling and power allocation
1: For each subchannel n, find all the users mapped to it.
2: For each feasible user set S(n) on subchannel n (S(n) ≤ B), derive the beamforming matrix by zeroforcing joint transmission.
3: Calculate the sum utility U_{n }(S(n)) of user set S(n), based on (14), (15) and (16). 4: Find the optimal user set S*(n) with the maximum sum utility U_{n }(S*(n)), and derive the corresponding transmit power for each user in S*(n) by (14).
Acknowledgements
This work has been supported by the EU FP7 project INFSOICT247223 ARTIST4G and the Swedish Agency for Innovation Systems (VINNOVA). C. Botella's work is supported by the Spanish MEC Grants CONSOLIDERINGENIO 2010 CSD200800010 "COMONSENS" and COSIMA TEC201019545C0401. We would like to acknowledge the contributions of our colleagues. We also thank the anonymous reviewers for their useful comments.
References

3GPP TR 36.814 v9.0.0, Further Advancements for EUTRA Physical Layer Aspects

G Boudreau, J Panicker, N Guo, R Chang, N Wang, S Vrzic, Interference coordination and cancellation for 4G networks. IEEE Commun Mag 47(4), 74–81 (2009)

M Karakayali, G Foschini, R Valenzuela, Network coordination for spectrally efficient communications in cellular systems. IEEE Trans Wirel Commun 13(4), 56–61 (2006). Publisher Full Text

H Skjevling, D Gesbert, A Hjørungnes, Lowcomplexity distributed multibase transmission and scheduling. EURASIP J Adv Signal Process (2008)

A Tölli, M Codreanu, M Juntti, Cooperative MIMOOFDM cellular system with soft handover between distributed base station antennas. IEEE Trans Wirel Commun 7(4), 1428–1440 (2008)

S Khattak, W Rave, G Fettweis, Distributed iterative multiuser detection through base station cooperation. EURASIP J Wirel Commun Netw 2008, 1–15 (2008)

C Botella, G Pinero, A Gonzalez, M De Diego, Coordination in a multicell multiantenna multiuser WCDMA system: beamforming approach. IEEE Trans Wirel Commun 7(11), 4479–4485 (2008)

S Jing, DNC Tse, JB Soriaga, J Hou, JE Smee, R Padovani, Multicell downlink capacity with coordinated processing. EURASIP J Wirel Commun Netw 2008, 1–19 (2008)

H Huang, M Trivellato, A Hottinen, M Shafi, PJ Smith, R Valenzuela, Increasing downlink cellular throughput with limited network MIMO coordination. IEEE Trans Wirel Commun 8(6), 2983–2989 (2009)

F Boccardi, H Huang, Limited downlink network coordination in cellular networks. Proceedings of the 18th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (Athens, Greece, 2007), pp. 1–5

P Marsch, G Fettweis, A framework for optimizing the downlink performance of distributed antenna systems under a constrained backhaul. Proceedings of the 13th European Wireless Conference (Paris, France, 2007), pp. 1–5

S Venkatesan, Coordinating base stations for greater uplink spectral efficiency in a cellular network. Proceedings of the 18th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (Athens, Greece, 2007), pp. 1–5

P Marsch, G Fettweis, A decentralized optimization approach to backhaulconstrained distributed antenna systems. Proceedings of the 16th IST Mobile and Wireless Communications Summit (Budapest, Hungary, 2007), pp. 1–5

J Zhang, R Chen, JG Andrews, A Ghosh, RW Heath, Networked MIMO with clustered linear precoding. IEEE Trans Wirel Commun 8(4), 1910–1921 (2009)

A Papadogiannis, D Gesbert, E Hardouin, A dynamic clustering approach in wireless networks with multicell cooperative processing. Proceedings of the IEEE International Conference on Communications (Beijing, China, 2008), pp. 4033–4037

F Boccardi, H Huang, A Alexiou, Network MIMO with reduced backhaul requirements by MAC coordination. Proceedings of Asilomar Conference on Signals, Systems and Computers (Pacific Grove, California, 2008), pp. 1125–1129

S Zhou, J Gong, Z Niu, Y Jia, P Yang, A decentralized framework for dynamic downlink base station cooperation. Proceedings of the IEEE Global Telecommunications Conference (Hawaii, USA, 2009), pp. 1–6

J Li, X Xu, X Chen, X Tao, T Svensson, C Botella, Downlink radio resource allocation for coordinated cellular OFDMA networks. IEICE Trans Commun E93.B(4), 3480–3488 (2010)

A GarciaArmada, M SanchezFernandez, R Corvaja, Waterfilling schemes for zeroforcing coordinated base station transmission. Proceedings of the IEEE Global Telecommunications Conference (Hawaii, USA, 2009), pp. 1–5

B Luo, Q Cui, H Wang, X Tao, Optimal joint waterfilling for coordinated transmission over frequencyselective fading channels. IEEE Commun Lett 15(2), 190–192 (2011)

N Reider, A Racz, G Fodor, On scheduling and power control in multicell coordinated clusters. Proceedings of the IEEE Global Telecommunications Conference (Hawaii, USA, 2009), pp. 1–7

J Li, T Svensson, C Botella, T Eriksson, X Xu, X Chen, Joint scheduling and power control in coordinated multipoint clusters. Proceedings of the 74th IEEE Vehicular Technology Conference (San Francisco, USA, 2011), pp. 1–5

J Li, X Chen, C Botella, T Svensson, T Eriksson, Resource allocation for OFDMA systems with multicell joint transmission. Proceedings of the IEEE international workshop on Signal Processing Advances in Wireless Commun (Çeşme, Turkey, 2012), pp. 1–5

K Maruta, T Maruyama, A Ohta, M Nakatsugawa, Intercluster interference canceller for multiuser MIMO distributed antenna systems. Proceedings of the 20th IEEE Personal, Indoor and Mobile Radio Communications (Tokyo, Japan, 2009), pp. 3079–3083

S Kaviani, W Krzymien, Multicell scheduling in network MIMO. Proceedings of the IEEE Global Telecommunications Conference (Miami, Florida, USA, 2010), pp. 1–5

E Katranaras, M Imran, R Hoshyar, Sumrate of linear cellular systems with clustered joint processing. Proceedings of the 69th IEEE Vehicular Technology Conference (Barcelona, Spain, 2009), pp. 1–5

G Caire, S Ramprashad, H Papadopoulos, C Pepin, CEW Sundberg, Multiuser MIMO downlink with limited intercell cooperation: approximate interference alignmnent in time, frequency and space. Proc 46th Annual Allerton Conference on Communication, Control, and Computing (Allerton, Chicago, 2008), pp. 730–737

J Li, H Zhang, X Xu, X Tao, T Svensson, C Botella, B Liu, A novel frequency reuse scheme for coordinated multipoint transmission. Proceedings of the 71th IEEE Vehicular Technology Conference (Taipei, Taiwan, 2010), pp. 1–5

J Li, X Xu, H Zhang, X Tao, T Svensson, C Botella, X Chen, Multibeam cooperative frequency reuse for coordinated multipoint transmission. J China Univ Posts Telecommun 13(6), 11–18 (2010)

A Goldsmith, SG Chua, Variablerate variablepower MQAM for fading channels. IEEE Trans Commun 45(10), 1218–1230 (1997). Publisher Full Text

G Song, Y Li, Crosslayer optimization for OFDM wireless networkspart II: algorithm development. IEEE Trans Wirel Commun 4(2), 625–634 (2005)

R Abildgaard Olesen, M Sternad, D Aronsson, Measurementbased evaluation of robust linear precoding for downlink CoMP. Proc of the IEEE International Conference on Communications (Ottawa, Canada, 2012), pp. 1–6

K Dimou, M Wang, Y Yang, M Kazmi, A Larmo, J Pettersson, W Muller, Y Timner, Handover within 3GPP LTE: design principles and performance. Proceedings of the 70th IEEE Vehicular Technology Conference (Anchorage, Alaska, USA, 2009), pp. 1–5

3GPP TR 36.942 V9.1.0, Evolved Universal Terrestrial Radio Access (EUTRA) (Radio Frequency (RF) system scenarios, 2010)