later) observes the network state (e.g., eNB load, VNF utilization, traffic
prediction), selects an action (e.g., adjust uplink/downlink resource
allocation, scale VNFs), and receives a reward based on the network's
performance. This cycle repeats, allowing the agent to refine its control
policy over time.
2. Mathematical Model and Algorithm Explanation
Let’s delve into the mathematics. The heart of DACC is the DQN. DQNs
use a technique called Q-learning, which estimates the "quality" (Q-
value) of taking a particular action in a specific state. It aims to find the
optimal policy, assigning highest Q-values to the best actions in each
possible state and iteratively improving this estimation.
The ARIMA model, used for traffic prediction, is represented by the
equation:
(1−φ_1??????−1−…−φ_????????????−??????)(1−??????−1)^?????? = (1−??????_1??????−1−…−??????_????????????−??????)/(1−??????−1)
Don't let the symbols intimidate you. Picture it this way: the left side
represents the future traffic based on its past values (autoregressive –
φ), and the differencing (d) accounts for trends. The right side reflects
the moving average (θ), smoothing out fluctuations. The goal is to
identify the best values for φ, θ, p, d, and q to accurately predict future
traffic demands. The research uses this predicted traffic to proactively
adjust resource allocation.
The Reward Function, R = ??????⋅(Throughput - Throughput0) -
??????⋅PacketLoss, is crucial. This equation dictates what the agent is trying
to maximize. Throughput (R) is a positive reward, encouraging high data
rates. Packet Loss is a penalty, discouraging congestion. Alpha (??????) and
Beta (??????) are weights that control the relative importance of throughput
and packet loss – a tuning parameter optimized via Bayesian
Optimization, explained later.
3. Experiment and Data Analysis Method
The experiments were conducted using NS-3, a widely-used network
simulator. NS-3 allows for creating realistic network environments
without needing hardware. The simulated network mirrored a typical
urban 5G NSA deployment – multiple eNBs connected to VNFs (Virtual
Network Functions) like the MME, S-GW, and P-GW which handle core
network functionality. Tests were run under varying traffic load profiles: