In this version, some theoretical background of performance bounds was added. The results obtained with the NN are compared to the bounds. In addition, a short listing of future development ideas was initiated in this release.
At the end of the training process, average values (BLER and losses) of episodes are plotted.
This is a bug-squashing version. Actor loss gradients are no longer backpropagated through critic NN. Some other smaller fixes are also included.
This version has a few major improvements.
In this version, tracking training activity by BLER and loss plots is introduced.
Added SNR as input to critic neural network to improve accuracy of value function estimation.
This version has only minor improvements to the model, but more documentation added.
This version fixes the exploding base matrices by normalizing them during training. Also, some code style improvements are included.
In this version, the following features are added:
The working features include: