Dynamic adaptation of quantization thresholds for soft-decision viterbi decoding with a reinforcement learning neural network

Wu, Yu-jhih; Alston, Michael D.; Chau, Paul M.

doi:10.1007/BF01581961

Published: 01 June 1993

Volume 6, pages 77–84 (1993)

Journal of VLSI signal processing systems for signal, image and video technology

Abstract

Two reinforcement learning neural network architectures that enhance the performance of a soft-decision Viterbi decoder used for forward error correction in a digital communication system have been investigated and compared. Each reinforcement learning neural network is designed to work as a co-processor to a demodulator, dynamically adapting the soft quantization thresholds toward optimal settings in varying noise environments. The soft quantization thresholds of the demodulator are adjusted according to the previous performance of the Viterbi decoder, with updates occurring at fixed intervals (every 200 decoded bits out of the Viterbi decoder). To facilitate implementation in digital hardware, each weight of the neural network and related parameters are specified as binary numbers. Computer simulation results demonstrate that, on average, the performance of a Viterbi decoder on an AWGN channel with nonuniformly spaced soft-decision thresholds dynamically adjusted by these neural networks is better than the performance of a Viterbi decoder with uniformly spaced thresholds. This approach may be used for a variety of other digital communication applications such as channel estimation, adaptive equalization, and signal acquisition.
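
The threshold mechanism described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not give the network architectures, so the update step here is a generic callback, and `toy_rule` is a purely hypothetical stand-in for the reinforcement learning network. The 8-level quantizer, the BPSK mapping, the function names, and the choice to count the update interval in received samples rather than decoded bits are all assumptions made for illustration.

```python
import bisect
import random

UPDATE_INTERVAL = 200  # the abstract updates every 200 decoded bits; here
                       # the interval counts received samples (assumption)


def quantize(sample, thresholds):
    """Map a received sample to a soft-decision level 0..len(thresholds).

    thresholds must be sorted ascending; a sample equal to a threshold
    falls into the level above it.
    """
    return bisect.bisect_right(thresholds, sample)


def uniform_thresholds(step, levels=8):
    """Return levels-1 uniformly spaced thresholds symmetric about zero."""
    half = (levels - 2) / 2
    return [step * (k - half) for k in range(levels - 1)]


def awgn_bpsk(bits, sigma, rng):
    """Send bits as +1/-1 and add Gaussian noise of standard deviation sigma."""
    return [(1.0 if b else -1.0) + rng.gauss(0.0, sigma) for b in bits]


def toy_rule(thresholds, block_levels, target=0.25, gain=0.05):
    """Hypothetical stand-in for the learning network, NOT the paper's rule.

    Widens the thresholds when too many samples land in the two outermost
    levels, and narrows them otherwise.
    """
    top = len(thresholds)
    saturated = sum(1 for q in block_levels if q in (0, top)) / len(block_levels)
    scale = 1.0 + gain if saturated > target else 1.0 - gain
    return [t * scale for t in thresholds]


def adapt_loop(samples, thresholds, update_rule, interval=UPDATE_INTERVAL):
    """Quantize samples block by block, updating thresholds after each block."""
    levels = []
    for start in range(0, len(samples), interval):
        block_levels = [quantize(x, thresholds)
                        for x in samples[start:start + interval]]
        levels.extend(block_levels)
        thresholds = update_rule(thresholds, block_levels)
    return levels, thresholds
```

Any sorted list works as a threshold set, so a nonuniform quantizer is just a different list passed to `quantize`, for example the illustrative values `[-1.2, -0.7, -0.3, 0.0, 0.3, 0.7, 1.2]`. In the scheme the abstract describes, the feedback comes from the previous performance of the Viterbi decoder; the sketch leaves that signal out, and a faithful version would pass it to `update_rule` in place of the quantized levels.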

Author information

Authors and Affiliations

Department of Electrical & Computer Engineering, University of California, San Diego, La Jolla, CA 92093-0407
Yu-jhih Wu, Michael D. Alston & Paul M. Chau

About this article

Cite this article

Wu, Yj., Alston, M.D. & Chau, P.M. Dynamic adaptation of quantization thresholds for soft-decision viterbi decoding with a reinforcement learning neural network. J VLSI Sign Process Syst Sign Image Video Technol 6, 77–84 (1993). https://doi.org/10.1007/BF01581961

Received: 10 October 1991
Revised: 10 March 1992
Published: 01 June 1993
Issue Date: June 1993
DOI: https://doi.org/10.1007/BF01581961
