TY - JOUR
T1 - Reinforcement-learning calibration of coherent-state receivers on variable-loss optical channels
AU - Bilkis, M.
AU - Rosati, Matteo
AU - Calsamiglia, John
N1 - Funding Information:
ACKNOWLEDGMENT MR acknowledges J. Nötzel and A. Cacioppo for suggesting the optimization of a Dolinar receiver in the framework of compound channels, see [21]. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 845255. MB and JCC acknowledges support from the Spanish Agencia Estatal de Investigación, project PID2019-107609GB-I00, Generalitat de Catalunya CIRIT 2017-SGR-1127.
Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - We study the problem of calibrating a quantum receiver for optical coherent states when transmitted on a quantum optical channel with variable transmissivity, a common model for long-distance optical-fiber and free/deep-space optical communication [1]-[7]. We optimize the error probability of legacy adaptive receivers, such as Kennedy's and Dolinar's [8], [9], on average with respect to the channel transmissivity distribution. We then compare our results with the ultimate error probability attainable by a general quantum device, computing the Helstrom bound for mixtures of coherent-state hypotheses, for the first time to our knowledge, and with homodyne measurements. With these tools, we first analyze the simplest case of two different transmissivity values; we find that the strategies adopted by adaptive receivers exhibit strikingly new features as the difference between the two transmissivities increases. Finally, we employ a recently introduced library of shallow reinforcement learning methods [10], demonstrating that an intelligent agent can learn the optimal receiver setup from scratch by training on repeated communication episodes on the channel with variable transmissivity and receiving rewards if the coherent-state message is correctly identified.
AB - We study the problem of calibrating a quantum receiver for optical coherent states when transmitted on a quantum optical channel with variable transmissivity, a common model for long-distance optical-fiber and free/deep-space optical communication [1]-[7]. We optimize the error probability of legacy adaptive receivers, such as Kennedy's and Dolinar's [8], [9], on average with respect to the channel transmissivity distribution. We then compare our results with the ultimate error probability attainable by a general quantum device, computing the Helstrom bound for mixtures of coherent-state hypotheses, for the first time to our knowledge, and with homodyne measurements. With these tools, we first analyze the simplest case of two different transmissivity values; we find that the strategies adopted by adaptive receivers exhibit strikingly new features as the difference between the two transmissivities increases. Finally, we employ a recently introduced library of shallow reinforcement learning methods [10], demonstrating that an intelligent agent can learn the optimal receiver setup from scratch by training on repeated communication episodes on the channel with variable transmissivity and receiving rewards if the coherent-state message is correctly identified.
UR - http://www.scopus.com/inward/record.url?scp=85123433107&partnerID=8YFLogxK
U2 - 10.1109/ITW48936.2021.9611396
DO - 10.1109/ITW48936.2021.9611396
M3 - Article
AN - SCOPUS:85123433107
ER -