[Paper Review] Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Levine, Sergey. “Reinforcement learning and control as probabilistic inference: Tutorial and review.” arXiv preprint arXiv:1805.00909 (2018).
Levine, Sergey. “Reinforcement learning and control as probabilistic inference: Tutorial and review.” arXiv preprint arXiv:1805.00909 (2018).
Lee, Lisa, et al. “Efficient exploration via state marginal matching.” arXiv preprint arXiv:1906.05274 (2019).
Haarnoja, Tuomas, et al. “Soft actor-critic algorithms and applications.” arXiv preprint arXiv:1812.05905 (2018).
Fujimoto, Scott, Herke Van Hoof, and David Meger. “Addressing function approximation error in actor-critic methods.” arXiv preprint arXiv:1802.09477 (2018).
Silver, David, et al. “Deterministic policy gradient algorithms.” 2014.
Dai, Zihang, et al. “Transformer-xl: Attentive language models beyond a fixed-length context.” arXiv preprint arXiv:1901.02860 (2019).
Zhou, Allan, Tom Knowles, and Chelsea Finn. “Meta-learning symmetries by reparameterization.” arXiv preprint arXiv:2007.02933 (2020).
BWilson, James T., et al. “Efficiently sampling functions from Gaussian process posteriors.” arXiv preprint arXiv:2002.09309 (2020).
Baevski, Alexei, Steffen Schneider, and Michael Auli. “vq-wav2vec: Self-supervised learning of discrete speech representations.” arXiv preprint arXiv:1910.05...
Schneider, Steffen, et al. “wav2vec: Unsupervised pre-training for speech recognition.” arXiv preprint arXiv:1904.05862 (2019).
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Gu, Shihao, Bryan T. Kelly, and Dacheng Xiu. “Autoencoder asset pricing models.” Available at SSRN (2019).
Levin, Asriel E. “Stock selection via nonlinear multi-factor models.” Advances in Neural Information Processing Systems. 1996.
Karapandza, Rasa. “Stock returns and future tense language in 10-K reports.” Journal of Banking & Finance 71 (2016): 50-61.
Heaton, J. B., N. G. Polson, and Jan Hendrik Witte. “Deep learning for finance: deep portfolios.” Applied Stochastic Models in Business and Industry 33.1 (20...
Gary Antonacci. McGraw-Hill Education. 2014
Gray, Wesley R., Vogel, Jack R. Wiley. 2016