[Paper Review] Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Levine, Sergey. “Reinforcement learning and control as probabilistic inference: Tutorial and review.” arXiv preprint arXiv:1805.00909 (2018).
Levine, Sergey. “Reinforcement learning and control as probabilistic inference: Tutorial and review.” arXiv preprint arXiv:1805.00909 (2018).
Lee, Lisa, et al. “Efficient exploration via state marginal matching.” arXiv preprint arXiv:1906.05274 (2019).
Haarnoja, Tuomas, et al. “Soft actor-critic algorithms and applications.” arXiv preprint arXiv:1812.05905 (2018).
Fujimoto, Scott, Herke Van Hoof, and David Meger. “Addressing function approximation error in actor-critic methods.” arXiv preprint arXiv:1802.09477 (2018).
Silver, David, et al. “Deterministic policy gradient algorithms.” 2014.
Dai, Zihang, et al. “Transformer-xl: Attentive language models beyond a fixed-length context.” arXiv preprint arXiv:1901.02860 (2019).
Zhou, Allan, Tom Knowles, and Chelsea Finn. “Meta-learning symmetries by reparameterization.” arXiv preprint arXiv:2007.02933 (2020).
BWilson, James T., et al. “Efficiently sampling functions from Gaussian process posteriors.” arXiv preprint arXiv:2002.09309 (2020).
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Gu, Shihao, Bryan T. Kelly, and Dacheng Xiu. “Autoencoder asset pricing models.” Available at SSRN (2019).
Levin, Asriel E. “Stock selection via nonlinear multi-factor models.” Advances in Neural Information Processing Systems. 1996.
Heaton, J. B., N. G. Polson, and Jan Hendrik Witte. “Deep learning for finance: deep portfolios.” Applied Stochastic Models in Business and Industry 33.1 (20...
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Gu, Shihao, Bryan T. Kelly, and Dacheng Xiu. “Autoencoder asset pricing models.” Available at SSRN (2019).
Gary Antonacci. McGraw-Hill Education. 2014
Gray, Wesley R., Vogel, Jack R. Wiley. 2016
Levin, Asriel E. “Stock selection via nonlinear multi-factor models.” Advances in Neural Information Processing Systems. 1996.
Karapandza, Rasa. “Stock returns and future tense language in 10-K reports.” Journal of Banking & Finance 71 (2016): 50-61.
Heaton, J. B., N. G. Polson, and Jan Hendrik Witte. “Deep learning for finance: deep portfolios.” Applied Stochastic Models in Business and Industry 33.1 (20...
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Gu, Shihao, Bryan T. Kelly, and Dacheng Xiu. “Autoencoder asset pricing models.” Available at SSRN (2019).
Gary Antonacci. McGraw-Hill Education. 2014
Gray, Wesley R., Vogel, Jack R. Wiley. 2016
Levin, Asriel E. “Stock selection via nonlinear multi-factor models.” Advances in Neural Information Processing Systems. 1996.
Karapandza, Rasa. “Stock returns and future tense language in 10-K reports.” Journal of Banking & Finance 71 (2016): 50-61.
Levine, Sergey. “Reinforcement learning and control as probabilistic inference: Tutorial and review.” arXiv preprint arXiv:1805.00909 (2018).
Lee, Lisa, et al. “Efficient exploration via state marginal matching.” arXiv preprint arXiv:1906.05274 (2019).
Haarnoja, Tuomas, et al. “Soft actor-critic algorithms and applications.” arXiv preprint arXiv:1812.05905 (2018).
Fujimoto, Scott, Herke Van Hoof, and David Meger. “Addressing function approximation error in actor-critic methods.” arXiv preprint arXiv:1802.09477 (2018).
Silver, David, et al. “Deterministic policy gradient algorithms.” 2014.
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Gary Antonacci. McGraw-Hill Education. 2014
Gray, Wesley R., Vogel, Jack R. Wiley. 2016
Levin, Asriel E. “Stock selection via nonlinear multi-factor models.” Advances in Neural Information Processing Systems. 1996.
Karapandza, Rasa. “Stock returns and future tense language in 10-K reports.” Journal of Banking & Finance 71 (2016): 50-61.
Baevski, Alexei, Steffen Schneider, and Michael Auli. “vq-wav2vec: Self-supervised learning of discrete speech representations.” arXiv preprint arXiv:1910.05...
Schneider, Steffen, et al. “wav2vec: Unsupervised pre-training for speech recognition.” arXiv preprint arXiv:1904.05862 (2019).
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Gu, Shihao, Bryan T. Kelly, and Dacheng Xiu. “Autoencoder asset pricing models.” Available at SSRN (2019).
Levin, Asriel E. “Stock selection via nonlinear multi-factor models.” Advances in Neural Information Processing Systems. 1996.
Lee, Lisa, et al. “Efficient exploration via state marginal matching.” arXiv preprint arXiv:1906.05274 (2019).
Haarnoja, Tuomas, et al. “Soft actor-critic algorithms and applications.” arXiv preprint arXiv:1812.05905 (2018).
Fujimoto, Scott, Herke Van Hoof, and David Meger. “Addressing function approximation error in actor-critic methods.” arXiv preprint arXiv:1802.09477 (2018).
Silver, David, et al. “Deterministic policy gradient algorithms.” 2014.
Wei, Haoran, et al. “Model-based Reinforcement Learning for Predictions and Control for Limit Order Books.” arXiv preprint arXiv:1910.03743 (2019).
Gu, Shihao, Bryan T. Kelly, and Dacheng Xiu. “Autoencoder asset pricing models.” Available at SSRN (2019).
Heaton, J. B., N. G. Polson, and Jan Hendrik Witte. “Deep learning for finance: deep portfolios.” Applied Stochastic Models in Business and Industry 33.1 (20...
Baevski, Alexei, Steffen Schneider, and Michael Auli. “vq-wav2vec: Self-supervised learning of discrete speech representations.” arXiv preprint arXiv:1910.05...
Schneider, Steffen, et al. “wav2vec: Unsupervised pre-training for speech recognition.” arXiv preprint arXiv:1904.05862 (2019).
Heaton, J. B., N. G. Polson, and Jan Hendrik Witte. “Deep learning for finance: deep portfolios.” Applied Stochastic Models in Business and Industry 33.1 (20...
Karapandza, Rasa. “Stock returns and future tense language in 10-K reports.” Journal of Banking & Finance 71 (2016): 50-61.
BWilson, James T., et al. “Efficiently sampling functions from Gaussian process posteriors.” arXiv preprint arXiv:2002.09309 (2020).
BWilson, James T., et al. “Efficiently sampling functions from Gaussian process posteriors.” arXiv preprint arXiv:2002.09309 (2020).
Zhou, Allan, Tom Knowles, and Chelsea Finn. “Meta-learning symmetries by reparameterization.” arXiv preprint arXiv:2007.02933 (2020).
Zhou, Allan, Tom Knowles, and Chelsea Finn. “Meta-learning symmetries by reparameterization.” arXiv preprint arXiv:2007.02933 (2020).
Dai, Zihang, et al. “Transformer-xl: Attentive language models beyond a fixed-length context.” arXiv preprint arXiv:1901.02860 (2019).
Levine, Sergey. “Reinforcement learning and control as probabilistic inference: Tutorial and review.” arXiv preprint arXiv:1805.00909 (2018).
Levine, Sergey. “Reinforcement learning and control as probabilistic inference: Tutorial and review.” arXiv preprint arXiv:1805.00909 (2018).