Semi-Markov Offline Reinforcement Learning for Healthcare Paper • 2203.09365 • Published Mar 17, 2022
Medical Dead-ends and Learning to Identify High-risk States and Treatments Paper • 2110.04186 • Published Oct 8, 2021
Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning Paper • 1906.00572 • Published Jun 3, 2019
Policy Networks with Two-Stage Training for Dialogue Systems Paper • 1606.03152 • Published Jun 10, 2016
Improving Observability of Stochastic Complex Networks under the Supervision of Cognitive Dynamic Systems Paper • 1412.6162 • Published Nov 7, 2014
Systematic Rectification of Language Models via Dead-end Analysis Paper • 2302.14003 • Published Feb 27, 2023