Fast Reinforcement Learning
Bottleneck in applying RL:
Need great amounts of data.
Dialog data is typically very expensive to get.
Intuitive solution:
Make more efficient use of training data.
Generalize learning over similar states/actions.
Previous slide
Next slide
Back to first slide
View graphic version