What to evaluate?
Usability (can kids use it?)
- 1993 Wizard of Oz experiments
- Lab and in-school user tests of successive versions
Assistiveness (do kids perform better with than without?)
- 1994 Reading Coach boosted comprehension by ~20%
- But: evaluation obtrusive, costly, sparse, subjective, noisy
Learning (do kids improve over time?)
- Within tutor: this talk
- On unassisted reading: pre-/post-test by school
- More than with alternatives: future studies