Evaluating a black-box algorithm

With Rina Foygel Barber (University of Chicago)

Evaluating a black-box algorithm: stability, risk, and model comparisons

When we run a complex algorithm on real data, it is standard to use a holdout set, or a cross-validation strategy, to evaluate its behavior and performance. When we do so, are we learning information about the algorithm itself, or only about the particular fitted model(s) that this particular data set produced? In this talk, we will establish fundamental hardness results on the problem of empirically evaluating properties of a black-box algorithm, such as its stability and its average risk, in the distribution-free setting.
This work is joint with Yuetian Luo and Byol Kim.

Add to your calendar or Include in your list