More pipeline diagnostics #239

janosh · 2019-10-08T20:21:08Z

Besides #238, I think the diagnostics into a fitted pipe could be further improved. In particular, it's too difficult to determine which model actually performed best.

ardunn · 2019-10-08T20:30:25Z

I agree, it could definitely be organized better.

If you're just interested in the underlying tpot model, you can get it with:

pipe.learner.best_pipeline

If you're interested in the best "entire" pipeline in terms of going from material object to prediction (including featurization, cleaning, reduction, learning), that is a bit more difficult, because the fitted matpipe is the best pipeline lol.

My thoughts are to either add another method which only returns the most important information. E.g., which featurizers were used, what are the cleaning rules generally, what is the best autoML pipeline, etc.

janosh · 2019-10-08T20:39:04Z

My thoughts are to either add another method which only returns the most important information. E.g., which featurizers were used, what are the cleaning rules generally, what is the best autoML pipeline, etc.

I think that would be nice!

It took me some time to discover that pipe.learner.best_pipeline and pipe.learner.best_models was what I was looking for. I noticed, however, that these aren't available on saved and loaded pipes.

ardunn · 2019-10-08T20:47:10Z

In the case of tpot pipelines saved and loaded, you are correct, because pickling tpot objects doesn't work last time I checked (may have been updated though). Current behavior is to select the best pipeline from the tpot object and save that single sklearn Pipeline as the backend (similar to a SinglePipelineAdaptor learner object). So the entire backend becomes the "best pipeline" and unfortunately, all the other, previously tried models are lost :/

Tl;dr: you can open up the best pipeline from a loaded (toot-backend) pipe using:

pipe.learner.backend

Only the best pipeline is saved. The best_models is not saved.

I've opened an issue addressing this #241

ardunn · 2019-10-12T04:39:42Z

related to #221

ardunn added the enhancement label Oct 12, 2019

ardunn mentioned this issue Oct 14, 2019

serialize backend and test improvements #246

Merged

ardunn closed this as completed in #246 Oct 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More pipeline diagnostics #239

More pipeline diagnostics #239

janosh commented Oct 8, 2019

ardunn commented Oct 8, 2019

janosh commented Oct 8, 2019

ardunn commented Oct 8, 2019

ardunn commented Oct 12, 2019

More pipeline diagnostics #239

More pipeline diagnostics #239

Comments

janosh commented Oct 8, 2019

ardunn commented Oct 8, 2019

janosh commented Oct 8, 2019

ardunn commented Oct 8, 2019

ardunn commented Oct 12, 2019