Analysis class needs to be beefed up with something actually useful #105

Closed
ardunn opened this issue Oct 24, 2018 · 2 comments

Comments

@ardunn
Contributor

ardunn commented Oct 24, 2018

Given a PredictionPipeline object (or just a TPOT model and a feature list until I get the top-level classes working better), the Analysis class should give back a nice HTML report (or other format?) containing:

  • identification of outliers
  • identification of the most important features
  • partial dependence plots w/ skater
  • LIME plots w/ skater
  • details of the features dropped/features retained
  • breakdown of the featurization time/fitting time/etc.
  • t-SNE plot based on features and labelled by material formula/phase

@Doppe1g4nger @ADA110 any other ideas on cool things to include here?
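
Two of the items above (feature importance and the t-SNE plot) can be roughed out with scikit-learn alone. This is only a minimal sketch under stated assumptions: the fitted model is assumed to expose `feature_importances_` (a TPOT pipeline may not), the helper `rank_features`, the synthetic data, and the feature names are all hypothetical, and none of this is the project's actual Analysis code.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.manifold import TSNE


def rank_features(model, feature_names):
    """Return (name, importance) pairs sorted from most to least important.

    Assumes `model` is already fitted and has a `feature_importances_` attribute.
    """
    importances = np.asarray(model.feature_importances_)
    order = np.argsort(importances)[::-1]
    return [(feature_names[i], float(importances[i])) for i in order]


# Synthetic stand-in for a featurized dataset (hypothetical feature names).
rng = np.random.RandomState(0)
X = rng.normal(size=(200, 4))
y = 5.0 * X[:, 0] + 0.1 * rng.normal(size=200)  # target depends only on feat_a
feature_names = ["feat_a", "feat_b", "feat_c", "feat_d"]

model = RandomForestRegressor(n_estimators=50, random_state=0).fit(X, y)
ranking = rank_features(model, feature_names)

# 2D embedding of the feature matrix; each row could be plotted as a point
# and labelled by material formula or phase.
embedding = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X)
```

The `ranking` list could feed a table in the report, and `embedding` could be passed to any plotting library for the scatter plot.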

@albalu
Contributor

albalu commented Oct 24, 2018

For outliers, there are one-class SVM and isolation forest, which are basically what they sound like: one-class classification methods. An isolation forest builds an ensemble of randomly split trees, and the points that get isolated in unusually few splits can be flagged as outliers. This could be used as a default. I wanted to implement it but don't think I will have time :| It is already implemented in sklearn, so someone just needs to integrate it into the workflow.
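
As a rough illustration of the sklearn route mentioned above, here is a minimal sketch using `IsolationForest` and `OneClassSVM`. The helper name `flag_outliers`, the synthetic data, and the `contamination` and `nu` values are assumptions picked for the example, not settings taken from this project.

```python
import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.svm import OneClassSVM

# Synthetic data: a Gaussian blob plus three planted, far-away points.
rng = np.random.RandomState(42)
inliers = rng.normal(loc=0.0, scale=1.0, size=(200, 2))
planted = np.array([[8.0, 8.0], [-9.0, 7.5], [10.0, -10.0]])
X = np.vstack([inliers, planted])  # planted points sit at indices 200-202


def flag_outliers(X, contamination=0.02, random_state=0):
    """Return the row indices that an isolation forest labels as outliers."""
    forest = IsolationForest(
        n_estimators=100,
        contamination=contamination,  # assumed fraction of outliers
        random_state=random_state,
    )
    labels = forest.fit_predict(X)  # +1 for inliers, -1 for outliers
    return np.where(labels == -1)[0]


outlier_idx = flag_outliers(X)

# One-class SVM alternative; uses the same +1 / -1 labelling convention.
svm_labels = OneClassSVM(nu=0.05, gamma="scale").fit_predict(X)
```

Both estimators share the `fit_predict` interface, so swapping one for the other as the default would be a one-line change.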

@Doppe1g4nger
Contributor

Also, we should add a tab describing how the selected model works, at least at a shallow level that is not aimed at experts.

@ardunn ardunn added the ugrads label Dec 11, 2018
@ardunn ardunn changed the title analysis ideas - wip Analysis class needs to be beefed up with something actually useful Jan 26, 2019
@ardunn ardunn closed this as completed in cbe9437 Feb 8, 2019

4 participants