Convergence of bootstrapping results, `nan` values for small sample sizes #7

privong · 2020-06-22T19:42:28Z

If the sample size is small (or the number of bootstraps is large), the correlation coefficients can be undefined and return nan values. The use of np.percentile() then returns nan from pymccorrelation(). If there's many nan values this probably suggests the bootstrapping is not well-converged. When looking at the mock dataset to check recovery (#4), the convergence of bootstrapping would be good to consider.

Ultimately, decide if nanpercentile() should be used, optionally with a warning if the size of the dataset is too small for reliable bootstrap error estimation.

There is probably statistics literature about this too...

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convergence of bootstrapping results, `nan` values for small sample sizes #7

Convergence of bootstrapping results, `nan` values for small sample sizes #7

privong commented Jun 22, 2020

Convergence of bootstrapping results, nan values for small sample sizes #7

Convergence of bootstrapping results, nan values for small sample sizes #7

Comments

privong commented Jun 22, 2020

Convergence of bootstrapping results, `nan` values for small sample sizes #7

Convergence of bootstrapping results, `nan` values for small sample sizes #7