sampler stats #95

mjhajharia · 2021-04-02T18:17:23Z

This notebook uses InferenceData and ArviZ for plotting. Explains multi-step sampler_stats as well.
Changed sampler_stats description according to - https://arviz-devs.github.io/arviz/schema/schema.html#sample-stats

Addresses issue #46

review-notebook-app · 2021-04-02T18:17:27Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

mjhajharia · 2021-04-02T18:18:37Z

deleted PR Samplerstats(WIP) #94 and created this, will take care of comments made there and make a commit, sorry about the inconvenience, i tried resetting to an initial commit and that messed up the previous PR so started it again

mjhajharia · 2021-04-02T19:16:45Z

I tried trace.sample_stats["tree_depth"].plot(row="chain", ls="none", marker=".", alpha=.3)
but I think because of the number of points or something, it still resembles a line plot, visually looks way better though.

@OriolAbril also can you mark the useless plot again, since that pr has 0 commits i cannot see the location of those comments

I'm not very sure if this plot is better than the previous one:

previous being:

finally im so sorry i made these comments on the issue you referenced the PR in, instead of here, deleted those and put them here

OriolAbril · 2021-04-02T20:02:25Z

I tried trace.sample_stats["tree_depth"].plot(row="chain", ls="none", marker=".", alpha=.3)
but I think because of the number of points or something, it still resembles a line plot, visually looks way better though.

Let's go with col instead of hue then, but keep the alpha. tree_depth is an integer, and it has one value per draw, as there isn't much variation, it generates these visually looking flat lines, but you can see that on 1 there are some stray points. I think the scatterplot with col will look good enough.

The plot step_size bar is the one I find useless, we should just remove it and be done with it.

The comment about using plot_posterior was for the histograms for the acceptance rates, that you have already updated, not for this step size plot.

examples/diagnostics_and_criticism/sampler-stats.ipynb

OriolAbril · 2021-04-02T20:14:21Z

examples/diagnostics_and_criticism/sampler-stats.ipynb

@@ -14,29 +14,40 @@
  },


dims and coords seem to have been ignored, could be a bug in ArviZ. I'll look into that. Do change this for the plot anyway though, even if at least for now the labels are wrong. Using plot_posterior like we did above should work here too.

Reply via ReviewNB

okayy, could it be because we didn't refer to dims in step1 and step2

ArviZ ignores the dimensions for sample stats: https://github.com/arviz-devs/arviz/blob/main/arviz/data/io_pymc3.py#L323 it may be to avoid name collisions though.

do you think that could use some fixing, maybe an informative warning or error could be given during a name collision. Because in real time models and stuff diagnostics running well, following general practices of arviz/pymc3 seems to be actually very important

Yes, we could fix that in ArviZ and have it use coords and dims also in the sample stats, either with a try except or checking things more carefully. But this is very low priority, compound steps are not that common and we have to make sure to implement that in a way that it never backfires.

examples/diagnostics_and_criticism/sampler-stats.ipynb

mjhajharia · 2021-04-04T13:19:15Z

made some changes(experimental)

OriolAbril · 2021-04-04T22:04:14Z

@michaelosthege can you explain what are perf_counter_diff , process_time_diff and perf_counter_start? Or point us to an issue or PR about them?

mjhajharia · 2021-04-04T22:07:26Z

@michaelosthege can you explain what are perf_counter_diff , process_time_diff and perf_counter_start? Or point us to an issue or PR about them?

hi, maybe this comment wasn't notified or something because it was in the reviewnb thing,
#95 (comment) , but I found these descriptions in the NUTS sampler documentation

mjhajharia · 2021-04-04T22:08:51Z

i think it still doesn't open in the mobile app, so im copying the comment here:

@OriolAbril no problem at all! also I just checked and the items ive added are the exact same, I went through the code for sample_stats_to_array here , that clarified it for me. thanks anyway this looks like a helpful comment.

this is from the current commit on the notebook, in case you wanted to see(only tune needs to go i think)

mjhajharia · 2021-04-04T22:11:46Z

here is the relevant documentation

OriolAbril · 2021-04-04T22:12:04Z

Yeah, I am seeing it now, sorry about that. Thanks for looking into this.

OriolAbril · 2021-04-04T22:13:23Z

You are right that now all the relevant variables are present, only tune has to be removed.

Regaring the accept variable from metropolis, I am not sure it "deserves" so much analysis, but I still don't understand what it is. The analysis is great though

mjhajharia · 2021-04-04T22:20:42Z

You are right that now all the relevant variables are present, only tune has to be removed.

Regaring the accept variable from metropolis, I am not sure it "deserves" so much analysis, but I still don't understand what it is. The analysis is great though

thanks!! and yeah that is somewhat weird, all i could find in metropolis.py was:
accept = self.delta_logp(q, q0)

Also tune is a stat in metropolis.py but I don't see it in sample_stats, so I'm a little confused about whether it should be needed or not. I think it got removed because of the compound step thing, if it's there in a single model sampling for metropolis dist. then we might need it?

mjhajharia · 2021-04-04T22:25:39Z

I looked at the original notebook once again, and possibly the reason why they chose to show accept, was just that it was common in both the samplers, which might not be a good enough reason for an elaborate plot which also has outliers. I might be entirely wrong here, I'm just speculating.

OriolAbril · 2021-04-04T22:29:25Z

and yeah that is somewhat weird, all i could find in metropolis.py was:
accept = self.delta_logp(q, q0)

I took a look at that, now it make sense. In metropolis one generates a proposal q and then compares it's probability to the current sample q0 with alpha = P(q) / P(q0). If alpha > 1, that is, q has higher probability than q0 the proposal is always accepted, otherwise alpha is in the range 0, 1 so sample from the uniform distribution u is generated, if u>alpha the proposal is rejected, accepted otherwise. accept is alpha which is also sometimes called acceptance ratio

I looked at the original notebook once again, and possibly the reason why they chose to show accept, was just that it was common in both the samplers, which might not be a good enough reason for an elaborate plot which also has outliers. I might be entirely wrong here, I'm just speculating.

yes, this sounds right, our goal here should probably be to show that when using a compound step, the common variables in sample_stats will get an extra dimension, one per sampler.

mjhajharia · 2021-04-04T22:33:33Z

I took a look at that, now it make sense. In metropolis one generates a proposal q and then compares it's probability to the current sample q0 with alpha = P(q) / P(q0). If alpha > 1, that is, q has higher probability than q0 the proposal is always accepted, otherwise alpha is in the range 0, 1 so sample from the uniform distribution u is generated, if u>alpha the proposal is rejected, accepted otherwise. accept is alpha which is also sometimes called acceptance ratio

yeah, that makes sense, thanks for explaining, i just started reading up about it and noticed acceptance rate was important for metropolis hastings.

yes, this sounds right, our goal here should probably be to show that when using a compound step, the common variables in sample_stats will get an extra dimension, one per sampler.

In that case, the last plot is unnecessary, but possibly showing the range of accept on both the samplers, would make some sense?

OriolAbril · 2021-04-04T22:38:59Z

acceptance rate was important for metropolis hastings.

In metropolis it's acceptance ratio which is very different from the acceptance rate in hmc/nuts

In that case, the last plot is unnecessary, but possibly showing the range of accept on both the samplers, would make some sense?

Yes, I think that showing how to work with sample stats that have multiple steps is useful and won't be explained anywhere else probably

mjhajharia · 2021-04-04T22:42:16Z

In metropolis it's acceptance ratio which is very different from the acceptance rate in hmc/nuts

yes, my bad, ive been looking at nuts doc. over and over for the stats thing sort of fixated in my head.

Yes, I think that showing how to work with sample stats that have multiple steps is useful and won't be explained anywhere else probably

yes we can do that, also I think a detailed explanation could be added here as well - compound step notebook

OriolAbril · 2021-04-04T22:54:57Z

yes we can do that, also I think a detailed explanation could be added here as well - compound step notebook

good catch, I think I haven't gotten to create the issue for that notebook, I don't remember ever reading it until now.

mjhajharia · 2021-04-04T23:00:19Z

yes we can do that, also I think a detailed explanation could be added here as well - compound step notebook

good catch, I think I haven't gotten to create the issue for that notebook, I don't remember ever reading it until now.

thanks! I haven't come across an issue on that, if you want I can make one, or I can directly make a draft PR too, whichever seems better?

OriolAbril · 2021-04-05T00:11:18Z

I will create an issue now

mjhajharia · 2021-04-05T00:49:03Z

I will create an issue now

sure!

examples/diagnostics_and_criticism/sampler-stats.ipynb

OriolAbril · 2021-04-06T00:01:06Z

can you restart and run all on the notebook? This should get rid of this warning in the last cell due to having executed it twice:

The watermark extension is already loaded. To reload it, use:
%reload_ext watermark

mjhajharia · 2021-04-06T16:25:43Z

@OriolAbril done, it can be merged now

mjhajharia changed the title ~~initial commit~~ sampler stats(WIP) Apr 2, 2021

mjhajharia mentioned this pull request Apr 2, 2021

Add step dimension in sampler stats for compound steps pymc-devs/pymc#4602

Open

OriolAbril reviewed Apr 2, 2021

View reviewed changes

OriolAbril reviewed Apr 3, 2021

View reviewed changes

mjhajharia requested a review from OriolAbril April 3, 2021 17:04

mjhajharia commented Apr 3, 2021

View reviewed changes

examples/diagnostics_and_criticism/sampler-stats.ipynb Show resolved Hide resolved

OriolAbril reviewed Apr 4, 2021

View reviewed changes

examples/diagnostics_and_criticism/sampler-stats.ipynb Show resolved Hide resolved

examples/diagnostics_and_criticism/sampler-stats.ipynb Show resolved Hide resolved

examples/diagnostics_and_criticism/sampler-stats.ipynb Show resolved Hide resolved

mjhajharia commented Apr 4, 2021

View reviewed changes

examples/diagnostics_and_criticism/sampler-stats.ipynb Show resolved Hide resolved

OriolAbril mentioned this pull request Apr 4, 2021

Explain sample_stats naming convention arviz-devs/arviz#1063

Merged

3 tasks

mjhajharia mentioned this pull request Apr 5, 2021

Modify compound steps notebook #141

Merged

mjhajharia added 5 commits April 5, 2021 10:53

initial commit

75a3d71

added arviz stuff, improved some plots

9f4584b

deleted step size plot, added accept plot

61040ec

plot fixes and inference data description

d2a06ba

new plot

6ced00b

OriolAbril reviewed Apr 5, 2021

View reviewed changes

mjhajharia added 2 commits April 6, 2021 00:40

dims changes

655e663

remove tune

2c1bfbc

mjhajharia requested a review from OriolAbril April 5, 2021 19:33

mjhajharia changed the title ~~sampler stats(WIP)~~ sampler stats Apr 5, 2021

mjhajharia marked this pull request as ready for review April 5, 2021 19:33

rerun cells

3c78786

OriolAbril merged commit fdc0e58 into pymc-devs:main Apr 6, 2021

mjhajharia deleted the stats branch April 6, 2021 18:49

Uh oh!

sampler stats #95

sampler stats #95

Uh oh!

Conversation

mjhajharia commented Apr 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Apr 2, 2021

Uh oh!

mjhajharia commented Apr 2, 2021

Uh oh!

mjhajharia commented Apr 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

OriolAbril commented Apr 2, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

OriolAbril Apr 2, 2021

Choose a reason for hiding this comment

Uh oh!

mjhajharia Apr 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

OriolAbril Apr 3, 2021

Choose a reason for hiding this comment

Uh oh!

mjhajharia Apr 3, 2021

Choose a reason for hiding this comment

Uh oh!

OriolAbril Apr 4, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mjhajharia commented Apr 4, 2021

Uh oh!

OriolAbril commented Apr 4, 2021

Uh oh!

mjhajharia commented Apr 4, 2021

Uh oh!

mjhajharia commented Apr 4, 2021

Uh oh!

mjhajharia commented Apr 4, 2021

Uh oh!

OriolAbril commented Apr 4, 2021

Uh oh!

OriolAbril commented Apr 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mjhajharia commented Apr 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mjhajharia commented Apr 4, 2021

Uh oh!

OriolAbril commented Apr 4, 2021

Uh oh!

mjhajharia commented Apr 4, 2021

Uh oh!

OriolAbril commented Apr 4, 2021

Uh oh!

mjhajharia commented Apr 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

OriolAbril commented Apr 4, 2021

Uh oh!

mjhajharia commented Apr 2, 2021 •

edited

Loading

mjhajharia commented Apr 2, 2021 •

edited

Loading

mjhajharia Apr 2, 2021 •

edited

Loading

OriolAbril commented Apr 4, 2021 •

edited

Loading

mjhajharia commented Apr 4, 2021 •

edited

Loading

mjhajharia commented Apr 4, 2021 •

edited

Loading

mjhajharia commented Apr 5, 2021 •

edited

Loading