Commit a7da870

add references
1 parent 747f615 commit a7da870

File tree

examples/BART/BART_introduction.ipynb
examples/references.bib

2 files changed: +31 -1 lines changed

examples/BART/BART_introduction.ipynb

Lines changed: 16 additions & 1 deletion
@@ -445,7 +445,7 @@
     "source": [
      "From this plot we can see the main effect of each covariate on the predicted value. This is very useful as we can recover complex relationships beyond monotonic increasing or decreasing effects. For example, for the `hour` covariate we can see two peaks, around 8 and 17 hs, and a minimum at midnight.\n",
      "\n",
-     "When interpreting partial dependence plots we should be careful about the assumptions behind them. First, we are assuming the variables are independent. For example, when computing the effect of `hour` we have to marginalize over the effect of `temperature`, and this means that to compute the partial dependence value at `hour=0` we include all observed values of temperature, which may include temperatures that are never actually observed at midnight, given that lower temperatures are more likely than higher ones. Second, we are seeing only averages, so if for a covariate half the values are positively associated with the predicted variable and the other half negatively associated, the partial dependence plot will be flat as their contributions cancel each other out. This is a problem that can be solved by using individual conditional expectation plots instead, `pm.bart.plot_dependence(idata_bikes, kind=\"ice\")`. Notice that all these assumptions are assumptions of the partial dependence plot, not of our model! In fact, BART can easily accommodate interactions between variables (although the prior in BART regularizes high-order interactions). For more on interpreting Machine Learning models you could check this [book](https://christophm.github.io/interpretable-ml-book/).\n",
+     "When interpreting partial dependence plots we should be careful about the assumptions behind them. First, we are assuming the variables are independent. For example, when computing the effect of `hour` we have to marginalize over the effect of `temperature`, and this means that to compute the partial dependence value at `hour=0` we include all observed values of temperature, which may include temperatures that are never actually observed at midnight, given that lower temperatures are more likely than higher ones. Second, we are seeing only averages, so if for a covariate half the values are positively associated with the predicted variable and the other half negatively associated, the partial dependence plot will be flat as their contributions cancel each other out. This is a problem that can be solved by using individual conditional expectation plots instead, `pm.bart.plot_dependence(idata_bikes, kind=\"ice\")`. Notice that all these assumptions are assumptions of the partial dependence plot, not of our model! In fact, BART can easily accommodate interactions between variables (although the prior in BART regularizes high-order interactions). For more on interpreting Machine Learning models you could check the \"Interpretable Machine Learning\" book {cite:p}`molnar2019`.\n",
      "\n",
      "Finally, like with other regression methods, we should be careful that the effects we are seeing on individual variables are conditional on the inclusion of the other variables. So, for example, while `humidity` seems to be mostly flat, meaning that this covariate has a small effect on the number of used bikes, this could be because `humidity` and `temperature` are correlated to some extent, and once we include `temperature` in our model `humidity` does not provide much additional information. Try, for example, fitting the model again but this time with `humidity` as the single covariate, and then fitting the model again with `hour` as the single covariate. You should see that the results for these single-covariate models are very similar to the previous figure for the `hour` covariate, but less similar for the `humidity` covariate."
     ]
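The paragraph above suggests two follow-ups: switching from partial dependence to individual conditional expectation (ICE) curves, and refitting the model with a single covariate. A rough sketch of both is shown below. This is not the notebook's actual code: the data names (`X`, `Y`), the Normal likelihood, and `m=50` trees are assumptions, and the exact `plot_dependence` arguments may vary across PyMC versions.

```python
import pymc as pm  # or `import pymc3 as pm`, depending on the PyMC version in use

# Refit the bikes model with `hour` as the only covariate (data names are assumed).
X_hour = X[["hour"]]

with pm.Model() as model_hour:
    sigma = pm.HalfNormal("sigma", Y.std())
    mu = pm.BART("mu", X_hour, Y, m=50)        # sum-of-trees prior on the mean
    y = pm.Normal("y", mu, sigma, observed=Y)  # assumed likelihood
    idata_hour = pm.sample()

# Partial dependence for the single-covariate model; compare with the `hour`
# panel of the previous figure.
pm.bart.plot_dependence(idata_hour)

# Individual conditional expectation curves for the full model, as mentioned above.
pm.bart.plot_dependence(idata_bikes, kind="ice")
```

ICE curves draw one line per observation instead of a single average, so effects that would cancel out in a partial dependence plot remain visible.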
@@ -502,6 +502,21 @@
      "* Authored by Osvaldo Martin in Dec, 2021 ([pymc-examples#259](https://github.com/pymc-devs/pymc-examples/pull/259))"
     ]
    },
+   {
+    "cell_type": "markdown",
+    "id": "3c184bc8",
+    "metadata": {},
+    "source": [
+     "## References\n",
+     "\n",
+     ":::{bibliography}\n",
+     ":filter: docname in docnames\n",
+     "\n",
+     "martin2018bayesian\n",
+     "martin2021bayesian\n",
+     ":::"
+    ]
+   },
    {
     "cell_type": "markdown",
     "id": "2c557ed8",
examples/references.bib

Lines changed: 15 additions & 0 deletions
@@ -217,6 +217,13 @@ @book{martin2018bayesian
   publisher={Packt Publishing Ltd}
 }
 
+@book{martin2021bayesian,
+  title={Bayesian Modeling and Computation in Python},
+  author={Martin, Osvaldo A and Kumar, Ravin and Lao, Junpeng},
+  year={2021},
+  publisher={Chapman and Hall/CRC},
+  doi={10.1201/9781003019169}
+}
 
 @book{mcelreath2018statistical,
   title={Statistical rethinking: A Bayesian course with examples in R and Stan},
@@ -245,6 +252,14 @@ @misc{mnih2013playing
   primaryClass={cs.LG}
 }
 
+@book{molnar2019,
+  title={Interpretable Machine Learning},
+  author={Christoph Molnar},
+  year={2019},
+  subtitle={A Guide for Making Black Box Models Explainable},
+  url={https://christophm.github.io/interpretable-ml-book/}
+}
+
 @article{nowlan1992simplifying,
   title={Simplifying Neural Networks By Soft Weight-Sharing},
   author={Nowlan, Steven J and Hinton, Geoffrey E},
