-
-
Notifications
You must be signed in to change notification settings - Fork 193
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
[Feature Request] Better documentation of the estimated parameters for Gaussian processes #1725
Comments
I think requests to extend the doc are very reasonable. I am not sure though when I will have time for it myself at the moment. If you have concrete thoughts on what to add based on what you had to figure out yourself from other sources, I would be happy to review a PR adding some more details to the docs. |
that makes a lot of sense - i know maintaining & expanding software while keeping up an academic career is a huge task. i think something along the lines of even just explaining what the estimated parameters are and how to interpret them would be very helpful, and the rest could be left to the forums. for example, adding a short table to the
some very basic guidance on prior choice for relevant parameters would be useful too -- i could certainly gather some of this info from the forums and the literature and compile a summary. |
yeah, makes sense. This may be something I can quickly add before the next release. If you happen to have time for a PR beforehand (even just the parts that you know), feel free and I can then edit in the remaining parts. |
thanks so much ! where would be your preferred place for me to put the documentation ? the first thing that comes to mind is in the roxygen doc section of the |
yes that would be a good place!
Jack Leary ***@***.***> schrieb am Fr., 17. Jan. 2025, 20:13:
… thanks so much ! where would be your preferred place for me to put the
documentation ? the first thing that comes to mind is in the roxygen doc
section of the gp() function, but i can do whatever works best for you.
—
Reply to this email directly, view it on GitHub
<#1725 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADCW2AEIQXG6B5IX4ZW3W7T2LFB55AVCNFSM6AAAAABVICDHBOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKOJYHEZDSNZXGY>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
as shown above i added some details on each hyperparameter to the docs; let me know if anything looks amiss and i can correct it. |
just realized i made a slight mistake in the code sent in the PR, i used |
you can always push more commits to the PR so you can just fix it
Jack Leary ***@***.***> schrieb am Mo., 20. Jan. 2025, 02:11:
… just realized i made a slight mistake in the code sent in the PR, i used
$\rho$ to refer to the length-scale, which i should have denoted $\ell$,
in lines 82-83 of formula-gp.R. This is because I had just a few minutes
prior written a line about how the MC-Stan docs denote the length-scale as
$\rho$ instead.
—
Reply to this email directly, view it on GitHub
<#1725 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADCW2AEKKS5BJE3KX4FBXVD2LQ5KHAVCNFSM6AAAAABVICDHBOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMMBRGA4DMNZRHE>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
All done ! |
Hello,
First of all, just want to say I greatly appreciate all the work on
brms
; it is fantastic software and I cannot imagine how much effort has gone into maintaining and improving it. You're doing a great job !!My request is that the documentation of how Gaussian processes work in$k$ , and the number of dimensions in the multivariate GP, but I'm unsure what
brms
be fleshed out a bit. As it currently stands, I've visited pretty much every GP thread on the MC-Stan dev forums (and posted a few questions of my own), read the documentation of thegp()
function inbrms
, and even bought Bayesian Data Analysis 3rd Edition and read the chapter on GPs (Ch. 20 if I remember correctly). Despite all this, I'm still rather confused as to what the syntaxbrms
uses is. Some of the parameters make sense e.g.,lscale
is the estimated length-scale of the GP. Some are less clear, such as the parameters named followingzgp_gpxy[i]
(when fitting a multivariate GP with location variables namedx
andy
). I assume the numeric indicatori
is a function of the number of basis functions approximating the GPz
denotes or how to summarize the parameter information.For context (not expecting you to solve my current problem here, as I'm aware the MC-Stan forums are the more appropriate place to ask for help), I am working with spatial transcriptomics data and I'm trying to model gene expression (a Negative-binomially distributed count) as a function of a multivariate GP (approximated as a Hilbert space by$k = 20$ basis functions) based on location in Euclidean space, with individual GPs fit for each gene (an unordered factor). Thus, the model I'm fitting is as follows below, assuming the data are stored in a variable called
expr_df
:I hope this a reasonable request (and please let me know if it isn't !); the GP additions in the latest dev version of
brms
are awesome and I know I and others find them quite useful. I'm happy to contribute if possible r.e. sketching out other use cases, contributing code for examples, etc. if that would be helpful.Thanks !
Jack
University of Florida
The text was updated successfully, but these errors were encountered: