
CategoricalLikelihood compatibility with LatentGP #404

Closed · ancorso opened this issue Jul 15, 2024 · 5 comments · Fixed by JuliaGaussianProcesses/AugmentedGPLikelihoods.jl#129


ancorso commented Jul 15, 2024

I would like to model a multi-class dataset using LatentGPs and the CategoricalLikelihood (from GPLikelihoods.jl). The CategoricalLikelihood requires multiple latent GPs and expects their output to be an AbstractVector{<:AbstractVector{<:Real}}. However, the multi-output GP design concatenates all outputs into one long vector, which is not the structure a LatentGP with a CategoricalLikelihood needs. Below is an example:

```julia
using AbstractGPs, KernelFunctions, GPLikelihoods

gpm = GP(IndependentMOKernel(Matern52Kernel()))
x = rand(100)
X = MOInput(x, 3)  # isotopic multi-output inputs: 3 outputs per point

rand(gpm(X)) # produces a 300-element vector

cgp = LatentGP(gpm, CategoricalLikelihood(), 1e-3)
cgpx = cgp(X)

res = rand(cgpx) # produces 300 values for `f` and a single sample for `y`,
                 # because the softmax is applied over the full vector
```
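
For reference, the structure I would expect the likelihood to consume is one latent vector per input point. Here is a hand-rolled sketch of that reshaping (assuming `MOInput(x, 3)` orders the flattened sample output-by-output, i.e. all 100 values for output 1 first; `F`, `fs`, and `ys` are just names I'm making up):

```julia
f = rand(gpm(X))                        # 300-element concatenated sample
F = reshape(f, length(x), 3)            # 100×3: one column per latent GP
fs = [collect(r) for r in eachrow(F)]   # 100 inner vectors of 3 latents each
lik = CategoricalLikelihood()
ys = map(rand ∘ lik, fs)                # one categorical sample per input point
```

(Whether each inner vector should have K or K-1 entries presumably depends on the link the CategoricalLikelihood is constructed with, so the number of latent GPs would need to match that convention.)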

Let me know if this is the wrong way of handling categorical likelihoods or if there is a recommendation on how to get this working out of the box. Happy to work on a PR.


theogf commented Jul 15, 2024

Hey! Yes, the interface is far from ideal right now. You can find a multi-class example (using a different inference approach) here: https://juliagaussianprocesses.github.io/AugmentedGPLikelihoods.jl/dev/examples/categorical/


ancorso commented Jul 15, 2024

Thanks for the quick reply! I did see that example, but I ran into some issues with the `aug_elbo` calculation (which is ultimately what I am after here).

If I understand correctly, your representation for the mean and variance of the distribution over inducing points is an ArraysOfArrays object (that's why the CAVI algorithm broadcasts the line `posts_u = u_posterior.(Ref(fz), ms, Ss)`). However, the current call to `aug_elbo` (`aug_elbo(lik, u_posterior(fz, m, S), x, y)`, which isn't in a code block, by the way) neither includes this broadcasting nor internally handles the resulting vector of posteriors if you do broadcast. Could you clarify how that example is supposed to work?
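
Concretely, here is the mismatch as I read it (a sketch; `fz`, `ms`, `Ss`, and `u_posterior` are the names from the example script, and the types are my guesses):

```julia
# Broadcasting over the per-class means/covariances gives a vector of
# posteriors, one per latent GP:
posts_u = u_posterior.(Ref(fz), ms, Ss)  # one posterior per class

# ...whereas the call shown in the docs constructs a single posterior,
# so it's unclear how `aug_elbo` is meant to see all the latent processes:
# aug_elbo(lik, u_posterior(fz, m, S), x, y)
```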

Thanks so much!


ancorso commented Jul 15, 2024

It seems like the output of this line: https://github.com/JuliaGaussianProcesses/AugmentedGPLikelihoods.jl/blob/41336971ee8882a147e996cf4e791831422da393/examples/categorical/script.jl#L138
should perhaps be transformed into an AbstractVector{<:AbstractVector{<:Normal}}?
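
Something like the following transposition, maybe (a sketch; `posts_u` is the broadcast vector of per-class posteriors from above, and `qf_per_class`/`qf_per_point` are names I'm inventing):

```julia
# One length-N vector of Normal marginals per class...
qf_per_class = [marginals(post(x)) for post in posts_u]
# ...transposed into one length-K vector of Normals per data point:
qf_per_point = [getindex.(qf_per_class, i) for i in eachindex(x)]
```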


theogf commented Jul 15, 2024

That's a good point! I will try to fix the script in the repo directly!


ancorso commented Jul 16, 2024

> That's a good point! I will try to fix the script in the repo directly!

That would be a big help, thank you!
