[MRG] make transformer_from_metric more robust #191
Conversation
Now that I think about it, maybe the warning is not a good thing to throw? Maybe it would be better to use a verbose flag to decide whether to print the warning (which would then be a regular print rather than a warning)? What do you think? (The verbose attribute of the estimator that uses this would be passed as an argument to this function.)
As discussed, I think it would be better to only allow symmetric PSD matrices as input and throw an error otherwise (while allowing some tolerance for numerical errors where some eigenvalues are very close to zero but negative). Also, for the diagonal case, we can go for the safe solution of simply checking whether the off-diagonal terms are exactly zero (which covers the most concrete case where the matrix is learned to be diagonal in the first place, as in MMC with diagonal=True), and treat the matrix as non-diagonal otherwise.
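As a hedged sketch (the helper name and error messages are illustrative, not the library's actual API), the checks discussed above could look like:

```python
import numpy as np

def check_metric(metric, tol=None):
    """Validate a candidate Mahalanobis matrix (illustrative sketch)."""
    if not np.allclose(metric, metric.T):
        raise ValueError("The input matrix should be symmetric.")
    w = np.linalg.eigvalsh(metric)
    if tol is None:
        # scale the tolerance with the largest eigenvalue and machine epsilon
        tol = w.max() * metric.shape[0] * np.finfo(w.dtype).eps
    if np.any(w < -tol):
        raise ValueError("The input matrix should be PSD.")
    # safe diagonal check: off-diagonal terms must be exactly zero
    is_diagonal = np.array_equal(metric, np.diag(np.diag(metric)))
    return w, is_diagonal
```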
I agree, I just pushed a more recent version that takes these remarks into account.
Some nitpicks.
Also, the docstring seems overly detailed. I think "Computes the transformation matrix from the Mahalanobis matrix, i.e. the matrix L such that metric=L.T.dot(L)." is enough; the rest could be put as comments inside the function to explain what is being done.
metric_learn/_util.py (outdated)

    tol : float, optional
        Negative eigenvalues above -tol are considered zero. If
        tol is None, and w are `metric`'s eigenvalues, and eps is the
"and w are `metric`'s eigenvalues" is not needed
That's right, done
metric_learn/_util.py (outdated)

    """
    if tol is None:
        tol = w.max() * len(w) * np.finfo(w.dtype).eps
    if any(w[w < 0] < -tol):
`w < 0` not needed
Although it actually prevents anything from being done in case the provided tol is negative (which is not supposed to happen according to the docstring, but is not checked).
That's right, I'll let it be `w < -tol` and add a quick test to check that tol is positive.
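A minimal sketch of the agreed-upon check, assuming `w` holds the eigenvalues of the metric (the helper name and messages are illustrative):

```python
import numpy as np

def check_nonnegative_eigenvalues(w, tol=None):
    # quick test that the provided tol is positive, as agreed above
    if tol is not None and tol < 0:
        raise ValueError("tol should be positive.")
    if tol is None:
        tol = w.max() * len(w) * np.finfo(w.dtype).eps
    if np.any(w < -tol):  # the w[w < 0] mask is no longer needed
        raise ValueError("Matrix is not positive semi-definite (PSD).")
```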
metric_learn/_util.py (outdated)

    tol : positive float, optional
        Eigenvalues of `metric` between 0 and -tol are considered zero. If tol is
        None, and w are `metric`'s eigenvalues, and eps is the epsilon value for
same as above
How can we explain how the tolerance is set here without talking about the eigenvalues? Though maybe an explanation mentioning only the highest eigenvalue could be better? Like:
Eigenvalues of `metric` between 0 and - tol are considered zero. If tol is
None, and w_max is `metric`'s largest eigenvalue, and eps is the epsilon
value for datatype of w, then tol is set to w_max * metric.shape[0] * eps.
Nevermind, indeed here we need to mention eigenvalues. The suggested reformulation looks good.
    P = ortho_group.rvs(7, random_state=rng)
    M = P.dot(D).dot(P.T)
    with pytest.raises(ValueError) as raised_error:
        transformer_from_metric(M)
Here you could also easily test that the error is raised in the diagonal case, for instance when calling transformer_from_metric(D).
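The suggested diagonal-case test might look like this sketch, where `transformer_from_metric` is replaced by a toy stand-in with the same contract (raise `ValueError` on non-PSD input), since the real one lives in `metric_learn/_util.py`:

```python
import numpy as np

def transformer_from_metric(metric):  # toy stand-in for illustration only
    w, V = np.linalg.eigh(metric)
    tol = np.abs(w).max() * len(w) * np.finfo(w.dtype).eps
    if np.any(w < -tol):
        raise ValueError("Matrix is not positive semi-definite (PSD).")
    # L = diag(sqrt(w)).dot(V.T), so that metric = L.T.dot(L)
    return np.sqrt(np.maximum(w, 0.))[:, None] * V.T

# the error should also be raised directly on the diagonal (non-PSD) matrix
D = np.diag([1., -1., 2.])
try:
    transformer_from_metric(D)
    raised = False
except ValueError:
    raised = True
assert raised
```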
I agree, will do
Thanks for the review @bellet, I addressed all the comments, there's just
LGTM!
Fixes #175
This PR fixes the problem we had with transformer_from_metric by checking whether the matrix is diagonal based on the relative gap between the maximum absolute value of the off-diagonal coefficients and the minimum absolute value of the diagonal coefficients, rather than what was done before.

It is also more robust in deciding whether to use Cholesky or the eigendecomposition: instead of computing the determinant to check whether the matrix is not definite, it tries Cholesky, and if an error is raised it falls back to the eigendecomposition. Indeed, the determinant may be non-null even when there is a very small eigenvalue (v=1e-5) that should be considered null, provided there are many large eigenvalues (see one example in the test).

It also raises an error when the input is not symmetric, and a warning when switching from the Cholesky decomposition to the eigendecomposition because Cholesky is not doable.
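A hedged sketch of the try-Cholesky-then-eigendecomposition strategy described above (illustrative, not the exact implementation in the PR):

```python
import warnings
import numpy as np

def transformer_sketch(metric):
    """Compute L such that metric = L.T.dot(L) (illustrative sketch)."""
    if not np.allclose(metric, metric.T):
        raise ValueError("The input metric should be symmetric.")
    try:
        # Cholesky succeeds iff the matrix is numerically positive definite
        return np.linalg.cholesky(metric).T
    except np.linalg.LinAlgError:
        # fall back to the eigendecomposition for PSD matrices whose
        # (near-)zero eigenvalues make Cholesky fail
        warnings.warn("Cholesky failed, falling back to eigendecomposition.")
        w, V = np.linalg.eigh(metric)
        return np.sqrt(np.maximum(w, 0.))[:, None] * V.T
```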
TODO:
- find a value of `v` for which Cholesky will fail, and from it build the example where the determinant is not "np.close" to zero