TR: fix definition of ν #124

dpo · 2023-10-07T22:44:18Z

The definition in the paper is

$$ \nu_k = 1 / (L(x_k) + \alpha^{-1} \Delta_k^{-1}), $$

and $L(x_k) = \Vert B_k \Vert$.
There is no $\theta$ involved.
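
For reference, a minimal Python sketch of this step size. It is not the package's Julia code in `src/TR_alg.jl`; the function name `nu_step` and its arguments are hypothetical, and `norm_B` stands for $\Vert B_k \Vert$.

```python
def nu_step(norm_B: float, alpha: float, delta: float) -> float:
    """Step size nu_k = 1 / (L(x_k) + 1 / (alpha * Delta_k)), with L(x_k) = ||B_k||."""
    if alpha <= 0 or delta <= 0:
        raise ValueError("alpha and delta must be positive")
    return 1.0 / (norm_B + 1.0 / (alpha * delta))


# With ||B_k|| = 3, alpha = 1 and Delta_k = 1, the denominator is 3 + 1 = 4.
print(nu_step(3.0, 1.0, 1.0))  # 0.25
```

Note that when `norm_B` is zero the step reduces to `alpha * delta`.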

codecov · 2023-10-07T22:49:15Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (20fb633) 61.40% compared to head (3e191d1) 63.55%.
Report is 3 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #124      +/-   ##
==========================================
+ Coverage   61.40%   63.55%   +2.14%     
==========================================
  Files          11       11              
  Lines        1293     1295       +2     
==========================================
+ Hits          794      823      +29     
+ Misses        499      472      -27


github-actions · 2023-10-07T23:15:50Z

Here are the demos-results

geoffroyleconte · 2023-10-09T16:10:09Z

Isn't it

$$ \nu_k \le 1 / (L(x_k) + \alpha^{-1} \Delta_k^{-1}) $$

in the paper?
I've tried to launch the benchmark tables and I experienced instabilities with unconstrained FH on every random seed I've tried.

dpo · 2023-10-09T16:37:43Z

In principle, there is no reason to take a smaller step size than necessary.

What kind of instabilities?

geoffroyleconte · 2023-10-09T16:47:57Z

Issues with DifferentialEquations.jl

rjbaraldi · 2023-10-10T07:06:16Z

This is probably because FH doesn't actually satisfy our problem assumptions. I recall now that, in my experience too, DifferentialEquations.jl fails occasionally.

github-actions · 2023-11-01T14:13:28Z

Here are the demos-results

dpo · 2023-11-03T15:21:02Z

@geoffroyleconte @rjbaraldi What do you think of these numerical results?

src/TR_alg.jl

Co-authored-by: geoffroyleconte <[email protected]>

github-actions · 2023-11-06T23:03:13Z

Here are the demos-results

dpo · 2023-11-08T16:55:28Z

@geoffroyleconte What do you think of these demo results?

geoffroyleconte · 2023-11-08T18:37:57Z

I am wondering whether we should use
$$\nu_k = \frac{\alpha \Delta_k}{1 + \Vert B_k \Vert (1 + \alpha \Delta_k)}$$
or
$$\nu_k = \frac{1}{\alpha^{-1} \Delta_k^{-1} +\Vert B_k \Vert (1 + \alpha^{-1} \Delta_k^{-1})}$$
?
I observe different results with these 2 expressions. I would choose the 2nd because we set $\alpha$ to 1 / eps(), but I'm not 100% sure.
In both cases, this changes the benchmarks of the TRDH paper (I've created a branch on my fork with the current state of this repo to keep the results). Maybe we should also experiment more with the value of $\alpha$ in another PR (the old implementation used the parameter $\theta$ set to $10^{-3}$).
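
As written, the two expressions are algebraically the same: multiplying the numerator and denominator of the second by $\alpha \Delta_k$ gives the first. Any difference would therefore have to come from floating-point evaluation or from how each one is coded. A small Python sketch to compare them numerically follows; the function names are hypothetical and this is not the package's Julia code.

```python
import sys


def nu_first(norm_B, alpha, delta):
    # alpha * Delta / (1 + ||B|| * (1 + alpha * Delta))
    a = alpha * delta
    return a / (1.0 + norm_B * (1.0 + a))


def nu_second(norm_B, alpha, delta):
    # 1 / (1/(alpha * Delta) + ||B|| * (1 + 1/(alpha * Delta)))
    inv = 1.0 / (alpha * delta)
    return 1.0 / (inv + norm_B * (1.0 + inv))


alpha = 1.0 / sys.float_info.epsilon  # Python analogue of Julia's 1 / eps()
for norm_B, delta in [(2.0, 1.0), (0.5, 1e-3), (10.0, 1e3)]:
    v1 = nu_first(norm_B, alpha, delta)
    v2 = nu_second(norm_B, alpha, delta)
    # Print both values and their relative difference.
    print(norm_B, delta, v1, v2, abs(v1 - v2) / v2)
```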

dpo · 2023-11-08T19:15:49Z

> I observe different results with these 2 expressions. I would choose the 2nd because we set $\alpha$ to 1 / eps(), but I'm not 100% sure.

I agree that if $\alpha$ is that small, the second expression would be better.

The old and new formulae coincide if $\alpha = \theta^{-1} \Delta^{-1}$. If we assume that $\Delta$ remains $\Theta(1)$, that means $\alpha \approx \theta^{-1}$, i.e., $\alpha \approx 10^3$ in this case.

geoffroyleconte · 2023-11-09T16:13:34Z

> > I observe different results with these 2 expressions. I would choose the 2nd because we set $\alpha$ to 1 / eps(), but I'm not 100% sure.
>
> I agree that if $\alpha$ is that small, the second expression would be better.

You mean $\alpha$ large?

> The old and new formulae coincide if $\alpha = \theta^{-1} \Delta^{-1}$. If we assume that $\Delta$ remains $\Theta(1)$, that means $\alpha \approx \theta^{-1}$, i.e., $\alpha \approx 10^3$ in this case.

Yes, the old expression is

$$\nu_k = \frac{1}{\alpha^{-1} \Delta_k^{-1} + \Vert B_k \Vert(1 + \theta)}$$

But the first term of the denominator will change as well if we decrease $\alpha$ (maybe it is not really problematic).

Should I commit the changes I suggested? And maybe tune $\alpha$ differently in the benchmarks in another PR?
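
To make the correspondence between the old expression and the second proposed expression explicit, here is a short derivation, assuming $\Vert B_k \Vert \neq 0$ and taking both formulas exactly as quoted in this thread:

$$\frac{1}{\alpha^{-1} \Delta_k^{-1} + \Vert B_k \Vert (1 + \alpha^{-1} \Delta_k^{-1})} = \frac{1}{\alpha^{-1} \Delta_k^{-1} + \Vert B_k \Vert (1 + \theta)} \iff \alpha^{-1} \Delta_k^{-1} = \theta \iff \alpha = \theta^{-1} \Delta_k^{-1}$$

For example, with $\theta = 10^{-3}$ and $\Delta_k = 1$, this gives $\alpha = 10^3$.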

dpo · 2023-11-12T22:44:37Z

Yes, let's merge this if you think the results look reasonable. I would really like to introduce proper benchmarks in this repo so we can have a clear view of the performance without skimming through the demos.

src/TR_alg.jl

Co-authored-by: geoffroyleconte <[email protected]>

github-actions · 2023-11-13T22:37:46Z

Here are the demos-results

geoffroyleconte · 2023-11-14T16:23:04Z

I have instabilities with FH; we would need to change the value of $\alpha$ and/or increase $\epsilon$ (but I can do that in another PR if you want).

TR: fix definition of ν

5c63896

dpo requested review from rjbaraldi and geoffroyleconte October 7, 2023 22:44

steplength to safeguard against unbounded Hessian

f741e8c

geoffroyleconte requested changes Nov 3, 2023


dpo requested a review from geoffroyleconte November 6, 2023 13:19

Update src/TR_alg.jl

332e62b

Co-authored-by: geoffroyleconte <[email protected]>

geoffroyleconte requested changes Nov 13, 2023


dpo and others added 4 commits November 13, 2023 17:19

Update src/TR_alg.jl

b7675a8

Co-authored-by: geoffroyleconte <[email protected]>

Update src/TR_alg.jl

11551e9

Co-authored-by: geoffroyleconte <[email protected]>

Update src/TR_alg.jl

fe0c1f4

Co-authored-by: geoffroyleconte <[email protected]>

Update src/TR_alg.jl

3e191d1

Co-authored-by: geoffroyleconte <[email protected]>

dpo merged commit 7e53ad2 into master Feb 20, 2024

dpo deleted the tr-nu branch February 20, 2024 17:28
