feat(scalarization): Add DWA by ppraneth · Pull Request #731 · SimplexLab/TorchJD

ppraneth · 2026-06-11T04:18:38Z

Adds DWA, the Dynamic Weight Average scalarizer from End-to-End Multi-Task Learning with Attention (CVPR 2019). It weights each task by how fast its loss has been going down compared to the others.

Working

At epoch $t$, the values are combined as:

$$\sum_k \lambda_k(t) \, L_k(t), \qquad \lambda_k(t) = \frac{K \exp(w_k(t-1) / T)}{\sum_i \exp(w_i(t-1) / T)}, \qquad w_k(t-1) = \frac{L_k(t-1)}{L_k(t-2)}$$

$w_k$ is the ratio of a task's average losses over the two previous epochs (a task whose loss dropped less gets more weight).
$T$ is the temperature (larger $T$ → more uniform weights). The paper uses 2.0.
$K$ is the number of values, so the weights sum to $K$.
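As a quick sanity check of the formula, here is a minimal standard-library sketch of the weight computation. The function name `dwa_weights` and the sample loss values are made up for illustration; this is not the code added by the PR.

```python
import math


def dwa_weights(prev_avg, prev_prev_avg, temperature=2.0):
    """Compute DWA weights from the average losses of epochs t-1 and t-2."""
    # w_k(t-1) = L_k(t-1) / L_k(t-2): how much each task's loss changed.
    ratios = [a / b for a, b in zip(prev_avg, prev_prev_avg)]
    scaled = [r / temperature for r in ratios]
    # Subtracting the max before exp() is for numerical stability only;
    # it cancels out in the normalization below.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    k = len(ratios)
    # Softmax rescaled by K, so the weights sum to K.
    return [k * e / total for e in exps]


# Task 0 halved its loss (ratio 0.5); task 1 barely improved (ratio 0.9).
weights = dwa_weights([0.5, 0.9], [1.0, 1.0])
print(weights)  # roughly [0.90, 1.10]: the slower task gets more weight

# A larger temperature pulls the weights toward uniform.
flatter = dwa_weights([0.5, 0.9], [1.0, 1.0], temperature=10.0)
print(flatter)
```

With $K = 2$ the two weights always sum to 2, whatever the temperature.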

The weights only need past loss values, no gradients (the paper notes this is why it's simpler than GradNorm), so it fits the Scalarizer interface.

Usage

The weights at epoch $t$ depend on the average losses of epochs $t-1$ and $t-2$. The scalarizer can't tell on its own when an epoch ends, so the user calls it every batch and calls a step() method once at the end of each epoch:

scalarizer = DWA()
for epoch in range(n_epochs):
    for batch in loader:
        optimizer.zero_grad()        # clear gradients left over from the previous batch
        losses = ...                 # one loss per task
        loss = scalarizer(losses)    # weighted sum, also records the batch's losses
        loss.backward()
        optimizer.step()
    scalarizer.step()                # roll the epoch history, once per epoch
forward records each batch's losses, and step() finalizes the just-finished epoch's average loss and rolls the history forward (dropping the one from two epochs ago). This matches how the paper and LibMTL use it (per-epoch). During the first two epochs (before there are two averages) the weights are uniform.
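The bookkeeping described above can be sketched in plain Python. Everything here is hypothetical: the class name `DWASketch`, its private attributes, and the use of floats instead of tensors are assumptions made for illustration, and the actual implementation in `_dwa.py` may differ.

```python
import math


class DWASketch:
    """Hypothetical float-based illustration; not TorchJD's DWA class."""

    def __init__(self, temperature=2.0):
        if temperature <= 0:
            raise ValueError("temperature must be > 0")
        self.temperature = temperature
        self.reset()

    def reset(self):
        self._sums = None    # running per-task loss sums for the current epoch
        self._count = 0      # number of batches recorded in the current epoch
        self._history = []   # averages of at most the two last finished epochs

    def weights(self, k):
        if len(self._history) < 2:
            return [1.0] * k  # bootstrap: uniform weights
        older, newer = self._history
        exps = [math.exp(n / o / self.temperature) for n, o in zip(newer, older)]
        total = sum(exps)
        return [k * e / total for e in exps]

    def __call__(self, losses):
        # Record the batch's losses, then return the weighted sum.
        if self._sums is None:
            self._sums = [0.0] * len(losses)
        self._sums = [s + l for s, l in zip(self._sums, losses)]
        self._count += 1
        return sum(w * l for w, l in zip(self.weights(len(losses)), losses))

    def step(self):
        if self._count == 0:
            return  # nothing recorded this epoch: no-op
        avg = [s / self._count for s in self._sums]
        # Keep only the two most recent epoch averages.
        self._history = (self._history + [avg])[-2:]
        self._sums, self._count = None, 0


s = DWASketch()
s([1.0, 1.0]); s.step()   # epoch 0: uniform weights
s([0.5, 0.9]); s.step()   # epoch 1: still uniform, only one average so far
out = s([0.4, 0.8])       # epoch 2: weights computed from epochs 0 and 1
print(out)
```

In this sketch the weights are plain floats, so there is nothing to detach; with tensors, the weights would need to be detached for gradients to flow only through the current batch's losses.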

Design notes

DWA(temperature=2.0); temperature must be > 0.
Its state is a non-trainable buffer; reset() clears it.
No shape argument is needed (unlike UW/IMTLL): the buffer is created lazily from the inputs.
The weights are detached, so gradients flow only through the current batch's losses.
It weights each value by the ratio of its losses over consecutive epochs, which the paper defines as a descending rate in the range (0, +∞). The losses are therefore expected to keep a consistent, nonzero sign across epochs. They need not be positive, and positivity is not enforced.
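The sign requirement in the last note comes down to simple arithmetic on the ratio. A small illustration, where the helper name `loss_ratio` and the numbers are hypothetical:

```python
def loss_ratio(curr_avg, prev_avg):
    """w_k = L_k(t-1) / L_k(t-2) for one task; prev_avg must be nonzero."""
    return curr_avg / prev_avg


positive = loss_ratio(0.8, 1.0)     # both positive: ratio > 0
negative = loss_ratio(-0.8, -1.0)   # both negative: same ratio, still > 0
flipped = loss_ratio(-0.2, 1.0)     # sign change: ratio < 0, outside (0, +inf)
print(positive, negative, flipped)
```

Same-sign losses keep the ratio positive, whereas a sign flip between epochs produces a negative ratio that no longer reads as a descending rate.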

Tests

tests/unit/scalarization/test_dwa.py covers: the uniform/bootstrap behavior for the first two epochs, the exact weight formula, that the per-epoch average is used (not just the last batch), that step() drops the oldest epoch, that the weights sum to the number of values, scalar output and gradient flow over all input shapes (including the computed-weights path after two epochs), support for consistently-signed negative losses (the ratio of same-sign losses stays positive), reset(), step() being a no-op with no data, that there are no learnable parameters, shape-change errors (within and between epochs), temperature validation, and the representations.

Signed-off-by: ppraneth <pranethparuchuri@gmail.com>

ValerianRey

Very good! Just a few nitpicks and some minor problems in the documentation.

Co-authored-by: Valérian Rey <31951177+ValerianRey@users.noreply.github.com>

Signed-off-by: ppraneth <pranethparuchuri@gmail.com>

ppraneth · 2026-06-11T10:34:28Z

@ValerianRey I have updated the docstrings.

ValerianRey · 2026-06-11T12:22:10Z

Tyvm! LGTM. @PierreQuinton are you ok with merging this?

ppraneth added 2 commits June 11, 2026 09:28

add DWA

adf8304

Signed-off-by: ppraneth <pranethparuchuri@gmail.com>

add notes

e568dbd

Signed-off-by: ppraneth <pranethparuchuri@gmail.com>

ppraneth requested review from a team, PierreQuinton and ValerianRey as code owners June 11, 2026 04:18

Merge branch 'main' into scalarization-6

f317edd

ppraneth added the cc: feat and package: scalarization labels Jun 11, 2026

github-actions Bot changed the title ~~Scalarization 6~~ feat(scalarization): Scalarization 6 Jun 11, 2026

ppraneth changed the title ~~feat(scalarization): Scalarization 6~~ feat(scalarization): Add DWA Jun 11, 2026

Merge branch 'main' into scalarization-6

b791e6d

ValerianRey mentioned this pull request Jun 11, 2026

Scalarizer Tracker #667

Open

ValerianRey requested changes Jun 11, 2026


Comment thread tests/unit/scalarization/test_dwa.py Outdated

Comment thread tests/unit/scalarization/test_dwa.py Outdated

Comment thread src/torchjd/scalarization/_dwa.py Outdated

Comment thread src/torchjd/scalarization/_dwa.py Outdated

Comment thread src/torchjd/scalarization/_dwa.py

ppraneth and others added 4 commits June 11, 2026 15:46

Update src/torchjd/scalarization/_dwa.py

2e7e69f

Co-authored-by: Valérian Rey <31951177+ValerianRey@users.noreply.github.com>

Update tests/unit/scalarization/test_dwa.py

c0527dd

Co-authored-by: Valérian Rey <31951177+ValerianRey@users.noreply.github.com>

Update tests/unit/scalarization/test_dwa.py

b9ae04b

Co-authored-by: Valérian Rey <31951177+ValerianRey@users.noreply.github.com>

fix doc

ad4df9f

Signed-off-by: ppraneth <pranethparuchuri@gmail.com>

ppraneth requested a review from ValerianRey June 11, 2026 10:31

Minor improvement of the docs

1fd7038

ValerianRey approved these changes Jun 11, 2026

ppraneth wants to merge 9 commits into SimplexLab:main from ppraneth:scalarization-6
