Changes for release 1.2.0 (v1 release with fixes/improvements backported from v2) #1258

opcode81 · 2025-05-15T11:09:41Z

See change log

… on for discrete case

…uto-handling with defaults)

… spaces and allow coefficient to be modified, adding an informative docstring (previous implementation was reasonable only for continuous action spaces) Adjust parametrisation to match procedural example in atari_sac_hl

… in NPG and TRPO * Parameter optim must not include the actor parameters (as they are updated via natural gradients that are computed internally) * Fix incorrect optimizer instantiation in high-level API

…to running the trainer, yet recently introduced parameter `reset_prior_to_run` of `run` suggested that it was optional. But it was not respected, because `__iter__` would always call `reset(reset_collectors=True, reset_buffer=False)` regardless. The parameter was removed; instead, the parameters of `run` now mirror the parameters of `reset`, and the implicit `reset` call in `__iter__` was removed. This aligns with upcoming changes in Tianshou v2.0.0.

responsible for creating the snapshot(s) on the original branch and then compare with results on a modified branch. Add writing of a log file for determinism tests.

…iner

Fix some broken tests that directly used the trainer's iterator instead of using run(): * test/continuous/test_ppo * test/continuous/test_td3

…QN, QRDQN, Rainbow)

* Require only core message equivalence (network parameter hashes) for the test to pass * Allow to ignore certain messages on a per-test level

codecov-commenter · 2025-05-19T11:26:27Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 49.15254% with 150 lines in your changes missing coverage. Please review.

Project coverage is 84.09%. Comparing base (14b97ea) to head (0db2e74).

Files with missing lines	Patch %	Lines
tianshou/utils/determinism.py	31.97%	117 Missing ⚠️
tianshou/trainer/base.py	63.63%	12 Missing ⚠️
tianshou/evaluation/rliable_evaluation_hl.py	0.00%	11 Missing ⚠️
tianshou/highlevel/params/alpha.py	16.66%	5 Missing ⚠️
tianshou/highlevel/agent.py	85.71%	2 Missing ⚠️
tianshou/highlevel/env.py	93.75%	2 Missing ⚠️
tianshou/highlevel/experiment.py	50.00%	1 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1258      +/-   ##
==========================================
- Coverage   85.28%   84.09%   -1.19%     
==========================================
  Files         102      104       +2     
  Lines        9083     9327     +244     
==========================================
+ Hits         7746     7844      +98     
- Misses       1337     1483     +146

Flag	Coverage Δ
unittests	`84.09% <49.15%> (-1.19%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Slightly enhanced docstrings in collector

The mechanism introduced in v1.1.0 was completely revised: - The `train_seed` and `test_seed` attributes were removed from `SamplingConfig`. Instead, the seeds are derived from the seed defined in `ExperimentConfig`. - Seed attributes of `EnvFactory` classes were removed. Instead, seeds are passed to methods of `EnvFactory`.

Control validation enabling with global flag

Don't mutate incoming dict, don't load invalid fields

opcode81 and others added 30 commits March 4, 2025 19:46

Fix missing reset in discrete_dqn

bbc36d0

ActorFactoryDefault: Fix hidden sizes and activation not being passed…

b5665e3

… on for discrete case

ExperimentConfig: Do not inherit from anything (breaks jsonargparse a…

decb416

…uto-handling with defaults)

Use DummyVectorEnv instead of Subproc in test_a2c_with_il

eeb6610

Fix misleading docstring and corresponding errors pertaining to optim…

8dbf0bf

… in NPG and TRPO * Parameter optim must not include the actor parameters (as they are updated via natural gradients that are computed internally) * Fix incorrect optimizer instantiation in high-level API

Update changelog

1713331

Add basic implementation for determinism tests

4f17673

Log parameters of ActorCritic components separately

d78f0ed

Fix failure message

5061c22

Add TorchDeterministicModeContext

c88f844

Devcontainer

9fbfd99

Update sensai-utils to 1.4.0

cf0e0d8

Support new mode of operation determinism tests, where each developer is

3ed3c20

responsible for creating the snapshot(s) on the original branch and then compare with results on a modified branch. Add writing of a log file for determinism tests.

Add determinism test for DiscreteBCQ

364814d

Fix message assignment

7a8902a

Log TrainingStats with TraceLogger after every training step

60e8cea

Log sampled batch indices with TraceLogger when performing update

57ec496

Formatting

4c0699b

ReplayBuffer: Establish determinism by using a well-defined RandomState

c1f580e

Improve change log entry pertaining to the breaking change in the tra…

5f515a1

…iner

Add determinism tests for virtually all algorithms

c05294f

Fix some broken tests that directly used the trainer's iterator instead of using run(): * test/continuous/test_ppo * test/continuous/test_td3

Fix determinism test name

61c9fa3

Fix test name

2816d04

Add more trace log messages for context

c7d48a3

Configure training eps value for initial data collection (DQN, BDQ)

63c5e95

Fix test names

b735e0b

Configure training eps value for initial data collection (C51, FQF, I…

809279b

…QN, QRDQN, Rainbow)

TraceLogger: Add flag 'verbose'

790dbb3

MischaPanch and others added 14 commits May 14, 2025 22:16

v1: Removed unused and failing test

3fc484a

v1: minor type validation

5b46038

Merge branch 'dev-v1' of github.com:thu-ml/tianshou into dev-v1

547f0a5

Fix test name

f73b247

Use trainer run instead of direct iteration

b6fe90e

Improve trace log message

744561e

Improve change log

a89cb14

Fix mypy issues

cd57fa7

Relax determinism tests:

2b57654

* Require only core message equivalence (network parameter hashes) for the test to pass * Allow to ignore certain messages on a per-test level

test_drqn: Collect initial data in training mode

0c385f9

Formatting

a4e81ea

Fix assertion (stats can be None)

eaa7f96

Fix create_toc_py not accounting for spaces in paths

af0a959

Fix unquoted maths in docstring

619051c

MischaPanch and others added 6 commits May 19, 2025 13:41

v1: improvement in doc-build commands

cf22adf

Fix ruff complaint

3192dbf

Document determinism test usage

de78ecb

Mentioned determinism tests in PR template

802fb83

Allow collection of empty episodes (done on reset)

2cd40cb

Slightly enhanced docstrings in collector

opcode81 force-pushed the dev-v1 branch from 7d7bb67 to 981e649 Compare May 19, 2025 18:45

opcode81 and others added 6 commits May 19, 2025 20:55

Fix syntax issue

5f5bab9

Merge remote-tracking branch 'thuml/master' into dev-v1

ffdc9d4

AtariEnvFactory: Fix super call

a86e246

v1: adjust range for seed to be compatible with envpool

856e2b8

v1: disable buffer hasnull checks by default

d8daab2

Control validation enabling with global flag

v1: fixes in rliable eval data loading, better logging

0db2e74

Don't mutate incoming dict, don't load invalid fields

MischaPanch force-pushed the dev-v1 branch from 5c1e4b2 to 0db2e74 Compare June 21, 2025 14:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Changes for release 1.2.0 (v1 release with fixes/improvements backported from v2) #1258

Changes for release 1.2.0 (v1 release with fixes/improvements backported from v2) #1258

Uh oh!

opcode81 commented May 15, 2025

Uh oh!

codecov-commenter commented May 19, 2025 •

edited

Loading

Uh oh!

Uh oh!

Changes for release 1.2.0 (v1 release with fixes/improvements backported from v2) #1258

Are you sure you want to change the base?

Changes for release 1.2.0 (v1 release with fixes/improvements backported from v2) #1258

Uh oh!

Conversation

opcode81 commented May 15, 2025

Uh oh!

codecov-commenter commented May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

codecov-commenter commented May 19, 2025 •

edited

Loading