Improve dataset preparation support + multiresolution prep #39

a-r-r-o-w · 2024-10-16T15:07:34Z

Fixes #4

sayakpaul

Nice work. Love the thread-based serialization.

training/dataset.py

training/prepare_dataset.py

a-r-r-o-w · 2024-10-17T09:14:41Z

@sayakpaul okay to merge? could you approve if yes?

sayakpaul · 2024-10-17T09:18:53Z

training/dataset.py

+            if len(bucket) == 0:
+                continue
+            if self.shuffle:
+                random.shuffle(bucket)


Could also fix the seed here?

set_seed already does this in the training and preparation scripts

sayakpaul · 2024-10-17T09:19:20Z

training/prepare_dataset.py

+    parser.add_argument(
+        "--height_buckets",
+        nargs="+",
+        type=check_height,


sayakpaul · 2024-10-17T09:19:59Z

training/prepare_dataset.py

+    # 3. Prepare models
+    device = f"cuda:{rank}"
+
+    generator = torch.Generator(device).manual_seed(args.seed)


Maybe better to always initialize the seed on a CPU.

hmm, setting it to cpu is giving some weird device mismatch errors 🤔 i will try debugging soon, but for now, on a given gpu and same seed, seems to be fully reproducible (but yes i know why cpu is better alternative). todo

update

ea110b8

a-r-r-o-w requested review from sayakpaul and zRzRzRzRzRzRzR October 16, 2024 15:07

make style

7cac9d4

sayakpaul reviewed Oct 16, 2024

View reviewed changes

training/dataset.py Show resolved Hide resolved

training/prepare_dataset.py Outdated Show resolved Hide resolved

training/prepare_dataset.py Show resolved Hide resolved

training/prepare_dataset.py Outdated Show resolved Hide resolved

training/prepare_dataset.py Show resolved Hide resolved

a-r-r-o-w added 3 commits October 16, 2024 17:19

renormalize correctly

677637e

apply suggestions from review

a7fb0f7

apply suggestions from review

f2eaea9

sayakpaul reviewed Oct 17, 2024

View reviewed changes

sayakpaul approved these changes Oct 17, 2024

View reviewed changes

update

6588d97

a-r-r-o-w merged commit feb2e26 into main Oct 17, 2024

a-r-r-o-w deleted the improve-data-preparation branch October 17, 2024 11:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve dataset preparation support + multiresolution prep #39

Improve dataset preparation support + multiresolution prep #39

Uh oh!

a-r-r-o-w commented Oct 16, 2024 •

edited

Loading

Uh oh!

sayakpaul left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

a-r-r-o-w commented Oct 17, 2024

Uh oh!

sayakpaul Oct 17, 2024

Uh oh!

a-r-r-o-w Oct 17, 2024

Uh oh!

sayakpaul Oct 17, 2024 •

edited

Loading

Uh oh!

sayakpaul Oct 17, 2024

Uh oh!

a-r-r-o-w Oct 17, 2024

Uh oh!

Uh oh!

Improve dataset preparation support + multiresolution prep #39

Improve dataset preparation support + multiresolution prep #39

Uh oh!

Conversation

a-r-r-o-w commented Oct 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

a-r-r-o-w commented Oct 17, 2024

Uh oh!

sayakpaul Oct 17, 2024

Choose a reason for hiding this comment

Uh oh!

a-r-r-o-w Oct 17, 2024

Choose a reason for hiding this comment

Uh oh!

sayakpaul Oct 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sayakpaul Oct 17, 2024

Choose a reason for hiding this comment

Uh oh!

a-r-r-o-w Oct 17, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

a-r-r-o-w commented Oct 16, 2024 •

edited

Loading

sayakpaul Oct 17, 2024 •

edited

Loading