Jumping dawn 1790 #28

Closed
wants to merge 23 commits from the jumping-dawn-1790 branch

Conversation

johndpope
Owner

Doesn't collapse with all of the following disabled:

  1. no noise
  2. no mixed precision
  3. no R1 regularization
  4. no EMA (R1 and EMA are sketched below for reference)
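A minimal PyTorch sketch of those two pieces (R1 penalty and generator-weight EMA), corresponding to the r1_gamma/r1_interval and ema_decay keys in the config below; the helper names are illustrative, not taken from this repo:

```python
import torch

def r1_penalty(discriminator, real_images):
    """R1 regularization: squared gradient norm of D's output w.r.t. real
    images only; typically scaled by r1_gamma / 2 and applied every
    r1_interval steps."""
    real_images = real_images.detach().requires_grad_(True)
    real_scores = discriminator(real_images)
    grads, = torch.autograd.grad(
        outputs=real_scores.sum(), inputs=real_images, create_graph=True
    )
    return grads.pow(2).reshape(grads.shape[0], -1).sum(1).mean()

@torch.no_grad()
def update_ema(ema_model, model, decay=0.999):
    """EMA of generator weights: ema = decay * ema + (1 - decay) * current."""
    for ema_p, p in zip(ema_model.parameters(), model.parameters()):
        ema_p.lerp_(p, 1.0 - decay)
```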

loss:
type: "wasserstein" (unsure; maybe vanilla is better, see the comparison sketch below)

https://wandb.ai/snoozie/IMF/runs/efvle5h6?nw=nwusersnoozie
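On the Wasserstein-vs-vanilla question, here is a minimal sketch of the two loss formulations, assuming raw (unnormalized) discriminator logits; an illustrative comparison, not this repo's implementation:

```python
import torch
import torch.nn.functional as F

def d_loss(real_scores, fake_scores, kind="wasserstein"):
    """Discriminator/critic loss on raw logits."""
    if kind == "wasserstein":
        # WGAN critic: maximize E[D(real)] - E[D(fake)]; pair with a gradient penalty
        return fake_scores.mean() - real_scores.mean()
    # "vanilla": non-saturating GAN with binary cross-entropy on logits
    real_loss = F.binary_cross_entropy_with_logits(real_scores, torch.ones_like(real_scores))
    fake_loss = F.binary_cross_entropy_with_logits(fake_scores, torch.zeros_like(fake_scores))
    return real_loss + fake_loss

def g_loss(fake_scores, kind="wasserstein"):
    """Generator loss on raw logits."""
    if kind == "wasserstein":
        return -fake_scores.mean()
    return F.binary_cross_entropy_with_logits(fake_scores, torch.ones_like(fake_scores))
```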

# Model parameters
model:
  latent_dim: 32
  base_channels: 64
  num_layers: 4
  use_resnet_feature: False
  use_mlgffn: False
# Training parameters
training:
  initial_video_repeat: 1
  final_video_repeat: 1
  use_ema: False
  use_r1_reg: False
  batch_size: 2
  num_epochs: 1000
  save_steps: 250
  learning_rate_g: 1.0e-4 # Reduced learning rate for generator
  initial_learning_rate_d: 3e-6  # Set a lower initial learning rate for discriminator
  # learning_rate_g: 5.0e-4  # Increased learning rate for generator
  # learning_rate_d: 5.0e-4  # Increased learning rate for discriminator
  ema_decay: 0.999
  style_mixing_prob: 0.0
  noise_magnitude: 0.0
  final_noise_magnitude: 0.001
  gradient_accumulation_steps: 1
  lambda_pixel: 10  # lambda_pixel = 10 in the paper; adjust as needed
  lambda_perceptual: 10  # lambda_perceptual = 10 in the paper
  lambda_adv: 1  # adversarial weight = 1
  lambda_gp: 10  # Gradient penalty coefficient
  lambda_mse: 1.0
  n_critic: 2  # Number of discriminator updates per generator update
  clip_grad_norm: 1.0  # Maximum norm for gradient clipping
  r1_gamma: 10
  r1_interval: 16
  label_smoothing: 0.1

  min_learning_rate_d: 1.0e-6
  max_learning_rate_d: 1.0e-3
  d_lr_adjust_frequency: 100  # Adjust D learning rate every 100 steps
  d_lr_adjust_factor: 2.0  # Factor to increase/decrease D learning rate
  target_d_loss_ratio: 0.6  # Target ratio of D loss to G loss
  every_xref_frames: 16
  use_many_xrefs: False
  
  scales: [1, 0.5, 0.25, 0.125]
  enable_xformers_memory_efficient_attention: True

# Dataset parameters
dataset:
  # celeb-hq torrent https://github.com/johndpope/MegaPortrait-hack/tree/main/junk
  root_dir: "/media/oem/12TB/Downloads/CelebV-HQ/celebvhq/35666" # for overfitting M2Ohb0FAaJU_1.mp4 use https://github.com/johndpope/MegaPortrait-hack/tree/main/junk
  json_file: './data/overfit.json'  # Selena Gomez
  # json_file: './data/celebvhq_info.json' # 35k

# Checkpointing
checkpoints:
  dir: "./checkpoints"
  interval: 10

# Logging and visualization
logging:
  log_every: 250
  sample_every: 100
  sample_size: 1 # for images on wandb
  output_dir: "./samples"
  visualize_every: 100  # Visualize latent tokens every 100 batches
  print_model_details: False

# Accelerator settings
accelerator:
  mixed_precision: "no"  # Options: "no", "fp16", "bf16"
  cpu: false
  num_processes: 1  # Set to more than 1 for multi-GPU training

# Discriminator parameters
discriminator:
  ndf: 64  # Number of filters in the first conv layer

# Optimizer parameters
optimizer:
  beta1: 0.5
  beta2: 0.999

# Loss function
loss:
  type: "wasserstein"  # Changed to Wasserstein loss for WGAN-GP
  weights:
      perceptual: [10, 10, 10, 10, 10]
      equivariance_shift: 10
      equivariance_affine: 10
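Since loss.type is "wasserstein" and lambda_gp is 10, the critic presumably uses a WGAN-GP style gradient penalty. A standard reference sketch in PyTorch, assuming 4-D image tensors; not necessarily how this repo computes it:

```python
import torch

def gradient_penalty(discriminator, real, fake, lambda_gp=10.0):
    """WGAN-GP: penalize deviation of the critic's gradient norm from 1
    at random interpolations between real and fake samples."""
    batch = real.size(0)
    eps = torch.rand(batch, 1, 1, 1, device=real.device)
    interp = (eps * real + (1.0 - eps) * fake.detach()).requires_grad_(True)
    scores = discriminator(interp)
    grads, = torch.autograd.grad(
        outputs=scores.sum(), inputs=interp, create_graph=True
    )
    grad_norm = grads.reshape(batch, -1).norm(2, dim=1)
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()
```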

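The min_learning_rate_d / max_learning_rate_d / d_lr_adjust_* / target_d_loss_ratio keys suggest the discriminator's learning rate is retuned during training. The sketch below is one plausible reading of those keys (check every d_lr_adjust_frequency steps, scale by d_lr_adjust_factor based on the D/G loss ratio, clamp to the min/max bounds); this is an interpretation of the config, not confirmed repo behavior:

```python
def adjust_d_lr(optimizer_d, d_loss_val, g_loss_val, step,
                frequency=100, factor=2.0, target_ratio=0.6,
                min_lr=1.0e-6, max_lr=1.0e-3):
    """Hypothetical adaptive discriminator LR driven by the D/G loss ratio:
    if D looks too strong (ratio below target), slow it down; otherwise speed it up."""
    if step == 0 or step % frequency != 0:
        return
    ratio = abs(d_loss_val) / (abs(g_loss_val) + 1e-8)
    for group in optimizer_d.param_groups:
        lr = group["lr"]
        lr = lr / factor if ratio < target_ratio else lr * factor
        group["lr"] = min(max_lr, max(min_lr, lr))
```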
@johndpope johndpope closed this Aug 28, 2024
@johndpope johndpope deleted the jumping-dawn-1790 branch September 4, 2024 01:13