WhileOp reverse derivative #160
Conversation
// The primal is augmented to store the number of iterations

auto newWhile = cast<WhileOp>(gutils->getNewFromOriginal(orig));
auto cond = &newWhile.getCond().front();
Usually one of the variables is a loop induction variable, which means we don't need to store this. I'm not sure whether this optimization would go here or in a canonicalization, though.
Way more powerful versions of this live here: https://github.com/llvm/Polygeist/blob/77c04bb2a7a2406ca9480bcc9e729b07d2c8d077/lib/polygeist/Passes/CanonicalizeFor.cpp#L662
The advantage of doing this early, though, is that we can potentially determine a static loop count and then give all of the augmented tensors static shapes instead of <?x32>.
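To make the payoff concrete, here is a minimal sketch (a hypothetical helper, not code from this PR) of how a known static trip count would let the augmented cache tensors get a static leading dimension instead of a dynamic one:

#include "mlir/IR/BuiltinTypes.h"
#include "llvm/ADT/SmallVector.h"

using namespace mlir;

// Hypothetical helper: choose the type of the per-iteration cache tensor.
// When the trip count is known statically, the leading dimension becomes
// static (e.g. tensor<10x32xf32>) rather than dynamic (tensor<?x32xf32>).
RankedTensorType getAugmentedCacheType(std::optional<int64_t> tripCount,
                                       RankedTensorType elementType) {
  llvm::SmallVector<int64_t> shape;
  shape.push_back(tripCount ? *tripCount : ShapedType::kDynamic);
  shape.append(elementType.getShape().begin(), elementType.getShape().end());
  return RankedTensorType::get(shape, elementType.getElementType());
}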
I was able to reuse the analysis from enzyme-hlo-unroll, which we can make more robust in the future to support more complex patterns. The forward pass is not needed anymore if there is no cache push inside the body and the loop is a for loop.
In this case I don't think we even care whether the value is a constant; we just cache that value regardless.
[and if it's constant, the cache fixups ought to clean it up automatically for us]
SplatElementsAttr::get(unrankedTensorType,
                       ArrayRef<Attribute>(IntegerAttr::get(
                           bodyBuilder.getI64Type(), 1))));
Value bodyIterVar =
As weird and annoying as it is, we should probably use the createAdd from the autodiff type interface. Even though here we have to raise it back into a stablehlo.add, it does mean that new types will use the right add [if, for example, stablehlo.add is not correct for them]. I'm okay with this as-is, though, if you feel strongly.
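As a sketch, routing the add through the interface might look roughly like this (the method spelling createAddOp is an assumption; verify against Enzyme's actual AutoDiffTypeInterface):

#include "mlir/IR/Builders.h"
// plus Enzyme's AutoDiffTypeInterface header (path depends on the build)

using namespace mlir;

// Sketch only: dispatch the addition through the type's autodiff
// interface so non-stablehlo types get their own correct add.
Value addViaTypeInterface(OpBuilder &builder, Location loc, Value lhs,
                          Value rhs) {
  auto iface = cast<AutoDiffTypeInterface>(lhs.getType());
  return iface.createAddOp(builder, loc, lhs, rhs);
}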
Ah, apologies: this add is for a new induction variable.
In that case the same comment applies: we should reuse an existing induction variable [present in almost all cases currently], when possible.
Alternatively [or perhaps in addition], we really should import most of that while-op optimization machinery from Polygeist [which also does redundant induction variable elimination somewhere, iirc].
std::optional<int64_t> getConstantStart();
std::optional<int64_t> getConstantLimit();

/// Needs to be constant
Yeah, here for the get-number-of-iterations helper, we can take a builder and compute the number of iterations.
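A rough shape of that helper, with assumed names and a canonical positive-step for-loop (not the PR's actual interface):

#include "stablehlo/dialect/StablehloOps.h"

using namespace mlir;

// Sketch, not the PR's code: emit IR computing the trip count as
// (limit - start) / step for a canonical for-style loop; rounding and
// negative-step handling are deliberately omitted.
Value emitNumIterations(OpBuilder &builder, Location loc, Value start,
                        Value limit, Value step) {
  Value diff = builder.create<stablehlo::SubtractOp>(loc, limit, start);
  return builder.create<stablehlo::DivOp>(loc, diff, step);
}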
return %results1 : tensor<f64>
}
}
Can you add forward and reverse checks here?
// CHECK-NEXT: } do {
// CHECK-NEXT: %14 = stablehlo.add %iterArg, %c_0 : tensor<i64>
// CHECK-NEXT: "enzyme.set"(%4, %iterArg_2) : (!enzyme.Gradient<tensor<f64>>, tensor<f64>) -> ()
// CHECK-NEXT: %15 = "enzyme.get"(%4) : (!enzyme.Gradient<tensor<f64>>) -> tensor<f64>
We should make sure that remove-enzyme-ops can deal with this %4 case without extra work.
%4 is defined right before this in a dominating way, allowing %15 to be replaced with %iterArg_2.
Then, since only sets remain on it, we should be able to remove the whole thing.
No while-special-case handling required here.
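For illustration, the get-forwarding step could be a pattern along these lines (op accessor names here are assumptions, not Enzyme's exact API):

#include "mlir/IR/PatternMatch.h"
#include "mlir/Interfaces/SideEffectInterfaces.h"
// plus Enzyme's dialect header for enzyme::GetOp / enzyme::SetOp

using namespace mlir;

// Sketch: replace an enzyme.get with the value of a dominating enzyme.set
// on the same gradient when no intervening op may write it. A real
// implementation needs proper dominance and aliasing checks; afterwards,
// a gradient whose only remaining users are sets can be erased outright.
struct ForwardSetToGet : public OpRewritePattern<enzyme::GetOp> {
  using OpRewritePattern::OpRewritePattern;
  LogicalResult matchAndRewrite(enzyme::GetOp get,
                                PatternRewriter &rewriter) const override {
    for (Operation *prev = get->getPrevNode(); prev;
         prev = prev->getPrevNode()) {
      if (auto set = dyn_cast<enzyme::SetOp>(prev))
        if (set.getGradient() == get.getGradient()) {
          rewriter.replaceOp(get, set.getValue());
          return success();
        }
      // Conservatively stop at anything that might write the gradient.
      if (!isMemoryEffectFree(prev))
        return failure();
    }
    return failure();
  }
};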
// CHECK-NEXT: }
// CHECK-NEXT: %11 = "enzyme.get"(%0) : (!enzyme.Gradient<tensor<f64>>) -> tensor<f64>
// CHECK-NEXT: %12 = arith.addf %11, %10#1 : tensor<f64>
// CHECK-NEXT: "enzyme.set"(%0, %12) : (!enzyme.Gradient<tensor<f64>>, tensor<f64>) -> ()
And the same applies here with %0.
// CHECK-NEXT: %16 = "enzyme.pop"(%3) : (!enzyme.Cache<tensor<f64>>) -> tensor<f64>
// CHECK-NEXT: %17 = "enzyme.pop"(%2) : (!enzyme.Cache<tensor<f64>>) -> tensor<f64>
// CHECK-NEXT: %18 = stablehlo.multiply %15, %17 : tensor<f64>
// CHECK-NEXT: %19 = "enzyme.get"(%1) : (!enzyme.Gradient<tensor<f64>>) -> tensor<f64>
The %1 cache here confuses me. In principle your design is set up such that we don't have any gradient +='s that span outside of the while regions. Yet here %4 is not able to be mem2reg'd within scope, so something is clearly going awry...
cc @mofeing I think the same linearity argument from earlier applies here, where we don't need to care about conjugates [which is nice]