candle-onnx: Implement layer normalization operator #2919


Open · wants to merge 16 commits into base: main
Conversation

@BrunoSienkiewicz (Contributor) commented Apr 24, 2025

Added Layer Normalization operator with tests.
Related issue: #2849

@A2va (Contributor) commented Apr 24, 2025

Why not use LayerNorm from candle-nn?

Or is that a different thing? (I'm not that familiar with ML things.)

@BrunoSienkiewicz (Contributor, Author) replied:

Thank you for your comment. Honestly, I didn't see the implementation in candle-nn; maybe it can be used here. However, I do see a difference in the ONNX version of LayerNorm: it takes an additional axis parameter. I will mark this PR as a draft until I clear this up.
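
For context, here is a minimal sketch of how the ONNX axis attribute can be handled: dimensions before axis are flattened into rows and the remaining dimensions are normalized as columns. The function name layer_norm_with_axis and its exact signature are illustrative assumptions, not the code in this PR; negative axis values are allowed by the ONNX spec, which is why the sketch resolves them against the rank first.

```rust
use candle::{Result, Tensor};

/// Illustrative sketch: ONNX-style LayerNormalization with an `axis` attribute.
/// Dimensions before `axis` are flattened into rows; the rest are normalized.
fn layer_norm_with_axis(
    xs: &Tensor,
    scale: &Tensor,
    bias: &Tensor,
    axis: i64,
    eps: f64,
) -> Result<Tensor> {
    let dims = xs.dims();
    // ONNX allows a negative axis; resolve it against the tensor rank.
    let axis = if axis < 0 { (dims.len() as i64 + axis) as usize } else { axis as usize };
    let rows: usize = dims[..axis].iter().product();
    let cols: usize = dims[axis..].iter().product();
    let x_mat = xs.reshape((rows, cols))?;
    // Per-row mean and variance, then normalize, scale and shift.
    let mean = x_mat.mean_keepdim(1)?;
    let centered = x_mat.broadcast_sub(&mean)?;
    let var = centered.sqr()?.mean_keepdim(1)?;
    let normed = centered.broadcast_div(&(var + eps)?.sqrt()?)?;
    let y_mat = normed
        .broadcast_mul(&scale.reshape((1, cols))?)?
        .broadcast_add(&bias.reshape((1, cols))?)?;
    // Restore the original shape of the input.
    y_mat.reshape(dims.to_vec())
}
```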

@BrunoSienkiewicz marked this pull request as draft on April 24, 2025, 20:41
@BrunoSienkiewicz (Contributor, Author) commented:
I have changed the implementation to use the built-in candle-nn layer normalization. All tests are passing, so I think everything should be alright with this approach.

@BrunoSienkiewicz marked this pull request as ready for review on April 26, 2025, 19:07

Review comment from a Collaborator on:

    let x_mat = xs.reshape((row_number, col_number))?;
    let y_mat = candle_nn::ops::layer_norm_slow(
Why use layer-norm slow here rather than the optimized variants?
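
For reference, a sketch of what routing the already-flattened 2D matrix through the higher-level candle_nn::LayerNorm module could look like. Whether this is the "optimized variant" the reviewer has in mind, and the exact constructor arguments assumed here, should be checked against candle-nn; the helper name is hypothetical.

```rust
use candle::{Result, Tensor};
use candle_nn::Module;

// Sketch: normalize the rows of an already-flattened 2D matrix through the
// candle_nn::LayerNorm module (which normalizes over the last dimension).
// `scale` and `bias` correspond to the ONNX Scale/B inputs, flattened to `cols`.
fn layer_norm_via_module(x_mat: &Tensor, scale: &Tensor, bias: &Tensor, eps: f64) -> Result<Tensor> {
    let ln = candle_nn::LayerNorm::new(scale.clone(), bias.clone(), eps);
    ln.forward(x_mat)
}
```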

Review comment from a Collaborator on:

        .to_dtype(DType::F32)?;

    let expected = Tensor::new(expected, &Device::Cpu)?.to_dtype(DType::F32)?;
    match expected.dims().len() {
Why split these cases and not compare the tensors directly?

@BrunoSienkiewicz (Contributor, Author) replied:
At the start I thought it would be a good idea to test different tensor dimensionalities, but in the end they are cast to 2D, so it would not matter. I changed the test case to compare the tensors directly.
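
For illustration, a direct comparison in the test could look like the sketch below; the helper name assert_tensors_eq_2d is hypothetical, and an epsilon-based comparison may be preferable to exact float equality depending on how the expected values are produced.

```rust
use candle::{DType, Result, Tensor};

// Hypothetical test helper: compare two 2D tensors value-for-value.
// Exact equality works when both sides come from identical arithmetic;
// otherwise an epsilon-based comparison is the safer choice.
fn assert_tensors_eq_2d(produced: &Tensor, expected: &Tensor) -> Result<()> {
    let produced = produced.to_dtype(DType::F32)?.to_vec2::<f32>()?;
    let expected = expected.to_dtype(DType::F32)?.to_vec2::<f32>()?;
    assert_eq!(produced, expected);
    Ok(())
}
```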
