[camb]add w8a8 support #176

JackWeiw · 2025-02-19T03:14:54Z

No description provided.

jinminxi104 · 2025-02-26T06:04:27Z

dlinfer/vendor/camb/camb_ops.py

+    we need to reshape the input tensor to 2D tensor if it is 3D tensor.
+    """
+    bsz, seq_len = None, None
+    if x.dim() == 3:


please merge dim==3 and dim==2. just reshape to (bsz * seq_len, -1) for both dim3 and 2

jinminxi104 · 2025-02-26T06:05:51Z

dlinfer/vendor/camb/camb_ops.py

+    assert quant_dtype == torch.int8
+    assert quant_granularity == "PER_TOKEN"
+    if x.shape[-1] in smooth_dic:
+        smooth = smooth_dic[x.shape[-1]]


could u explain more about smooth_dic

jinminxi104 · 2025-02-26T13:01:44Z

dlinfer/vendor/camb/__init__.py

@@ -1,3 +1,6 @@
 import torch

 from . import pytorch_patch, camb_ops
+
+# TODO. weitao: camb torch-mlu-ops-v1.2.0 per_token_smooth_quantize need smooth_vec
+SMOOTH_VEC = torch.ones(8000, dtype=torch.float32, device="mlu")


please change to 8*1024

jinminxi104 · 2025-02-26T13:02:37Z

dlinfer/vendor/camb/camb_ops.py

@@ -6,6 +6,8 @@
 from dlinfer.utils.registry import register_ops
 from dlinfer.utils.type_annotation import Tensor, Optional, Sequence, Tuple

+from .__init__ import SMOOTH_VEC


from .__init__ ==> from .

jinminxi104 · 2025-02-26T13:08:03Z

dlinfer/vendor/camb/camb_ops.py

+    if x.shape[-1] <= SMOOTH_VEC.shape[0]:
+        smooth = SMOOTH_VEC[: x.shape[-1]]
+    else:
+        smooth = torch.ones(x.shape[-1], dtype=torch.float32, device=x.device)


Don't create a new one locally, update the global SMOOTH_VEC. and round up to 2^N K.

[camb]add w8a8 support

d42d52c

JackWeiw requested a review from jinminxi104 as a code owner February 19, 2025 03:14

JackWeiw added camb platform camb enhancement New feature or request labels Feb 19, 2025

jinminxi104 requested changes Feb 26, 2025

View reviewed changes

change smooth vec at init time

bd9f91c

jinminxi104 requested changes Feb 26, 2025

View reviewed changes

JackWeiw added 4 commits February 27, 2025 10:27

[camb]update smooth to next pow of 2

a7e141f

[camb]format

17f53b5

[camb]merge process dim2 and 3

11b7820

[camb]format

4f624bc

jinminxi104 merged commit e01ad15 into DeepLink-org:main Feb 27, 2025
4 checks passed

JackWeiw deleted the w8a8 branch March 10, 2025 02:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[camb]add w8a8 support #176

[camb]add w8a8 support #176

Uh oh!

JackWeiw commented Feb 19, 2025

Uh oh!

jinminxi104 Feb 26, 2025

Uh oh!

jinminxi104 Feb 26, 2025

Uh oh!

jinminxi104 Feb 26, 2025

Uh oh!

JackWeiw Feb 27, 2025

Uh oh!

jinminxi104 Feb 26, 2025

Uh oh!

jinminxi104 Feb 26, 2025

Uh oh!

JackWeiw Feb 27, 2025

Uh oh!

Uh oh!

Uh oh!

[camb]add w8a8 support #176

[camb]add w8a8 support #176

Uh oh!

Conversation

JackWeiw commented Feb 19, 2025

Uh oh!

jinminxi104 Feb 26, 2025

Choose a reason for hiding this comment

Uh oh!

jinminxi104 Feb 26, 2025

Choose a reason for hiding this comment

Uh oh!

jinminxi104 Feb 26, 2025

Choose a reason for hiding this comment

Uh oh!

JackWeiw Feb 27, 2025

Choose a reason for hiding this comment

Uh oh!

jinminxi104 Feb 26, 2025

Choose a reason for hiding this comment

Uh oh!

jinminxi104 Feb 26, 2025

Choose a reason for hiding this comment

Uh oh!

JackWeiw Feb 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!