
A single A100-80G can't run Llama-2-70b model? #106

Open

JustVelkhana opened this issue Apr 17, 2025 · 1 comment

Comments

@JustVelkhana

We ran out of memory, and adding '--multigpu' didn't help either.
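
For reference, a rough back-of-envelope estimate (my own, assuming fp16 weights and ignoring activations or any buffers the quantization pipeline allocates) shows why the unquantized weights alone cannot fit on a single A100-80G:

# Approximate size of the fp16 weights alone, before quantization.
params = 70e9           # Llama-2-70b parameter count (approximate)
bytes_per_param = 2     # fp16
print(f"{params * bytes_per_param / 2**30:.0f} GiB")  # ~130 GiB, well above a single 80 GiB A100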

@JustVelkhana
Author

Our setting:
CUDA_VISIBLE_DEVICES=2,3 python main.py \
    --model /cache/model/Llama-2-70b-hf \
    --epochs 20 --output_dir ./log/Llama-2-70b-w4a16 \
    --wbits 4 --abits 16 --lwc \
    --tasks piqa \
    --act-scales /cache/pt_for_let/Llama-2-70b-hf_scales.pt --act-shifts /cache/pt_for_let/Llama-2-70b-hf_shifts.pt \
    --multigpu
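
If --multigpu seems to have no effect, one sanity check (a minimal sketch using plain PyTorch, not part of this repo) is to confirm the process actually sees both GPUs; with CUDA_VISIBLE_DEVICES=2,3 they are re-indexed inside the process as cuda:0 and cuda:1:

import torch

# With CUDA_VISIBLE_DEVICES=2,3 the process should report exactly two
# devices, re-indexed as cuda:0 and cuda:1.
print(torch.cuda.device_count())  # expect 2

for i in range(torch.cuda.device_count()):
    free, total = torch.cuda.mem_get_info(i)  # bytes
    print(f"cuda:{i}: {free / 2**30:.1f} GiB free of {total / 2**30:.1f} GiB")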
