-
-
Notifications
You must be signed in to change notification settings - Fork 307
Issues: turboderp-org/exllamav2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG]Exllamav2 repeats itself in the answer
bug
Something isn't working
#764
opened Apr 1, 2025 by
manitadayon
3 tasks done
convert.py exits with a "ValueError: ## Could not find lm_head.* in model" error
bug
Something isn't working
#763
opened Mar 30, 2025 by
CyntexMore
3 tasks done
qwq32b run good in colab t4
bug
Something isn't working
#761
opened Mar 24, 2025 by
kim90000
3 tasks done
[BUG] Windows 11 Tensor Parallelism slow
bug
Something isn't working
#760
opened Mar 23, 2025 by
frenzybiscuit
3 tasks done
[BUG] 0.2.7 had smaller quant sizes
bug
Something isn't working
#759
opened Mar 23, 2025 by
frenzybiscuit
3 tasks done
[BUG] Cant convert model Qwen2.5-VL-7B-Instruct
bug
Something isn't working
#757
opened Mar 21, 2025 by
MadMenHitBooker
3 tasks done
[BUG] Loss in Accuracy with Paged=False with Qwen2.5_VL Vision Models on Linux
bug
Something isn't working
#753
opened Mar 18, 2025 by
RaahimSiddiqi
3 tasks done
[BUG] Bug in attention mechanism when Paged=False for Qwen2.5_VL Models
bug
Something isn't working
#752
opened Mar 18, 2025 by
RaahimSiddiqi
3 tasks done
[REQUEST] It is very difficult to service exlamav2 using RestfullAPI.
#748
opened Mar 12, 2025 by
nalgae
[REQUEST]Support for the New Aya-Vision32b models
#746
opened Mar 9, 2025 by
GoudaCouda
3 tasks done
[BUG] Qwen-vl can't produce coordinates
bug
Something isn't working
#740
opened Feb 22, 2025 by
Tedy50
3 tasks done
[BUG] Significant prompt processing speed difference when using Tensor Parallelism
bug
Something isn't working
#734
opened Feb 16, 2025 by
ThomasBaruzier
3 tasks done
[BUG] When trying inference with Qwen2.5-VL-72B with Qwen2.5-VL-7B as a draft model, I get "IndexError: index out of range in self" (both models have identical vocab.json)
bug
Something isn't working
#733
opened Feb 6, 2025 by
Lissanro
3 tasks done
[BUG] Exception in ASGI application when trying inference with an image wit h Qwen2.5-VL-72B
bug
Something isn't working
#732
opened Feb 5, 2025 by
Lissanro
3 tasks done
[BUG] Mistral-Small-24B-Instruct-2501 - Tensor Parallel outputs garbled text.
bug
Something isn't working
#728
opened Jan 31, 2025 by
mindkrypted
3 tasks done
Previous Next
ProTip!
no:milestone will show everything without a milestone.