🎯
Focusing
Pinned Loading
-
gpustack/gpustack
gpustack/gpustack PublicSimple, scalable AI model deployment on GPU clusters
-
gpustack/llama-box
gpustack/llama-box PublicLM inference server implementation based on *.cpp.
-
gpustack/gguf-parser-go
gpustack/gguf-parser-go PublicReview/Check GGUF files and estimate the memory usage and maximum tokens per second.
-
gpustack/gguf-packer-go
gpustack/gguf-packer-go PublicDeliver LLMs of GGUF format via Dockerfile.
-
seal-io/hermitcrab
seal-io/hermitcrab PublicAvailable Terraform Provider network mirroring service.
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.