Llama 3.2 FastAPI Integration

A FastAPI service that interfaces with Ollama to provide a REST API for the Llama 3.2 model.

Features

REST API for text generation with Llama 3.2
Docker container for easy deployment
Integration with Ollama for model management

Quick Start

# Clone the repository
git clone https://github.com/jwill9999/llama3.2.git
cd llama3.2

# Build the services
docker-compose build

# Start the services
docker-compose up -d

# Access the API at http://localhost:8000
# Use http://localhost:8000/docs for the OpenAPI documentation

Alternative: Using Make Commands

# Build the services
make build

# Start the services
make up

# Access the API at http://localhost:8000

API Endpoints

GET /: Health check endpoint
GET /ask?prompt=YOUR_PROMPT: Generate a response to the given prompt

Docker Image

# Pull the image
docker pull jwill9999/llama3.2-api:latest

# Run the container
docker run -p 8000:8000 jwill9999/llama3.2-api:latest

Available Commands

Basic Operations

Command	Description
`make build`	Build the Docker images
`make up`	Start the services in detached mode
`make dev`	Start services with console output
`make down`	Stop the services
`make logs`	View service logs
`make restart`	Restart all services

Advanced Operations

Command	Description
`make prod VERSION=1.2.0`	Start services with specific version
`make pull VERSION=1.2.0`	Pull images with specific version
`make clean`	Stop services and remove volumes
`make test`	Build and start with test version
`make build-api`	Build only the api service
`make build-ollama`	Build only the ollama service
`make push`	Push images to Docker Hub
`make build-hub VERSION=1.2.0`	Pull and restart with specific version
`make build-hub`	Pull and restart latest version

Versioning

Command	Description
`make push-version`	Tag and push all services with patch version bump
`make push-version BUMP_TYPE=minor`	Bump minor version (1.0.0 → 1.1.0)
`make push-version BUMP_TYPE=major`	Bump major version (1.0.0 → 2.0.0)

Project Structure

llama3.2/
├── docker/
│   ├── fastapi/
│   │   └── Dockerfile
│   └── ollama/
│       └── Dockerfile
├── scripts/
│   ├── tag-version.sh     # Version tagging script
│   ├── build-and-tag.sh   # Build and tag script
│   └── pull-llama3.2.sh   # Pull Llama model script
├── public/
│   └── ollama.jpg         # Logo image
├── main.py                # FastAPI application
├── requirements.txt       # Python dependencies
├── compose.yml            # Docker Compose configuration
├── makefile               # Make commands
└── VERSION                # Current version file

Requirements

Docker and Docker Compose
An Ollama instance with the Llama 3.2 model loaded

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Llama 3.2 FastAPI Integration

Features

Quick Start

Alternative: Using Make Commands

API Endpoints

Docker Image

Available Commands

Basic Operations

Advanced Operations

Versioning

Project Structure

Requirements

License

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
docker		docker
public		public
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
VERSION		VERSION
compose.yml		compose.yml
main.py		main.py
makefile		makefile
open-webui-config.json		open-webui-config.json
requirements.txt		requirements.txt

License

jwill9999/llama3.2

Folders and files

Latest commit

History

Repository files navigation

Llama 3.2 FastAPI Integration

Features

Quick Start

Alternative: Using Make Commands

API Endpoints

Docker Image

Available Commands

Basic Operations

Advanced Operations

Versioning

Project Structure

Requirements

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages