Nougat OCR

Introduction

This repository contains the source code for Nougat OCR, a tool for Optical Character Recognition (OCR) using the Nougat model. Follow the instructions below to set up the environment and run the OCR.

Installation

1. Clone this repository:

git clone https://github.com/cudanexus/nougat.git

Download the model files from Hugging Face using Git LFS:

Make sure you have Git LFS installed (see the Git LFS installation instructions).
Run the following commands:

git lfs install
git clone https://huggingface.co/spaces/tomriddle/nougat

2. After running the commands above, the folder cloned from Hugging Face should look like this:

input
Upload nougat.pdf
nougat
output
Upload nougat.pdf
README.md
app.py
requirements.txt

3. Copy the `nougat` folder (which contains all the model files) from the Hugging Face clone to the root of this repository. The repository structure should then look like this:

input
nougat
--- config.json
--- pytorch_model.bin
--- special_tokens_map.json
--- tokenizer.json
--- tokenizer_config.json
output
app.py
cog.yaml
output.txt
predict.py
requirements.txt

4. Install the required Python packages:

pip install -r requirements.txt
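
Before moving on to testing, it can help to confirm that the `nougat` folder from step 3 is in place. The following is a minimal sketch, not part of this repository: the function name `check_model_folder` is hypothetical, and the file names are simply the ones shown in the step 3 listing. It also flags files that are still Git LFS pointer stubs, which is what you get when a clone was made without Git LFS installed.

```python
from pathlib import Path

# File names taken from the folder listing in step 3.
EXPECTED_MODEL_FILES = [
    "config.json",
    "pytorch_model.bin",
    "special_tokens_map.json",
    "tokenizer.json",
    "tokenizer_config.json",
]

# First bytes of a Git LFS pointer file: the small text stub that stands in
# for a large file when the real content was never downloaded.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def check_model_folder(repo_root="."):
    """Return a list of human-readable problems; an empty list means OK.

    Hypothetical helper for illustration; it is not shipped with the repo.
    """
    model_dir = Path(repo_root) / "nougat"
    if not model_dir.is_dir():
        return [f"missing folder: {model_dir}"]

    problems = []
    for name in EXPECTED_MODEL_FILES:
        path = model_dir / name
        if not path.is_file():
            problems.append(f"missing file: {path}")
            continue
        # Read only as many bytes as the pointer prefix is long.
        with path.open("rb") as fh:
            head = fh.read(len(LFS_POINTER_PREFIX))
        if head == LFS_POINTER_PREFIX:
            problems.append(f"Git LFS pointer, not the real file: {path}")
    return problems


if __name__ == "__main__":
    issues = check_model_folder(".")
    if issues:
        print("\n".join(issues))
    else:
        print("Model folder looks complete.")
```

Run it from the root of this repository. If it reports a Git LFS pointer, running `git lfs install` and cloning the Hugging Face folder again should fetch the real files.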

Testing

Ensure that everything is installed correctly by running:

python app.py --pdf_file input/nougat.pdf

If the installation is successful, you should see the OCR output.
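
To run the same test command over several PDFs, a small wrapper script is one option. The sketch below is an assumption-laden illustration rather than a feature of this repository: `build_commands` and `run_all` are hypothetical names, and the only thing taken from the README is the `python app.py --pdf_file <path>` invocation. How `app.py` reports its result is not specified here, so the wrapper only looks at the exit code.

```python
import subprocess
import sys
from pathlib import Path


def build_commands(input_dir="input", script="app.py"):
    """Build one `python app.py --pdf_file <path>` command per PDF.

    PDFs are processed in sorted order; a missing folder yields no commands.
    """
    pdfs = sorted(Path(input_dir).glob("*.pdf"))
    return [[sys.executable, script, "--pdf_file", str(pdf)] for pdf in pdfs]


def run_all(input_dir="input", script="app.py"):
    """Run the commands one after another and return the PDF paths that failed."""
    failed = []
    for cmd in build_commands(input_dir, script):
        result = subprocess.run(cmd)
        if result.returncode != 0:
            failed.append(cmd[-1])  # last argument is the PDF path
    return failed


if __name__ == "__main__":
    failures = run_all()
    for pdf in failures:
        print(f"failed: {pdf}")
```

Running each PDF in its own process keeps the wrapper independent of the internals of `app.py`, at the cost of reloading the model for every file.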

Additional Information

For any issues or questions, please refer to the repository or contact the repository owner.
