This project runs a detailed evaluation of webpage images on EduVisBench, reporting how well each webpage performs as an educational visualization.
- OpenAI Python Package: Install the required package:

  ```bash
  pip install openai
  ```
- OpenAI API Key: You must set your OpenAI API key as an environment variable. In your terminal, run:

  ```bash
  export OPENAI_API_KEY="your_actual_api_key_here"
  ```

  Replace `"your_actual_api_key_here"` with your actual API key.
- Answer Image Directory Structure: Organize your answer images as follows:
  - Inside `data/`, create a sub-directory for each question ID found in `data.json`.
  - Place all answer images (e.g., `.png`, `.jpg`, `.jpeg`) for a specific question directly into its corresponding ID folder.
  - The script will automatically detect whether there is a single answer image or multiple answer images based on the count of image files in this folder, roughly as sketched below.
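  A minimal sketch of that detection behavior (not the script's actual code; the function name is hypothetical, and the extension list matches the formats named above):

  ```python
  import os

  IMAGE_EXTS = {".png", ".jpg", ".jpeg"}

  def collect_answer_images(data_dir: str, question_id: str) -> list:
      """Gather all answer images inside data/<question_id>/.

      The caller can branch on len(result) == 1 to distinguish the
      single-image case from the multiple-image case.
      """
      folder = os.path.join(data_dir, question_id)
      return sorted(
          os.path.join(folder, name)
          for name in os.listdir(folder)
          if os.path.splitext(name)[1].lower() in IMAGE_EXTS
      )
  ```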
- Download Dataset: Get data from `Lekr0/EduVisBench` on Hugging Face (one possible method is sketched below).
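  One convenient way to fetch it is with the `huggingface_hub` package (an assumption on my part, not a documented requirement of this project; any method that produces the layout shown below works just as well):

  ```python
  from huggingface_hub import snapshot_download

  # Download the dataset repository into the current project folder.
  snapshot_download(
      repo_id="Lekr0/EduVisBench",
      repo_type="dataset",
      local_dir=".",
  )
  ```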
For the script to work correctly, your files should be organized as follows, relative to the `run_evaluation.py` script:
```
EduVisBench/                 <-- Your main project folder
├── run_evaluation.py        # The main evaluation script
│
├── data.json                # JSON file containing all questions.
│                            #   Each question object must have an "id" and a "question" field.
│                            #   - If "question" is a path (e.g., "image/my_q_image.png"),
│                            #     it's treated as an image question. The path should be
│                            #     relative to this EduVisBench/ folder.
│                            #   - Otherwise, "question" is treated as text.
│
├── data/                    # Directory containing answer images, organized by question ID
│   ├── {question_id_1}/     # Folder named with the exact ID from data.json
│   │   ├── answer_image_A.png
│   │   ├── answer_image_B.jpg
│   │   └── ...              # (any number of answer images for this question ID)
│   │
│   ├── {question_id_2}/     # Folder for another question ID
│   │   └── single_answer_image.png
│   │
│   └── ...                  # (other question ID folders)
│
└── README.md                # This file (ensure filename is README.md)
```
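For reference, `data.json` might look like the following (a sketch assuming the top level is a list of question objects; the IDs and the text question are illustrative, while the image path follows the convention described in the tree above):

```json
[
  {
    "id": "question_id_1",
    "question": "Explain how a pendulum's period depends on its length."
  },
  {
    "id": "question_id_2",
    "question": "image/my_q_image.png"
  }
]
```

A "question" value that is a path, like the second entry, is treated as an image question; anything else is treated as text.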
- Navigate to the `EduVisBench` directory in your terminal:

  ```bash
  cd path/to/your/EduVisBench
  ```

- Ensure your `OPENAI_API_KEY` environment variable is set (see Prerequisites).
- Run the script:

  ```bash
  python run_evaluation.py
  ```
- Output: The script will generate a JSON file named `evaluation.json` (by default) in the same directory. This file will contain the detailed evaluation results for each question. You can specify a different output file name or path using the `--output_file` argument:

  ```bash
  python run_evaluation.py --output_file my_custom_results.json
  ```

  If you provide a relative path, it will be relative to the script's directory. If you provide an absolute path, it will be used directly, as sketched below.
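  That path handling can be sketched as follows (a minimal illustration of the documented behavior, not the script's actual code; the function name is hypothetical):

  ```python
  import os

  def resolve_output_path(output_file: str = "evaluation.json") -> str:
      """Absolute paths are used as-is; relative paths are resolved
      against the directory that contains run_evaluation.py."""
      if os.path.isabs(output_file):
          return output_file
      script_dir = os.path.dirname(os.path.abspath(__file__))
      return os.path.join(script_dir, output_file)
  ```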