Skip to content

WIP: Add Capture Dataset #974

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from
Open

WIP: Add Capture Dataset #974

wants to merge 3 commits into from

Conversation

bodsul
Copy link

@bodsul bodsul commented May 2, 2025

Add evaluation for newly released CAPTURE dataset. Arxiv preprint: https://arxiv.org/abs/2504.15485, repo: https://github.com/atinpothiraj/CAPTURe.

Opened issue regarding prompt choice and using Llama-3.1-8B-Instruct to extract answers here. Will update PR from WIP once this issues are clarified.

@kennymckormick
Copy link
Member

Hi, @bodsul ,

Maybe you can try to re-implement and use some API models (like gpt-4o / gpt-4.1-mini). If you don't have the credit for GPT API, you can just implement in this PR and I'll help check the results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants