OpenML-Paper-Impact-Analysis

This repository contains the dataset and code used to analyse the impact of OpenML. The results are included in the OpenML cells paper. The analysis covers research papers that cite the core OpenML paper, the Python and R connector papers, or the benchmarking suite papers.

Data:
- data/collected_papers.csv: The originally collected data on 1786 papers from Google Scholar.
- data/Final_survey_data.csv: The cleaned dataset (after filtering papers by availability, language, and other criteria), together with the review results.

Code:
- scripts/analysis.py: Python script used to clean the data, run the statistical analyses, and generate the figures and tables for the paper.

Documentation:
- docs/methodology.md

Note: We exclude papers published in 2025 as the year is still in progress, to avoid skewed interpretations of trends.
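
The exclusion described in the note can be illustrated with a minimal sketch. It is not the code in scripts/analysis.py: the column name `year`, the helper names, and the sample rows are all hypothetical, and the real column layout of data/Final_survey_data.csv may differ.

```python
import csv
import io


def parse_year(raw):
    """Return the year as an int, or None if the cell is empty or malformed."""
    try:
        return int(float(raw))
    except (TypeError, ValueError):
        return None


def exclude_year(rows, year_column="year", excluded_year=2025):
    """Drop rows whose year equals excluded_year; keep everything else.

    `year_column` is an assumed column name, not one confirmed by the dataset.
    Rows with a missing or unparseable year are kept unchanged.
    """
    return [row for row in rows if parse_year(row.get(year_column)) != excluded_year]


# Made-up rows standing in for the real CSV; replace with
# open("data/Final_survey_data.csv", newline="") to read the actual file.
sample = io.StringIO(
    "title,year\n"
    "Paper A,2023\n"
    "Paper B,2025\n"
    "Paper C,\n"
    "Paper D,2024\n"
)
rows = list(csv.DictReader(sample))
filtered = exclude_year(rows)
print([row["title"] for row in filtered])  # ['Paper A', 'Paper C', 'Paper D']
```

Keeping rows without a parseable year is a choice made only for this sketch; an analysis of yearly trends might prefer to drop or flag them instead.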

