Skip to content

Repository for the analysis of research papers citing OpenML, and script used for systematic literature review and impact analysis.

Notifications You must be signed in to change notification settings

openml/OpenML-Paper-Impact-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

75 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OpenML-Paper-Impact-Analysis

This repository contains the dataset and code used to analyse the impact of OpenML. The results are included in the OpenML cells paper. The analysis focuses on research papers citing the core OpenML paper, Python and R connectors, and benchmarking suite papers.

Contents

  1. Data:
    data/collected_papers.csv: Contains the originally collected data on 1786 papers from Google Scholar.
    data/Final_survey_data.csv: The cleaned dataset (after filtering papers based on availability, language, and other criteria) with review results.
  2. Code:
    scripts/analysis.py: Python scripts used to clean the data, run statistical analyses, and generate figures/tables for the paper.
  3. Documentation:
    docs/methodology.md

Note: We exclude papers published in 2025 as the year is still in progress, to avoid skewed interpretations of trends.

About

Repository for the analysis of research papers citing OpenML, and script used for systematic literature review and impact analysis.

Resources

Stars

Watchers

Forks

Sponsor this project

  •  

Packages

No packages published

Contributors 7