Closed
Description
There are 2.6M items on the Internet Archive Openlibraries Wishlist. We want to make sure each of these books has a corresponding catalog entry in openlibrary.org.
Steps
- Download the
wishlist
dataset from Add Open Library record for every work/edition in Internet Archive 2020 Wishlist #869 (@mekarpeles will provide an updated link) - We will take the first 1,000 items in the wishlist (as a test) and see (2.a.) how many are in Open Library...
- As a first run, we can check these 1,000 wishlist items against the Open Library
books API
: https://openlibrary.org/dev/docs/api/books. This won't scale to 2.6M editions which is how many are on the wishlist. So, instead, we'll need to download + process theeditions
data dump https://openlibrary.org/data/ol_dump_editions_latest.txt.gz
- As a first run, we can check these 1,000 wishlist items against the Open Library
See: https://archive.org/details/open_libraries_wish_list
https://archive.org/download/open_libraries_wish_list/wish_list_isbn13_ver_1.csv.zip
@thisismattmiller -- can get a master csv which collates isbn10, isbn13, and oclc (into one row)?
We want to add all of these records into Open Library so we can query by any of these identifier fields and retrieve the metadata for the book.
Also, do we have metadata on these books?