- This deposit includes the dataset, code, and files used and created by the Duke University Data+ 2021 Rubenstein Library Card Catalog Team. Working with the digitized cards from the David M. Rubenstein Rare Book and Manuscript Library's physical card catalogs, our team explored the files as a way to further the library's initiative of finding and describing historically marginalized voices in its collections.
Using natural language processing and some manual editing, we created a structured dataset from the OCRed text of the scanned cards. The dataset is sorted by collection of items within the catalog and contains important metadata such as author, location, and date written. With the dataset we ...
- Total Size
- 13 files (111 MB)
- Data Citation
- Smith, H. A., & Garomsa, B. (2021). Data and scripts from: Rubenstein Library card catalog. Duke Research Data Repository. https://doi.org/10.7924/r4br8v905
- Creator
- DOI
- 10.7924/r4br8v905
- Publication Date
- August 5, 2021
- ARK
- ark:/87924/r4br8v905
- Contributor
- Publisher
- Location
- Durham
- Language
- Type
- Funding Agency
- Duke Rhodes Information Initiative
- Contact
- Meghan Lyon: meghan.lyon@duke.edu
- Title
- Data and scripts from: Rubenstein Library card catalog
- Repository
Title | Date Uploaded |
---|---|
Project Overview.docx | 2021-08-05 |
README.md | 2021-08-05 |
Rubenstein-Library-Card-Catalog.zip | 2021-08-05 |
Jupyter PDF exports | 2021-08-05 |