Syracuse University Libraries

DH Workshop: Introduction to Text Mining with HathiTrust Research Center

Workshop Guide

Patrick Williams
Lead Librarian for Digital & Open Scholarship

About HathiTrust & HTRC

Founded in 2008, HathiTrust is a not-for-profit collaborative of academic and research libraries preserving 17+ million digitized items. HathiTrust offers reading access to the fullest extent allowable by U.S. copyright law, computational access to the entire corpus for scholarly research, and other emerging services based on the combined collection. HathiTrust members steward the collection — the largest set of digitized books managed by academic and research libraries — under the aims of scholarly, not corporate, interests.

Syracuse University is an Institutional Member of HathiTrust, which gives affiliated users free full-text downloads of complete volumes, special access to the HathiTrust Research Center (HTRC), and some other benefits. You can log in to HathiTrust with your NetID to access these benefits.

Workshop Resources

HathiTrust Resources
Example Collections for using HTRC Analytics

HTRC Output

Output Examples in case of Server Problems
Named Entity Recognizer example interface and HTRC CSV output examples