Syracuse University Libraries is among the first institutions in the U.S. to subscribe to ProQuest’s new text and data mining platform, known as ProQuest TDM Studio. Currently in its beta release, this tool allows research teams at Syracuse University the ability to mine large volumes of published content from the millions of pages of news, scholarly and other publications provided to the SU campus community through current subscriptions to contemporary and historical ProQuest databases. Tap into resources such as the American Periodicals Series, US Newsstream’s coverage of millions of newspaper articles, the vast corpus of fiction and other book length texts within Early English Books Online, or the many decades of alternative press journalism within Alt-Press Watch and other ProQuest resources spanning multiple centuries of publications. TDM Studio allows Syracuse University researchers the capacity within ProQuest’s text and data mining environment to apply programming languages like R or Python to execute queries, develop datasets, and extract and analyze the text of publications central to their research. ProQuest TDM Studio is also pre-equipped with helpful programming libraries and scripts for getting started, and users can import or write their own scripts and packages to extend the capabilities of the platform. Syracuse University Libraries is delighted to offer the Syracuse University community early access to this valuable resource as a tool to assist researchers in applying digital scholarship techniques to the wealth of licensed content available through numerous current ProQuest subscriptions. This is a powerful tool for analyzing recent and more deeply historical scholarly publications, primary source texts in the humanities, business, public policy, public health and other scientific literature, as well as extensive recent and older U.S. and international journalism.
For more information or general assistance, contact SU Libraries data services.
To access this text and data mining tool, complete the SU Libraries' ProQuest TDM Studio proposal form.
Please also see detailed User Information on this guide.
ProQuest TDM Studio is a text and data mining resource available to individuals or research teams in the Syracuse University community for either short or long term use with these user limits:
Login information: ProQuest TDM Studio access is passworded. The login will not be through SU’s authentication via NetID and password. Access will be set up by ProQuest, and TDM Studio team members will be provided with login information. ProQuest TDM Studio access and passwords are set up for individual users who comprise a research team, ranging for one to five persons, and accounts/passwords may not be shared beyond those persons.
User information: Sections of SU Libraries’ Access to Licensed Web Resources Policy relevant to use of licensed resources apply.
Short Term Use
7-days: Individuals and teams may reserve ProQuest TDM Studio for a seven-day period, which is intended to either trial the platform and the appropriateness of its features for a possible project, or for direct use of TDM Studio not anticipated to require more than a seven-day project completion time frame. Short term use may be renewed for an additional seven days, unless another project team has booked the tool for that time period. Individuals or project teams are welcome to renew a seven-day booking once, but are strongly encouraged to make a 30-day project use booking if they have determined extended access best fits their project goals.
Extended Term Use
30 days: Individuals and teams with an established project may reserve ProQuest TDM Studio for periods of up to 30 days, with the possibility of renewals in seven-day increments or more by request should more time be needed and should there be no other requests by other individuals or groups for project time.
Individuals or teams wishing to use ProQuest TDM Studio should submit a ProQuest TDM Studio proposal to SU Libraries that includes basic information about their project and how they intend to use ProQuest TDM Studio. Literacy in writing either Python or R is needed by at least one member of a team that will be assigned an account, called a "workbench."
Planning: Because access is shared Syracuse University-wide and does not support multiple simultaneous team access please plan ahead when deciding when you would like to book access to ProQuest TDM Studio, especially if your use of the platform is anticipated to involve deadlines, other complex scheduling or calendar conflicts, or a need for multiple periods of short or extended term use.
Additional planning advice:
Individuals or groups submitting an access proposal need to provide:
Saving work: Since all data, programs and results will be deleted when a team’s 7 day or 30 day time slot has ended, it is imperative that teams save all work and keep detailed notes on all steps and selections as there is no way to save them in TDM Studio, and teams may want to refer to them after the TDM Studio access period has ended.