Library Data Services caters to researchers interested in working with data, mapping, texts, visualization, and technology. Many of these services are available online. Davis Library Data Services, located on the second floor of Davis Library, offers:
Listed below are a number of tools for text and data mining. This is not intended to be a complete list, and tool availability may change. Please contact a librarian listed on the Support tab of this guide for assistance if you have any questions about the tools listed on this page.
A text and data mining tool that has access to most of the ProQuest collections to which the University Library subscribes. TDM Studio has two main utilities:
Note: Use your UNC email when creating your ProQuest TDM Studio account.
For more information about ProQuest TDM Studio, see ProQuest's libguide.
Python library for data extraction from webpages, HTML, and XML files. For more information, see this Programming Historian tutorial.
Open-source repository that links research objects, entities, and actions. Their REST API(?) is very helpful for large-scale extraction of scholarly metadata. For more information, see their documentation.
Open-source text-analysis software for textual corpus research. Extract and visualize word frequencies, see key words in context, cluster terms, and more! Click here for more information about AntConc.
Open-source software for data exploration, transformation, and cleaning. See OpenRefine's documentation or this Programming Historian tutorial for more information.
Open-source software for visualization network data and extract network statistics, ideal for text or social media network data. For more information, check out these tutorials.