Skip to Main Content

Text and Data Mining: Social Media Data

This guide provides guidance for users interested in doing text and data mining. It includes data sources, support options, and best practices for using content licensed by the University Libraries.

Library Data Services

Library Data Services caters to researchers interested in working with data, mapping, texts, visualization, and technology. Many of these services are available online. Davis Library Data Services, located on the second floor of Davis Library, offers:

  • A computing lab with specialized software for GIS and data visualization & analysis.
  • Walk-in assistance provided by knowledgeable student consultants during set hours
  • Consultations with specialists for more in-depth inquiries (by appointment).
  • Spaces for collaboration and presentation, complete with white boards and external displays.
  • Technology short courses and programs that promote digital scholarship.

Social Media Platforms

Facebook & Instagram (Meta)

 

API Available

Graph API:

Primary way for apps to read and write to the Facebook social graph. (HTTP-Based)

Instagram API:

Provides tools to help your business interact with Instagram users. 

Content Library:

Web-based tool for researchers to explore and understand publicly available data on Facebook, Instagram and Threads.

Content Library API:

API product that allows researchers to programmatically pull data from the content library.

Researcher Platform:

Separate platform to allow for a secure way for qualified users to access privacy-protected Facebook and Instagram data

 

Accessibility

Graph API
Instagram API
Content Library/ API/ Researcher Platform
  •     To access and download data applicants must be affiliated with a qualified academic institution or qualified research institution. The Inter-university Consortium for Political and Social Research (ICPSR) assess qualifications and submissions (~4-6 weeks)

 

Limits

  •     Both Graph API and Instagram API have platform rate limits that are tracked on an individual application, user level, and type of token used.  
  •     Tip: Limit API requests as much as possible as this improves the performance of your API response. 

 

Data Analysis


TikTok

 

API Available

Content Posting API:

Helps users to post directly to TikTok

Data Portability API:

Allows users to download/transport data

Display API:

contains a set of HTTP-based APIs that your product can use to display a TikTok creator's videos and their profile information.

Research API:

Allows independent and academic researchers who work for a non-profit institution to access certain data, such as videos, comments, and accounts

  •     Virtual Compute Environment (VCE): Secure place that allows you to query and analyze public data
    •     Allows you to access and analyze TikTok's public data in two stages. These stages are meant to protect user privacy and help organize your data analysis.
      •     Test Stage: Query the data using TikTok's query software development kit (SDK). The VCE will return random sample data based on your query.
      •     Execution Stage: Submit a script to execute against all public data. TikTok provides a powerful search capability that allows data to be paginated. TikTok will review the results file to make sure the output is aggregated.
  •     TikTok only reviews the results to ensure that there is no identifiable individual information extracted out of the platform. All aggregated results will be shared as a downloadable link to the approved primary researcher's email.
Commercial API:

Allows the public and researchers to perform customized searches on advertisement and targeting information. From the period when the advertisement is run and until one year since the advertisement was last shown to any user.

 

Accessibility  

All TikTok APIs require a TikTok Developer Account and if using an App to Register that App on TikTok Developers Website

Data Portability API:

Due to the ability to move data from platform additional access is required

Research API:  

Researchers must submit an application be approved and adhere to TikTok research tools terms of service

Commercial API:
  • Apply (Must have TikTok Developer Account first) (~ 1-2 weeks)

 

Limits

  •     Scopes: End users granted permissions to access specific data resources or perform specific actions. Every API requires a scope to be access and sensitive fields are protected by additional scopes.
    •     Along with scopes access tokens may be needed to help expedite the process. Specific tokens depend on API
  •     TikTok API limits the number of requests you can send in each timeframe. Limits of each API are set and enforced separately. But across all APIs the limit is 100,000 per day.  
  •     Default limits:  
API Limit
/v2/user/info 600
/v2/video/query 600
/v2/video/list 600
  •     Higher limits can be approved per API
Content Posting API
  • Each user_access_token is limited to 6 requests per minute
Research API/VCE
  •     Research API: 1,000 requests per day
  •     VCE
    •     Test Stage: Limited to 5,000 records per day.
    •     Execution Stage: Allows data to be paginated in increments of up to 100,000 records. 

 

Data Analysis

Each API has tutorials and examples on how to use their software. As well as an API Reference on requests, parameters, data structures, etc.  

Wrappers

Python TikTok Wrapper


X (Twitter)

 

API Available

X API v2:

Enables programmatic access to X

    V1 exists but it is recommended that most developers start to think about migrating to v2 of the API, and for any new users to get started with v2

X Ads API:

The X Ads API connects developers to X's advertising platform  

X for Websites:

X for Websites is a suite of tools bringing X content and functionality to your webpages and apps, enabling the X audience to share your content, and follow your X accounts

 

Accessibility/Limitations  

For all levels you must have an X developer account

Free Access
  •     Write-only and testing the X API
  •     Low rate-limit access to v2 posts and media upload endpoints
  •     1,500 Posts per month - posting limit at the app level
  •     1 Project
  •     1 App per Project
  •     1 Environment (Development/ Production/ Staging)
  •     Login with X  
  •     Access to Ads API
Basic (Hobbyists and Prototypes) ---$100 per month
  •     Low-rate limit access to suite of v2 endpoints
  •     3,000 Posts per month - posting limit at the user level
  •     50,000 Posts per month - posting limit at the app level
  •     10,000/month Posts read-limit rate cap
  •     1 Project
  •     2 Apps per Project with unique Environment (Development/ Production/ Staging)
  •     Access to Ads API
Pro (Startups scaling business) ---$5,000 per month
  •     Rate-limited access to suite of v2 endpoints, including search and filtered stream
  •     1,000,000 Posts per month - GET at the app level
  •     300,000 Posts per month - posting limit at the app level
  •     1 Project
  •     3 Apps per Project with unique Environment (Development/ Production/ Staging)
  •     Login with X
  •     Access to Ads API
Enterprise (Businesses/Scaled projects) --- Various tiers (starts at $42,000 per month)
  •     Commercial-level access that meets your and your customer's specific needs
  •     Managed services by a dedicated account team
  •     Complete streams: replay, engagement metrics, backfill, and more features
* There is not a separate subscription or access availability for Academic Research

 

Data Analysis

Useful Tutorials
X API:

Python, R, Java, JavaScript, etc.  

X Ads API:

Python, Ruby, JavaScript SDKs

X Websites:

Gatsby, WordPress

API Reference Index


YouTube

 

API Available

YouTube Data API (v3)

Add YouTube features to your application, including the ability to upload videos, create and manage playlists, and more

YouTube Analytics API

Retrieve viewing statistics, popularity metrics, and more for YouTube videos and channels

  •     The Reporting API supports applications that can retrieve and store bulk reports, then provide tools to filter, sort, and mine the data
  •     The Analytics API supports targeted, real-time queries to generate custom reports in response to user interaction
YouTube Livestreaming API

Schedule live YouTube broadcasts and manage your broadcast video streams

 

Accessibility

  1.     Create/log into Google Account
  2.     Create a project in the Google Developers Console and obtain authorization aredentials
  3.     Register YouTube Data API as one of the services that your application is registered to use:
  4.     If the application uses any API methods that require user authorization review the authentication guide to learn how to implement OAuth 2.0 authorization

 

Limitations

All API requests, including invalid requests, incur at least a one-point quota cost.  
  •     Available quota can be found on the Quotas page in the API Console in your account
All projects that enable YouTube Data API default to 10,000 units per day
  •     Request for additional quota can be done through the Quota extension request form for YouTube API Services

 

Data Analysis

YouTube Data API (v3)   
YouTube Analytics API
YouTube Livestreaming API
R Wrapper for YouTube API
Python Wrapper for YouTube API