Sample dataset of 1001 YouTube comments, extracted via Bright Data API, featuring essential data points for consumer sentiment, market analysis, and content creation.

luminati-io/YouTube-dataset-sample

YouTube-dataset-sample

A sample dataset of 1001 YouTube comments

A YouTube dataset sample of 1,001 comment records, extracted using the Bright Data API.

The dataset includes data points such as:

  • comment_id: Unique identifier for each comment
  • comment_text: The text content of the comment
  • likes: Number of likes received by the comment
  • replies: Number of replies to the comment
  • username: Username of the commenter
  • username_md5: MD5 hash of the username field
  • user_channel: URL of the commenter's YouTube channel
  • date: Date when the comment was posted
  • url: Web address of the YouTube video where the comment was posted
  • video_id: Unique identifier for the YouTube video
  • replies_value: Array of reply values
  • replies_without_names: Replies without associated usernames

The dataset contains further fields beyond those listed here.
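As a rough illustration of how the fields above might be consumed, here is a minimal Python sketch that maps one raw record onto a typed structure. The field types (integer counts, string identifiers) and the exact way `username_md5` is computed (hex MD5 digest of the UTF-8 encoded username) are assumptions, not something this README confirms; `Comment`, `parse_comment`, and `md5_matches` are hypothetical helper names.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class Comment:
    # Subset of the documented fields; types are assumed, not confirmed.
    comment_id: str
    comment_text: str
    likes: int
    replies: int
    username: str
    username_md5: str
    video_id: str


def parse_comment(raw: dict) -> Comment:
    """Build a Comment from one raw record, treating missing counts as 0."""
    return Comment(
        comment_id=str(raw["comment_id"]),
        comment_text=raw.get("comment_text") or "",
        likes=int(raw.get("likes") or 0),
        replies=int(raw.get("replies") or 0),
        username=raw.get("username") or "",
        username_md5=raw.get("username_md5") or "",
        video_id=raw.get("video_id") or "",
    )


def md5_matches(comment: Comment) -> bool:
    """Assumption: username_md5 is the hex MD5 digest of the UTF-8 username."""
    digest = hashlib.md5(comment.username.encode("utf-8")).hexdigest()
    return digest == comment.username_md5.lower()
```

If `md5_matches` returns False on real records, the hash is probably derived differently (for example from a normalized username), so treat the check as exploratory.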

This sample is a subset of the full "YouTube comments" dataset, which includes more than 11.1M records.

Other available YouTube datasets:

  • YouTube posts dataset which includes 1.3B records

  • YouTube profiles dataset which includes 29.6M records

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.
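For the line-delimited formats, a reader can be quite small. The sketch below assumes a JSON Lines / NDJSON file with one record per line, optionally gzip-compressed; `read_records` is a hypothetical helper and the file names in the usage note are placeholders, not files known to be in this repository.

```python
import gzip
import json
from pathlib import Path
from typing import Iterator


def read_records(path: str) -> Iterator[dict]:
    """Yield one dict per non-empty line of a JSON Lines / NDJSON file.

    Files whose name ends in .gz are decompressed on the fly.
    """
    opener = gzip.open if Path(path).suffix == ".gz" else open
    with opener(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines rather than failing on them
                yield json.loads(line)
```

Usage would look like `for record in read_records("comments.jsonl.gz"): ...`. Because the function is a generator, it streams records instead of loading the whole file into memory.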

Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.

Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.

Data enrichment beyond the extracted data points: available on request.

Get the full YouTube dataset.

What are the YouTube dataset use cases?

1. Consumer Sentiment

Gain insights into what your YouTube subscribers are saying about your brand. Monitor brand and product mentions by collecting comments, hashtags, mentions, and likes on YouTube to enhance your brand’s reputation and improve your services.
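A first pass at brand monitoring could be as simple as counting mentions. The Python sketch below assumes records are dicts carrying the `comment_text` and `likes` fields described earlier; `brand_mentions` is a hypothetical helper, and it only counts whole-word mentions rather than scoring sentiment.

```python
import re
from typing import Iterable


def brand_mentions(comments: Iterable[dict], brand: str) -> dict:
    """Count comments mentioning `brand` (whole word, case-insensitive)
    and total the likes those comments received."""
    pattern = re.compile(rf"\b{re.escape(brand)}\b", re.IGNORECASE)
    hits = [c for c in comments if pattern.search(c.get("comment_text") or "")]
    return {
        "mentions": len(hits),
        "total_likes": sum(int(c.get("likes") or 0) for c in hits),
    }
```

The matching comments could then be passed to a sentiment model of your choice; that step is outside the scope of this sketch.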

2. Content Creation

Utilize YouTube datasets to uncover insights about trending videos, popular formats, and the content users engage with most. Leverage this data to optimize your video content for maximum reach and audience engagement.
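One way to approximate "what users engage with most" from comment data alone is to rank videos by the engagement on their comments. The sketch below assumes each record has `video_id`, `likes`, and `replies`; treating likes plus replies as an engagement score is an illustrative choice, and `top_videos` is a hypothetical helper.

```python
from collections import defaultdict
from typing import Iterable


def top_videos(comments: Iterable[dict], n: int = 3) -> list[tuple[str, int]]:
    """Rank video_ids by likes + replies summed over their comments."""
    totals: dict[str, int] = defaultdict(int)
    for c in comments:
        score = int(c.get("likes") or 0) + int(c.get("replies") or 0)
        totals[c["video_id"]] += score
    # Highest total engagement first, truncated to the top n videos.
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)[:n]
```

Comment-level engagement is only a proxy; view counts or watch time would need a different dataset.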

3. Market Analysis

Analyze your competitors' presence on YouTube with a YouTube dataset. Gather data on their content and activity, identify the types of content driving engagement, and see what YouTubers are saying about their brand and services.

Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's Web Scraper APIs and ready-to-use datasets to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application here.
