Skip to content

Sample dataset of 1001 H&M products, extracted via Bright Data API, featuring essential data points for pricing optimization, product personalization, and product inventory.

Notifications You must be signed in to change notification settings

luminati-io/HM-dataset-sample

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

H&M-dataset-sample

A sample dataset of 1001 H&M products

H&M products dataset header

A H&M products dataset sample of over 1000 records. Dataset was extracted using the Bright Data API.

Some of the data points that are included in the H&M products dataset:

  • category_tree: The hierarchy of categories to which the product belongs
  • color: The color of the product
  • country_code: Country code indicating the location or availability of the product
  • county_of_origin: The country of origin for the product
  • currency: The currency in which the product price is listed
  • delivery: Information about product delivery
  • description: A detailed description of the product
  • domain: The domain or website where the product is listed
  • features: Features or characteristics of the product
  • final_price: The current price of the product
  • image_count: The total number of images associated with the product
  • image_urls: URLs pointing to images of the product
  • in_stock: Indicates whether the product is currently in stock
  • initial_price: The original or initial price of the product
  • main_image: URL or identifier of the main image associated with the product
  • manufacturer: The manufacturer of the product
  • offers: Information about different offers or packages for the product
  • people_bought_together: Products that are commonly bought together with the current product
  • product_name: The name or title of the product
  • reviews_count: The total number of customer reviews for the product
  • related_products: Other products related to the current one
  • seller_name: The name of the seller offering the product
  • size: The size of the product
  • top_reviews: Highlights or top customer reviews for the product

And a lot more.

This is a sample subset which is derived from the "H&M products" dataset which includes more than 4.1M records.

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.

Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.

Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.

Data enricH&Ment available as an addition to the data points extracted: Based on request.

Get the full H&M products dataset.

What are the H&M products datasets use cases?

1. Pricing Optimization

Develop a pricing strategy and create dynamic pricing models by analyzing comparable H&M products and categories against competitors.

2. Product Inventory

Identify inventory shortages of H&M products, detect increasing demand for specific items, and uncover emerging trends among consumers.

3. Market Strategy

Leverage the H&M dataset to perform market strategy analysis, identifying key trends and consumer preferences.

Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's Web Scraper APIs and ready-to-use datasets to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application here.

About

Sample dataset of 1001 H&M products, extracted via Bright Data API, featuring essential data points for pricing optimization, product personalization, and product inventory.

Topics

Resources

Stars

Watchers

Forks