Skip to content

Latest commit

 

History

History
28 lines (20 loc) · 908 Bytes

README.md

File metadata and controls

28 lines (20 loc) · 908 Bytes

BBC_News

News

About Dataset:

Context

Self updating dataset. It collects RSS Feeds from BBC News using a Kernel: https://www.kaggle.com/gpreda/bbc-news-rss-feeds. The Kernel is run with a fixed frequency and the dataset is updated using the output of the Notebook.

These datasets can be viewed as classification or regression tasks. The classes are ordered and not balanced (e.g. there are much more normal wines than excellent or poor ones).

Content

BBC News RSS Feeds. The data contains the following columns:

  • title
  • pubDate
  • guid
  • link
  • description

Collection method

Uses requests_html and BeautifulSoup to collect RSS Feeds from BBC News site.

Acknowledgements

The content is proprietary of BBC.

Inspiration

Use the data to analyze the sentiment of news, from title and description.