Skip to content

ajdev-1/corona-press-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Please have a look at the Jupyter notebook on Jupyter Notebook Viewer since it displays the dynamic LDA Viz component.

Corona Press Analysis

Motivation

The past has shown that the press uses exceptional situations, to bypass our attention filter with emotionalized reports that tend to exaggerate. In the U.S. there has been an increased number of reports on social consequences in about particular epidemics (swine flu, Zika virus, Ebola). Health aspects have been neglected.

Thus, press articles can give an impression about a pandemic that differs from the actual situation (e.g. growth rate). This type of reporting could be repeated with COVID-19. This raises the question of whether emotionality or specific themes of press coverage of COVID-19 correlate with the growth of the virus in a country.

Within this project I examined whether there is a correlation between emotionality, used topics of press articles and the growth rate of COVID-19 in a country. Here, countries were examined that had a significant development of the growth rate (in March/ April 2020): Germany, USA, Spain, Italy, Great Britain, and France.

Data

Within this project, I used an open-source dataset from IEEE DataPort that includes news/message and blog entry data. Here I have concentrated on English reports.

Technology

In this project, I pre-processed the press articles, then clustered them using Latent Dirichlet Allocation, and finally formed different topics from these clusters.

Result Examples

Word clustering

for one of the countries. This methodology (Latent Dirichlet Allocation in combination with the vizualization tool LDAViz) has been used to identify the different topics within the data corpus. image

Probability distribution

One of the probability charts for specific topics for one of the selected countries: image

Releases

No releases published

Packages

No packages published