TD Rotman FinHub TDMDAL Hackathon

Feb. 28, 2020 to Mar. 1, 2020.

Finalist Group (Top 5)

Task

In this project, we developed a ML process extracting information from the transcript of earning calls to predict stock price movement (net return) on the next trading day.

Available Data

Records of quaterly earning calls which are provided as paragraphs(.json) and daily stock returns(.csv) from Feb.2013 to Feb.2020 for 464 listed U.S. companies.

Methodology

We splitted each transcript into the manager discussion and Q&A parts because they are different in nature.
For each part, we measured the emotions using a Loughran McDonald dictionary and a finance terminology dictionary. Number of words in different categories(positive, negative, uncertain...) are counted which forms input data to following algorithms.
Random forests, support vector regressions, and XGBoost are used to predict returns.
Five-fold cross validation is implemented to find the best configuration(hyperparameters) and model.
Raw predictions on test set are scaled so that they have the same variance as the training set.

Outcome

The number of negative words in Q&A part had quite different distribution than that in manager discussion part.
Scaling raw predicted distribution effectively increased prediction accuracy.
Random forest worked best among the three models. It has a 55% accuracy in predicting stock price directions and a mean square error of 0.0016(30% less than linear regression) in predicting returns.

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.vscode		.vscode
data_proc		data_proc
model_selection_results		model_selection_results
models		models
predictions		predictions
presentation		presentation
references		references
sentiment_data		sentiment_data
.gitignore		.gitignore
Inference.ipynb		Inference.ipynb
README.md		README.md
append_returns.ipynb		append_returns.ipynb
eda.ipynb		eda.ipynb
visualize.ipynb		visualize.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TD Rotman FinHub TDMDAL Hackathon

Feb. 28, 2020 to Mar. 1, 2020.

Task

Available Data

Methodology

Outcome

About

Releases

Packages

Languages

polo2444172276/TD-Rotman-FinHub-TDMDAL-Hackathon

Folders and files

Latest commit

History

Repository files navigation

TD Rotman FinHub TDMDAL Hackathon

Feb. 28, 2020 to Mar. 1, 2020.

Task

Available Data

Methodology

Outcome

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages