Skip to content
This repository has been archived by the owner on Jan 14, 2020. It is now read-only.

Latest commit

 

History

History
50 lines (41 loc) · 1.02 KB

RESULTS.md

File metadata and controls

50 lines (41 loc) · 1.02 KB

Experiments

Dec 4 - Preliminary Results

  • using commit log of existing issues we calculate the correct author
  • using symfony actvity: matched at 70%
  • without symfony actvity: matched at 32%

Nov 4

Features:

  • age
  • repo_num
  • follower_num
  • commit_comment_num
  • commit_num'
  • issue_comment_num
  • issue_event_num
  • issue_number
  • org_number
  • pr_comment_num
  • pr_num
  • collaborator_num

K-means

  • n_clusters: 2
  • n_init: 10 (default)
  • max_iter: 300 (default)
  • tolerance: 0.0001 (default)
  • algorithm: auto - em, elkan (default)

Training / Test Split

  • n = 474
  • 30%

Model 1: Decision tree

  • criterion: entropy
  • split: best (default)
  • max_depth: 13
  • max_features: None - use all (default)
  • random state: 0

https://github.com/jonlamca/comp5117-lam-wang/blob/d8607c4/python37/data/user_tree.png

Model 2: Random Forest

  • criterion: entropy
  • n_estimators: 10 (default)
  • max_depth: None (full depth)
  • random_state=3

https://github.com/jonlamca/comp5117-lam-wang/blob/d8607c4/python37/data/user_forest.png