Labeled code vectors from pre-trained code2vec (not an issue, just sharing links) #13
Small preprocessed dataset:
220,000 code vectors with names and labels, balanced to 20% positive and shuffled:
https://drive.google.com/drive/folders/180Lqpmp4X4YPwbAf_94uBiuj0B8pSM8D?usp=sharing
Test set with 104K methods (3K positive), shuffled, from Elasticsearch:
Test set with 20K methods from Apache projects:
codevectors_labeled_shuffled_test02 :
codevectors_labeled_shuffled_test :
codevectors_labeled_rebalanced-0-2_shuffled :
Google Drive link:
https://drive.google.com/open?id=1G0JtgelCNjjIHiGolUpF-4DbIrGyUdxO
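For anyone who wants to build a similarly balanced set from their own labeled vectors, here is a minimal sketch of rebalancing to 20% positive and shuffling. It is not the script used to produce the files above. The `(vector, name, label)` tuple layout, the `rebalance_and_shuffle` name, and the 1/0 label encoding are all assumptions made for illustration; the actual file format of the shared data may differ.

```python
import random


def rebalance_and_shuffle(rows, positive_ratio=0.2, seed=0):
    """Downsample negatives so positives make up `positive_ratio`, then shuffle.

    `rows` is assumed to be a list of (vector, name, label) tuples,
    where label is 1 for positive and 0 for negative (hypothetical layout).
    """
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    positives = [r for r in rows if r[2] == 1]
    negatives = [r for r in rows if r[2] == 0]

    # Negatives needed so that positives / total == positive_ratio.
    wanted = round(len(positives) * (1 - positive_ratio) / positive_ratio)
    # If there are too few negatives, keep them all (ratio will be higher).
    wanted = min(wanted, len(negatives))

    combined = positives + rng.sample(negatives, wanted)
    rng.shuffle(combined)
    return combined
```

All positives are kept and only negatives are downsampled, which suits a set where positives are rare (for example, roughly 3K out of 104K methods).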