[idea] Generic code tokenizer #44

Open
EgorBu opened this issue Jan 10, 2019 · 5 comments

Comments

@EgorBu

EgorBu commented Jan 10, 2019

In many tasks, feature extraction for source code relies heavily on tokenization and on structural information. If we want to use the suggestion feature on GitHub, we must work with tokenized code.
This part is very important for everybody in the MLonCode area, yet it is still quite complicated to do.
Proposal: extend the bblfsh client or make a new module that could be used by many different projects.

TLDR: information required by feature extractor

This module could then be used by many researchers in this area. Related issue: bblfsh/bblfshd#231

@zurk

zurk commented Jan 10, 2019

Right now we have a tokenizer for JS in style-analyzer:
https://github.com/src-d/style-analyzer/blob/a0eaafd5b371433e3c2e3dc9d113710814912f99/lookout/style/format/feature_extractor.py#L655-L656

If we start this project, this code should be considered a starting point.

@vmarkovtsev changed the title from "[basic feature extractor] Tokenizer" to "[idea] Generic code tokenizer" on Jan 10, 2019
@m09
Contributor

m09 commented Jun 18, 2019

Should this be transferred to src-d/feature-idea? @vmarkovtsev @EgorBu

@EgorBu
Author

EgorBu commented Jul 2, 2019

Good idea, @m09

@m09
Contributor

m09 commented Jul 2, 2019

Somehow I cannot transfer it: GitHub does not find the feature-idea repo 😕

Edit: it seems we need someone who is an admin in both ml-backlog and feature-idea to transfer the issue.

@m09
Contributor

m09 commented Aug 5, 2019

Calling @smola to the rescue to transfer the issue to feature-idea :)

3 participants