Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Format of converted output files #9

Open
kastein opened this issue Jun 7, 2022 · 1 comment
Open

Format of converted output files #9

kastein opened this issue Jun 7, 2022 · 1 comment
Assignees

Comments

@kastein
Copy link
Contributor

kastein commented Jun 7, 2022

When converting the json output files of the tagger or parser to conllu using read_prediction.py, then different recipes are not separated by an empty line and the IDs in the first column do not restart at 1 when a new recipe starts. When converting the output of the tagger the format looked correct when I added the argument --single-sentences in addition to the arguments listed in the main Readme but this option did not work for converting the parser output.

@irisferrazzo
Copy link
Contributor

@TheresaSchmidt, I've just had a meeting with @kastein about this and we figured out that the problem could be solved by adding the function that you wrote for splitting the input data for the tagger (in case there are more recipes in a single input file) by adding white lines also for the parser. Could you also document the arguments that can be used (e.g., --single-sentences)?

This isn't as urgent as other matters (since we usually parse single recipes and not different ones at a time), but we may forget about it in the future. Thank you in advance!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants