Format of converted output files #9

kastein · 2022-06-07T08:23:23Z

When converting the json output files of the tagger or parser to conllu using read_prediction.py, then different recipes are not separated by an empty line and the IDs in the first column do not restart at 1 when a new recipe starts. When converting the output of the tagger the format looked correct when I added the argument --single-sentences in addition to the arguments listed in the main Readme but this option did not work for converting the parser output.

irisferrazzo · 2022-06-17T15:25:20Z

@TheresaSchmidt, I've just had a meeting with @kastein about this and we figured out that the problem could be solved by adding the function that you wrote for splitting the input data for the tagger (in case there are more recipes in a single input file) by adding white lines also for the parser. Could you also document the arguments that can be used (e.g., --single-sentences)?

This isn't as urgent as other matters (since we usually parse single recipes and not different ones at a time), but we may forget about it in the future. Thank you in advance!

irisferrazzo assigned TheresaSchmidt Jun 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Format of converted output files #9

Format of converted output files #9

kastein commented Jun 7, 2022

irisferrazzo commented Jun 17, 2022

Format of converted output files #9

Format of converted output files #9

Comments

kastein commented Jun 7, 2022

irisferrazzo commented Jun 17, 2022