CulturalRecipes

Cross-cultural recipe adaptation

Format of a recipe

{
'id': str,
'title': str,
'title_translated': Optional[str],
'ingredients': List[str],
'steps': List[str],
'dish': Optional[str],
}

Format of a matched_recipe

{
'source': recipe,
'targets': List[recipe]
}
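The two formats above can be written down as Python types. The following is a minimal sketch, not code from this repository: the class names `Recipe` and `MatchedRecipe` and the helpers `is_recipe` and `is_matched_recipe` are hypothetical, and it assumes that an `Optional` field may be either `null` or absent.

```python
from typing import List, Optional, TypedDict


class Recipe(TypedDict):
    """One recipe, mirroring the field list above (hypothetical class name)."""
    id: str
    title: str
    title_translated: Optional[str]
    ingredients: List[str]
    steps: List[str]
    dish: Optional[str]


class MatchedRecipe(TypedDict):
    """A source recipe paired with its target recipes (hypothetical class name)."""
    source: Recipe
    targets: List[Recipe]


def _is_opt_str(value: object) -> bool:
    # Assumption: Optional fields may be None (or missing, which .get maps to None).
    return value is None or isinstance(value, str)


def _is_str_list(value: object) -> bool:
    return isinstance(value, list) and all(isinstance(x, str) for x in value)


def is_recipe(obj: object) -> bool:
    """Return True if obj is a dict whose fields match the recipe format."""
    if not isinstance(obj, dict):
        return False
    return (
        isinstance(obj.get("id"), str)
        and isinstance(obj.get("title"), str)
        and _is_opt_str(obj.get("title_translated"))
        and _is_str_list(obj.get("ingredients"))
        and _is_str_list(obj.get("steps"))
        and _is_opt_str(obj.get("dish"))
    )


def is_matched_recipe(obj: object) -> bool:
    """Return True if obj has a valid 'source' and a list of valid 'targets'."""
    if not isinstance(obj, dict):
        return False
    targets = obj.get("targets")
    return (
        is_recipe(obj.get("source"))
        and isinstance(targets, list)
        and all(is_recipe(t) for t in targets)
    )
```

`TypedDict` only helps static type checkers; the `is_*` functions perform the runtime check.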

Format of a data file: jsonl with one matched_recipe per line
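Since each line holds one matched_recipe, a data file can be streamed line by line. This is a minimal sketch using only the standard library; the function name `read_matched_recipes` and the file name in the usage note are hypothetical, and UTF-8 encoding is an assumption.

```python
import json
from typing import Iterator


def read_matched_recipes(path: str) -> Iterator[dict]:
    """Yield one matched_recipe dict per non-empty line of a jsonl file."""
    # Assumption: files are UTF-8 encoded (they contain Chinese text).
    with open(path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines, e.g. a trailing newline
            record = json.loads(line)
            if "source" not in record or "targets" not in record:
                raise ValueError(f"{path}:{line_no}: not a matched_recipe")
            yield record
```

For example, `for pair in read_matched_recipes("data.jsonl"): print(pair["source"]["title"], len(pair["targets"]))` would print each source title with its number of targets.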

| Set | Direction | Scale | Link | Notes |
| --- | --- | --- | --- | --- |
| Silver | cn2en | 82 | Download | 1:1 |
| Human | cn2en | 25 | Download | 1:1 |
| Silver | en2cn | 52 | Download | 1:1 |
| Human | en2cn | 41 | Download | 1:1 |

Chinese monolingual dataset (cleaned) and English monolingual dataset (raw)

Statistics of CulturalDataset

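| Direction | Source | Target | Train | Valid | Test | Silver | Human |
| --- | --- | --- | --- | --- | --- | --- | --- |
| cn2en | 44,473 | 144,645 | 35,578 | 4,447 | 4,448 | 82 | 25 |
| en2cn | 43,767 | 120,674 | 35,013 | 4,377 | 4,377 | 52 | 41 |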
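In the table above, the Train, Valid and Test counts for each direction add up to the Source count, which suggests the split is taken over source recipes at roughly 80/10/10. That reading is an inference from the numbers, not something stated here. A short check, with the figures copied from the table and the names `stats` and `split_total` chosen for illustration:

```python
# Counts copied from the statistics table above.
stats = {
    "cn2en": {"source": 44_473, "train": 35_578, "valid": 4_447, "test": 4_448},
    "en2cn": {"source": 43_767, "train": 35_013, "valid": 4_377, "test": 4_377},
}


def split_total(row: dict) -> int:
    """Sum of the train, valid and test counts for one direction."""
    return row["train"] + row["valid"] + row["test"]


for direction, row in stats.items():
    total = split_total(row)
    train_share = row["train"] / total
    # Report whether the splits account for every source recipe.
    print(f"{direction}: total={total}, matches_source={total == row['source']}, "
          f"train_share={train_share:.3f}")
```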