Skip to content

Commit

Permalink
merge 2 sets of duplicates
Browse files Browse the repository at this point in the history
  • Loading branch information
anya-ji committed Jun 7, 2023
1 parent fd1f523 commit 0314a50
Show file tree
Hide file tree
Showing 6 changed files with 9 additions and 14 deletions.
3 changes: 2 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
@@ -1 +1,2 @@
.DS_Store
.DS_Store
dataset/merge.py
12 changes: 6 additions & 6 deletions dataset/README.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,17 @@
# KiloGram Dataset
## ⚠️ Note:
- Tangrams page5-207, page6-51, and page6-66 are found to be duplicates, so their annotations are similar, but their SVGs differ slighly due to artifacts of the vectorization process.

[updated April 2023]
## ⚠️ Note: (v0.1.1)
- We found a few tangrams that are duplicates, so their annotations are similar, but their SVGs differ slighly due to artifacts of the vectorization process. We merged their annotations in the dataset. We merged their annotations and removed the SVG files of the duplicates.
- page5-207, page6-51, and page6-66 -> page5-207 (30 annotations)
- page2-189 and page4-170 -> page2-189 (20 annotations)

---
`full.json`: 1016 tangrams, at least 10 annotations each
`full.json`: 1013 tangrams, at least 10 annotations each

`dense.json`: 74 tangrams, at least 50 annotations each

`dense10.json`: 74 tangrams sampled for dense annotations, 10 annotations each (from FULL set)

`/tangrams-svg`: 1016 tangrams in SVG format
`/tangrams-svg`: 1013 tangrams in SVG format

JSON schema:
```
Expand Down
2 changes: 1 addition & 1 deletion dataset/full.json

Large diffs are not rendered by default.

2 changes: 0 additions & 2 deletions dataset/tangrams-svg/page4-170.svg

This file was deleted.

2 changes: 0 additions & 2 deletions dataset/tangrams-svg/page6-51.svg

This file was deleted.

2 changes: 0 additions & 2 deletions dataset/tangrams-svg/page6-66.svg

This file was deleted.

0 comments on commit 0314a50

Please sign in to comment.