This repository contains Python code to find the dominant / most common colors in Steam banners.
Run the notebook `extract_steam_palette.ipynb`.
A palette is an ordered list of N colors. In our case, N is arbitrarily set to 8.
We consider:
- a source palette, typically the palette of the gift wrapping,
- a database of target palettes, typically the palettes of every Steam game.
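To make the notion of a palette concrete, here is a minimal sketch that builds an ordered list of the most common colors from a list of RGB pixels. It is only an illustration: the function name `extract_palette`, the `step` parameter and the naive bucket-counting approach are assumptions, and the notebook may well rely on a different extraction method.

```python
from collections import Counter

N_COLORS = 8  # palette size used in this README


def extract_palette(pixels, n_colors=N_COLORS, step=32):
    """Return the n_colors most common quantized RGB colors, most frequent first.

    Hypothetical helper: `step` is the width of the quantization buckets.
    """
    # Snap each channel to the centre of its bucket, so that
    # near-identical shades are counted together.
    quantized = [
        tuple((channel // step) * step + step // 2 for channel in pixel)
        for pixel in pixels
    ]
    # most_common() sorts by decreasing count: index 0 is the most predominant color.
    return [color for color, _ in Counter(quantized).most_common(n_colors)]


# Toy "image": mostly red, some blue, a little green.
pixels = [(250, 10, 10)] * 5 + [(10, 10, 250)] * 3 + [(12, 250, 12)]
palette = extract_palette(pixels, n_colors=2)
```

With this ordering, a low index in the palette means a predominant color, which is the convention used in the rest of this README.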
In order to match palettes, we first need to be able to match colors.
Colors can be represented in the following spaces:
- RGB,
- HSV:
  - raw,
  - linearized,
- CIE LAB,
- CIE LUV.
The distance between two colors is the L2 norm (Euclidean distance) of their difference in the chosen space.
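As a rough sketch of how the choice of space changes the distance, the snippet below compares two nearly identical reds in RGB, in raw HSV, and in a Cartesian version of HSV. The helper names are hypothetical, and reading "linearized" HSV as a conversion from cylindrical to Cartesian coordinates is an assumption made for this example. CIE LAB and CIE LUV are left out because they would require a conversion library.

```python
import colorsys
import math


def to_unit_rgb(rgb):
    """Scale 8-bit RGB channels to [0, 1]."""
    return tuple(channel / 255 for channel in rgb)


def to_raw_hsv(rgb):
    """HSV with hue, saturation and value all in [0, 1]."""
    return colorsys.rgb_to_hsv(*to_unit_rgb(rgb))


def to_cartesian_hsv(rgb):
    """Assumed 'linearized' HSV: hue is an angle, saturation a radius."""
    h, s, v = to_raw_hsv(rgb)
    angle = 2 * math.pi * h
    return (s * math.cos(angle), s * math.sin(angle), v)


def color_distance(rgb_a, rgb_b, convert=to_unit_rgb):
    """L2 norm between two colors after conversion to the chosen space."""
    return math.dist(convert(rgb_a), convert(rgb_b))


# Two reds whose hues sit on either side of the 0/1 wrap-around.
red_a, red_b = (255, 0, 10), (255, 10, 0)
d_rgb = color_distance(red_a, red_b)
d_raw_hsv = color_distance(red_a, red_b, convert=to_raw_hsv)
d_cartesian_hsv = color_distance(red_a, red_b, convert=to_cartesian_hsv)
```

In raw HSV the two reds appear far apart because hue wraps around, whereas the RGB and Cartesian HSV distances stay small.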
It is possible to take into account:
- the index of the color in the target palette,
- the difference between the indices of the colors in the source and target palettes.
The objective is to incentivize matching with colors which are, respectively:
- predominant in the target palettes, i.e. with low indices,
- at least more predominant in the target than in the source palette, i.e. with lower indices.
In the latter case, thresholds can be used in order not to distinguish between target colors which have an index lower than the color in the source palette.
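One way to picture the index-aware matching is sketched below. The scaling formula, the function names and the default values are hypothetical, chosen only so that the behavior described above is visible. They are not taken from the repository.

```python
import math


def index_adjusted_distance(distance, source_index, target_index,
                            factor=0.5, exponent=1.0, use_threshold=True):
    """Scale a color distance by the gap between target and source indices.

    Hypothetical formula, for illustration only.
    """
    gap = target_index - source_index
    if use_threshold:
        # Do not distinguish between target colors with a lower index
        # than the source color: they all get a gap of 0.
        gap = max(0, gap)
    scale = (1 + factor * abs(gap)) ** exponent
    # A positive gap inflates the distance, a negative gap shrinks it.
    return distance * scale if gap >= 0 else distance / scale


def best_match(source_color, source_index, target_palette, **kwargs):
    """Index of the target color with the smallest adjusted distance."""
    scores = [
        index_adjusted_distance(
            math.dist(source_color, target_color), source_index, j, **kwargs
        )
        for j, target_color in enumerate(target_palette)
    ]
    return min(range(len(scores)), key=scores.__getitem__)


source_color = (100, 100, 100)
target_palette = [(112, 100, 100), (110, 100, 100)]
plain = best_match(source_color, 0, target_palette, factor=0.0)
index_aware = best_match(source_color, 0, target_palette, factor=0.5)
```

Here `plain` picks the closest color regardless of its rank, while `index_aware` prefers the slightly farther but more predominant target color. Setting `source_index` to 0 for every color reduces this to using the target index alone.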
The distance between palettes can be:
- the Mean Pairwise Distance,
- the Hausdorff distance,
- a modified Hausdorff "distance",
- a custom Hausdorff distance, similar to Minimum Color Difference.
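The first three distances, together with Minimum Color Difference, can be sketched as follows for palettes given as lists of coordinate tuples. This is a minimal illustration based on the textual definitions in the references of this README, not the code of the repository, and the function names are assumptions.

```python
import math
from statistics import mean


def directed_min_distances(source, target):
    """For each source color, the distance to its closest target color."""
    return [min(math.dist(s, t) for t in target) for s in source]


def mean_pairwise_distance(a, b):
    """Average distance over all pairs (one color from each palette)."""
    return mean([math.dist(x, y) for x in a for y in b])


def hausdorff(a, b):
    """Max over both directions of the worst closest-color distance."""
    return max(max(directed_min_distances(a, b)),
               max(directed_min_distances(b, a)))


def modified_hausdorff(a, b):
    """Same as Hausdorff, with the inner sup replaced by an average."""
    return max(mean(directed_min_distances(a, b)),
               mean(directed_min_distances(b, a)))


def minimum_color_difference(a, b):
    """Both the sup and the outer max are replaced by an average."""
    return mean([mean(directed_min_distances(a, b)),
                 mean(directed_min_distances(b, a))])


# One-dimensional toy palettes, embedded in 3D for readability.
palette_a = [(0, 0, 0), (10, 0, 0)]
palette_b = [(0, 0, 0), (4, 0, 0)]
results = {
    "mpd": mean_pairwise_distance(palette_a, palette_b),
    "hausdorff": hausdorff(palette_a, palette_b),
    "mhd": modified_hausdorff(palette_a, palette_b),
    "mcd": minimum_color_difference(palette_a, palette_b),
}
```

On this toy example, the closest-color distances are 0 and 6 from `palette_a` to `palette_b`, and 0 and 4 in the other direction, which makes the differences between the four definitions easy to check by hand.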
It is possible to take into account:
- the index of the color in the source palette.
The objective is to re-weigh the color distances in order to give more importance to the errors for colors which are predominant in the source palette. The weights can be normalized so that they sum to 1, as the idea is only to re-balance the terms in the error.
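A possible sketch of this re-weighting is given below. The weight formula is hypothetical: it only reuses the parameter names `exponent` and `factor` mentioned in this README and is built so that all weights equal 1 when either parameter is 0. For brevity, only the source-to-target direction of the palette distance is shown.

```python
import math


def index_weights(n_colors, factor=1.0, exponent=1.0, normalize=True):
    """Decreasing weights for palette indices 0..n_colors-1.

    Hypothetical formula: w_i = (1 + factor * i) ** (-exponent).
    """
    weights = [(1 + factor * i) ** (-exponent) for i in range(n_colors)]
    if normalize:
        # Only the balance between terms matters, so make the weights sum to 1.
        total = sum(weights)
        weights = [w / total for w in weights]
    return weights


def weighted_directed_distance(source, target, **kwargs):
    """Weighted sum of closest-color distances, from source to target."""
    weights = index_weights(len(source), **kwargs)
    return sum(
        w * min(math.dist(s, t) for t in target)
        for w, s in zip(weights, source)
    )


target = [(1, 0, 0), (10, 0, 0)]
# Same two source colors, but the mismatched one is either first or last.
error_on_first = weighted_directed_distance([(0, 0, 0), (10, 0, 0)], target)
error_on_last = weighted_directed_distance([(10, 0, 0), (0, 0, 0)], target)
```

The same mismatch costs more when it affects the first, i.e. most predominant, color of the source palette.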
As with the matching of colors, it is possible to take into account:
- the index of the color in the target palette,
- the difference between the indices of the colors in the source and target palettes.
Here, in contrast to what was done with the matching of colors, the idea is to penalize rather than incentivize.
As with the matching of colors, thresholds can be used for more subtle touches.
A benchmark of the parameters used for the palette distance is shown on the Wiki.
- `woctezuma/steam-popular-appids`: popular Steam appIDs,
- `woctezuma/steam-store-snapshots`: a comprehensive list of appIDs downloaded in January 2021,
- Stack Overflow: different approaches to extract the palette of an image.
- Compare colors:
  - Wikipedia: Red, Green, Blue (RGB)
  - Wikipedia: Hue, Saturation, Value (HSV). Caveat: this is a cylindrical-coordinate representation!
  - Wikipedia: CIELAB color space (L*a*b*)
  - Wikipedia: CIELUV color space (L*u*v*)
- Compare palettes, i.e. ordered lists of a few colors:
  - Mean Pairwise Distance [1, 2]
  - Wikipedia: Hausdorff distance [2]
  - Wikipedia: Modified Hausdorff "distance" (MHD) [2, 3] (in French):
    - which is a modification where the sup is replaced by an average,
    - which is not actually a distance, as it does not satisfy the triangle inequality,
  - Minimum Color Difference (MCD) [1, 2]:
    - which is a modification of the Hausdorff distance where the sup and max are replaced by an average,
- In retrospect, in my code, the distance between palettes is assessed by:
  - a modification of the Hausdorff distance where the sup and max are replaced by a (weighted) sum,
  - NB: Using a sum or an average is equivalent (up to a constant factor) if the number of colors in each palette, i.e. the cardinality of each set, is constant.
  - NB²: The weights make it possible to take advantage of the order of colors in the palettes. If either `exponent` or `factor` is equal to 0, then the weights are all equal to 1, i.e. the weighted sum is simply a sum.
- Articles:
  - [1] Pan, Qianqian, et al. Comparative Evaluation of Color Differences between Color Palettes. Color and Imaging Conference 2018.
  - [2] Kim, Suzi, et al. Dynamic Closest Color Warping to Sort and Compare Palettes. SIGGRAPH 2021. (code)
  - [3] Dubuisson, M-P., et al. A modified Hausdorff distance for object matching. ICPR 1994.