
Try out generating code instead of bincode for glyphsLib data to avoid runtime deserialization #1066

Open
rsheeter wants to merge 3 commits into main

Conversation

@rsheeter (Contributor) commented Oct 24, 2024

As noted in #774 (comment), loading the glyph data takes a noticeable amount of time, north of 10% of total runtime on Linux if samply is to be believed. This PR explores generating const arrays instead of bincode that we have to deserialize at runtime.

Impact on my Linux box:

$ hyperfine --warmup 25 --runs 250 --prepare 'rm -rf build/' 'target/release/fontc_main ../OswaldFont/sources/Oswald.glyphs' 'target/release/fontc ../OswaldFont/sources/Oswald.glyphs'
Benchmark 1: target/release/fontc_main ../OswaldFont/sources/Oswald.glyphs
  Time (mean ± σ):     206.3 ms ±   8.5 ms    [User: 1091.0 ms, System: 1469.1 ms]
  Range (min … max):   187.4 ms … 242.4 ms    250 runs
 
Benchmark 2: target/release/fontc ../OswaldFont/sources/Oswald.glyphs
  Time (mean ± σ):     183.8 ms ±   8.6 ms    [User: 1067.5 ms, System: 1436.0 ms]
  Range (min … max):   158.9 ms … 211.8 ms    250 runs
 
Summary
  target/release/fontc ../OswaldFont/sources/Oswald.glyphs ran
    1.12 ± 0.07 times faster than target/release/fontc_main ../OswaldFont/sources/Oswald.glyphs

The key change is that update.py now emits const arrays in Rust code instead of build.rs producing bincode that we load at runtime. You probably don't want to open glyphs-reader/src/glyphdata.rs in a browser diff view, it's a bit long. Maybe there is a better way?
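For illustration, the generated output might look something like the following. This is a minimal sketch: the stand-in `GlyphRecord` type and the field set are assumptions here (the real `GlyphInfo` in glyphs-reader carries more fields, such as category and alt names); only the overall shape matters — a const slice baked into the binary, so nothing is deserialized at runtime:

```rust
// Simplified stand-in for the real glyphs-reader type, so this sketch
// compiles on its own.
#[derive(Debug)]
pub struct GlyphRecord {
    pub name: &'static str,
    pub unicode: Option<u32>,
    pub production: Option<&'static str>,
}

// What update.py might emit: one entry per glyph, embedded in the binary.
pub const GLYPH_DATA: &[GlyphRecord] = &[
    GlyphRecord { name: "A", unicode: Some(0x0041), production: None },
    GlyphRecord { name: "Aacute", unicode: Some(0x00C1), production: None },
];

fn main() {
    // Lookup is a plain slice scan (or a binary search if sorted by name).
    let hit = GLYPH_DATA.iter().find(|g| g.name == "Aacute");
    assert!(hit.is_some());
}
```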

@cmyr (Member) commented Oct 24, 2024

I wonder if we could just start a job to load the data earlier in execution? It is needed fairly early, but if we triggered the load on another thread immediately at launch it might be ready by the time we need it.
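A minimal sketch of that preload idea, assuming something shaped like glyphs-reader's `load_bundled_data` (the stand-in here just returns strings): spawn the load on a background thread at launch, and let the first consumer either find the data ready or initialize it inline:

```rust
use std::sync::OnceLock;
use std::thread;

static GLYPH_DATA: OnceLock<Vec<String>> = OnceLock::new();

// Stand-in for the real (expensive) bincode deserialization.
fn load_bundled_data() -> Vec<String> {
    vec!["A".to_string(), "Aacute".to_string()]
}

fn main() {
    // Kick off the load immediately on launch.
    let preload = thread::spawn(|| {
        // If the main thread won the race, this set() fails harmlessly.
        let _ = GLYPH_DATA.set(load_bundled_data());
    });

    // ... other startup work runs here, overlapping the load ...

    // Either returns the preloaded data or initializes it inline.
    let data = GLYPH_DATA.get_or_init(load_bundled_data);
    assert_eq!(data.len(), 2);
    preload.join().unwrap();
}
```

The cost rsheeter flags below is visible in the sketch: the two racing initializers may both run the load, and the cycles are spent on every run whether or not the data is needed early.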

@rsheeter (Contributor, Author)

> start a job to load the data earlier in execution

I have reservations about this: it is more complex, and more expensive in that we burn cycles pointlessly on every run.

@cmyr (Member) left a comment

I definitely agree that we don't want to be spending 10% of our runtime on this, but I wonder if there isn't a slightly easier way to do it (as you mentioned elsewhere): just directly generate a &'static [GlyphInfo], and then use Cow<'static, [GlyphInfo]> in the GlyphData struct so that we use the static array in the general case; everything else should 'just work'?

basically we can quite easily just swap out the bincode step without needing to make many other changes... I'm now nerdsniped enough I'm going to take a quick look, I think it should be a very small patch.
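A sketch of the `Cow<'static, [GlyphInfo]>` idea, with a simplified stand-in `GlyphInfo` and hypothetical constructors (`bundled`, `with_overrides` are illustrative names, not the crate's API): borrow the baked-in static slice in the common case, and only allocate when overrides force ownership:

```rust
use std::borrow::Cow;

// Simplified stand-in; the real GlyphInfo lives in glyphs-reader.
#[derive(Clone, Debug)]
struct GlyphInfo {
    name: &'static str,
}

static BUNDLED: &[GlyphInfo] = &[GlyphInfo { name: "A" }, GlyphInfo { name: "Aacute" }];

struct GlyphData {
    glyphs: Cow<'static, [GlyphInfo]>,
}

impl GlyphData {
    fn bundled() -> Self {
        // Borrowed: zero-copy, no allocation in the general case.
        GlyphData { glyphs: Cow::Borrowed(BUNDLED) }
    }
    fn with_overrides(extra: Vec<GlyphInfo>) -> Self {
        // Owned: pay for the clone only when the data actually differs.
        let mut v = BUNDLED.to_vec();
        v.extend(extra);
        GlyphData { glyphs: Cow::Owned(v) }
    }
}

fn main() {
    assert_eq!(GlyphData::bundled().glyphs.len(), 2);
    let custom = GlyphData::with_overrides(vec![GlyphInfo { name: "Adieresis" }]);
    assert_eq!(custom.glyphs.len(), 3);
}
```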

@cmyr (Member) left a comment

Okay, so I need to stop ratholing on this for the time being; I had an approach that seemed sensible and overall simpler, but I'm running into some very funny business where my generated code is hanging fontc.

my main concern with this patch is that it is making a big chunk of changes that seem unrelated to our primary concern, which is to stop parsing an input file in order to generate our initial data. It may be that these other changes make sense, but it makes reasoning about the patch significantly more difficult.

Currently we have a fn load_bundled_data, which returns Vec<GlyphInfo>. What if we keep this exact same signature, but with load_bundled_data just being a codegen'd vec![] macro of GlyphInfo structs?

to make this easier (and simply reduce the size in MB of the enormous file we generate) I would add a convenience constructor to GlyphInfo looking like,

    /// Create a new `GlyphInfo` instance from generated code.
    ///
    /// The types and the name of this function are chosen to minimize filesize
    /// of the generated code.
    #[doc(hidden)]
    pub fn bake(
        name: &'static str,
        category: Category,
        subcategory: Subcategory,
        unicode: Option<u32>,
        production: &'static str,
        alt_names: &'static [&'static str],
    ) -> Self {
        let production = if production.is_empty() {
            None
        } else {
            Some(SmolStr::new_static(production))
        };

        Self {
            name: SmolStr::new_static(name),
            category,
            subcategory,
            unicode,
            production,
            alt_names: alt_names
                .into_iter()
                .map(|s| SmolStr::new_static(*s))
                .collect::<Vec<_>>(),
        }
    }

and then I would generate a bunch of invocations of this constructor.

This approach isn't perfect; we will be doing some work in this fn, but hopefully it will be significantly less? The existing use of OnceCell means it will only ever be invoked once.

It may be that this solution is inadequate, but it seems to me like it would be worth trying this first, and then if we aren't satisfied we can consider something more invasive?
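Putting the pieces together, here is a self-contained sketch of what the codegen'd `load_bundled_data` could look like. The `Category`/`Subcategory` enums and the `&'static str` fields are simplified stand-ins (the real types use `SmolStr` and richer enums); the point is the shape of the generated invocations:

```rust
// Simplified stand-ins for the real glyphs-reader types.
#[derive(Debug, Clone, Copy)]
enum Category { Letter }
#[derive(Debug, Clone, Copy)]
enum Subcategory { None }

#[derive(Debug)]
struct GlyphInfo {
    name: &'static str,
    category: Category,
    subcategory: Subcategory,
    unicode: Option<u32>,
    production: Option<&'static str>,
    alt_names: Vec<&'static str>,
}

impl GlyphInfo {
    // Positional args keep each generated line short, per the suggestion above.
    fn bake(
        name: &'static str,
        category: Category,
        subcategory: Subcategory,
        unicode: Option<u32>,
        production: &'static str,
        alt_names: &'static [&'static str],
    ) -> Self {
        Self {
            name,
            category,
            subcategory,
            unicode,
            // Empty string in the generated source means "no production name".
            production: (!production.is_empty()).then_some(production),
            alt_names: alt_names.to_vec(),
        }
    }
}

// What the codegen'd function might look like: same signature as today,
// but the body is a generated vec![] of bake() calls.
fn load_bundled_data() -> Vec<GlyphInfo> {
    vec![
        GlyphInfo::bake("A", Category::Letter, Subcategory::None, Some(0x41), "", &[]),
        GlyphInfo::bake("Aacute", Category::Letter, Subcategory::None, Some(0xC1), "", &["Aacute.alt"]),
    ]
}

fn main() {
    let data = load_bundled_data();
    assert_eq!(data.len(), 2);
    assert!(data[0].production.is_none());
}
```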

@rsheeter (Contributor, Author)

> It may be that these other changes make sense, but it makes reasoning about the patch significantly more difficult.

That's fair; let me think a bit about how to factor this into smaller changes after meeting season ends.
