This snippet is useful for inspecting the currently allocated device buffers.
```python
import jax

# Inspect the currently allocated device buffers via the backend client.
client = jax.lib.xla_bridge.get_backend()
mem_usage = sum(b.nbytes for b in client.live_buffers()) / 1e9  # total GB on device
print(mem_usage)
print([b.shape for b in client.live_buffers()])
```
Also, to clear the compilation cache for a particular jitted function, call `f_jit.clear_cache()`.
- JAX memory profiling produces output readable by the `pprof` Go program. There's an online hosted version of it here: https://pprofweb.evanjones.ca/pprofweb/
- `jax.vmap` can be dangerous for memory usage. Don't assume that a loop will be ordered in a sane way to minimize memory usage.
- It's possible to run into out-of-memory errors when too much data is stored in the JAX compilation cache. The error will look like `Execution of replica 0 failed: INTERNAL: Failed to load in-memory CUBIN: CUDA_ERROR_OUT_OF_MEMORY: out of memory`, in contrast to the normal JAX out-of-memory errors.
- Clearing the JAX compilation cache: `f._cache_size()` shows how many entries are cached, and `f.clear_cache()` (from here) will clear the compilation cache for a particular jitted function (the output of `jax.jit`).
Python not knowing about the size of JAX arrays can cause memory leaks:
- Python only knows about system RAM, not GPU RAM.
- So it only schedules "deep" (generation 2) garbage collection passes occasionally, based on host-side allocation pressure.
- But a JAX DeviceArray uses almost no system RAM, since the data is all stored on the GPU.
- So a DeviceArray looks to Python like the kind of thing that doesn't need to be urgently garbage collected.
- So giant 1.5 GB DeviceArrays start to leak every iteration through AdaGrid (a sketch of this failure mode follows below).
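Below is a minimal, hypothetical sketch of that failure mode and one workaround (forcing `gc.collect()` each iteration); the `Node` class and loop are made up for illustration.

```python
import gc
import jax.numpy as jnp

class Node:
    def __init__(self):
        # Imagine this is a ~1.5 GB DeviceArray: huge on the GPU, but the Python
        # wrapper object itself takes only a few bytes of host RAM.
        self.big = jnp.zeros((1024, 1024))
        self.other = None

def step():
    a, b = Node(), Node()
    a.other, b.other = b, a  # a reference cycle: refcounting alone can't free these
    return float(a.big[0, 0])

for _ in range(100):
    step()
    # Python's cyclic GC triggers on host-side allocation counts, which these tiny
    # wrappers barely move, so the cycles (and their device buffers) pile up.
    # Forcing a collection each iteration releases them promptly.
    gc.collect()
```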
JAX development patterns that might be useful:
- Pull your `jax.jit` and `jax.vmap` calls into the outermost layer of the code. This has two benefits:
  - it's easier to write code that is less vectorized.
  - grouping more operations before jit-ing gives more room for the compiler to optimize.
- `jax.jit(jax.vmap(...))` is better than `jax.vmap(jax.jit(...))`.
- Put a bunch of shared variables into a class and then include `self` in the list of `static_argnums`. This is a useful strategy for having a large number of static args.
- `jax.jit(f).lower(*args).compile()` is a useful snippet for compiling a function without running it (see the sketch after this list). There is a long-running JAX project to build better ahead-of-time compilation tools.
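As a rough illustration of the first and last patterns (the loss function and shapes here are made up), the scalar-level code is written once, vectorized and jitted at the outermost layer as `jax.jit(jax.vmap(...))`, and compiled ahead of time with `.lower(...).compile()`:

```python
import jax
import jax.numpy as jnp

def loss_one(theta, x):
    # un-vectorized, per-example computation
    return (jnp.dot(theta, x) - 1.0) ** 2

# Vectorize over examples, then jit at the outermost layer: jit(vmap(...)).
loss_batch = jax.jit(jax.vmap(loss_one, in_axes=(None, 0)))

theta = jnp.ones(3)
xs = jnp.ones((8, 3))

compiled = loss_batch.lower(theta, xs).compile()  # traces and compiles, but does not run
print(compiled(theta, xs).shape)                  # (8,)
```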
Techniques for avoiding data-dependent Python control flow that will make JAX complain under `jit` (a small sketch follows this list):
- `jnp.where(...)`
- `jax.lax.fori_loop`, `jax.lax.while_loop`, `jax.lax.cond`: https://jax.readthedocs.io/en/latest/notebooks/Common_Gotchas_in_JAX.html#structured-control-flow-primitives
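For example (these ReLU-style functions are made up for illustration), a value-dependent branch can be expressed elementwise with `jnp.where` or on a scalar with `jax.lax.cond`, both of which jit cleanly:

```python
import jax
import jax.numpy as jnp

@jax.jit
def relu_where(x):
    # elementwise: both operands are computed, the condition selects per element
    return jnp.where(x > 0, x, 0.0)

@jax.jit
def relu_cond(x):
    # scalar: only the selected branch is executed (outside of vmap; see "Scary things")
    return jax.lax.cond(x > 0, lambda v: v, lambda v: 0.0, x)

print(relu_where(jnp.array([-1.0, 2.0])))  # [0. 2.]
print(relu_cond(2.0))                      # 2.0
```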
Other things:
- understanding pytrees in JAX is really helpful: https://jax.readthedocs.io/en/latest/pytrees.html
- other difficult stuff: https://jax.readthedocs.io/en/latest/notebooks/Common_Gotchas_in_JAX.html
- useful issues talking about using JAX with async/await:
Scary things:
- Sometimes `jax.lax.cond` combined with `jax.vmap` will result in both branches of your `cond` executing. In extreme cases, this can result in infinite loops. See the problem James and I ran into here; a small illustration of the mechanism follows.
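One hedged way to see this (the function below is made up): inspect the jaxprs. Called directly, `lax.cond` stays a `cond` primitive and only the selected branch runs; under `vmap` with a batched predicate it lowers to a select-style computation in which both branches are evaluated for every element.

```python
import jax
import jax.numpy as jnp

def f(x):
    return jax.lax.cond(x > 0, jnp.sin, jnp.cos, x)

# Direct call: the jaxpr keeps a `cond` primitive, so only one branch executes.
print(jax.make_jaxpr(f)(1.0))

# Under vmap: sin and cos both appear unconditionally, followed by a select.
print(jax.make_jaxpr(jax.vmap(f))(jnp.arange(3.0)))
```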
This is how marking `self` (or any other object) as static works:
- the object id is used as the hash.
- every member of the object is also treated as static.
- but because the object id is the top-level hash, the members never need to be hashed themselves and are just treated as compile-time constants.
- this means that if you later change the members of the object, JAX will not reflect those changes!
Demonstration:
```python
import jax
from functools import partial

class wow:
    def __init__(self, x):
        self.x = x

    @partial(jax.jit, static_argnums=(0,))
    def f(self):
        return self.x

W = wow(10)
print(W.f())
W.x = 9
print(W.f())  # still 10: JAX reuses the compilation cached under id(W)
```
output:
```
10
10
```
It may be useful sometimes to force lazy evaluation of JAX expressions (see example).
In the example file, we see a class `ArraySlice0`, which represents an array sliced along axis 0.
Though the restriction to axis 0 is unnecessary, it leads to simpler code and is sufficient for our applications.
The purpose of this class is to allow for dynamic slicing.
JAX cannot jit `a[i:j]` where `i, j` are non-static, because the shape of the result would depend on traced values.
However, although the slicing itself cannot be jit-ed, if we only access the sliced array through other operations such as `__getitem__`, and those operations can be jit-ed, we can lazily evaluate the slice by fusing it with those operations.
So, the following is not jit-able:
```python
def f(x, y):
    y = y[x[0]:x[1]]
    return y[0]
```
However, the following is jit-able:
```python
def f(x, y):
    y = ArraySlice0(y, x[0], x[1])
    return y[0]
```
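For concreteness, here is a hypothetical minimal sketch of what an `ArraySlice0`-style wrapper could look like; the real class lives in the example file and surely differs in its details (bounds handling, extra methods), so treat this only as an illustration of the fusion idea.

```python
import jax
import jax.numpy as jnp

class ArraySlice0:
    """Lazily represents arr[start:stop] along axis 0 without materializing the slice."""

    def __init__(self, arr, start, stop):
        self.arr = arr      # the full array
        self.start = start  # traced (non-static) start index
        self.stop = stop    # traced (non-static) stop index

    def __getitem__(self, i):
        # Fuse the slice with the lookup: index the full array at start + i.
        # (A fuller implementation would also handle bounds and the stop index.)
        return self.arr[self.start + i]

@jax.jit
def f(x, y):
    y = ArraySlice0(y, x[0], x[1])
    return y[0]

print(f(jnp.array([2, 5]), jnp.arange(10)))  # 2
```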
- Hashing a JAX DeviceArray
- PartialVal is a hidden internal API for caching intermediate computations in JAX
- reverse-mode differentiable bounded while loop
- "why is jax so fast?"
- Can you precompile a JAX function before running it the first time?
  - Yes, and maybe you should do it inside a separate thread so that the main thread can continue doing whatever it is doing! (See the sketch at the end of this list.)
- conditionals based on `lax.cond` (and `lax.switch`) only evaluate the selected branch, while `jnp.where` / `jnp.select` evaluate every branch regardless of the condition.
- A small library for creating and manipulating custom JAX Pytree classes
- fast lookup tables in jax
- np.interp in jax - note that we also have an interpnd implementation!
- scan vs while_loop
- higher order derivatives via taylor series!
- extending jax with a custom C++ or CUDA operation
- experimental sparse support
- an issue about implementing scipy.spatial
- An interesting question: Would it be possible to implement a JAX KDTree?? What restrictions would make it possible? What modifications could be made so that it works well?
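On precompiling in a separate thread (mentioned above): here is a rough sketch under assumed names, combining `.lower(...).compile()` with a plain `threading.Thread` so the expensive compile happens off the main thread.

```python
import threading
import jax
import jax.numpy as jnp

def f(x):
    return jnp.sin(x) ** 2

example_x = jnp.zeros(1024)  # arguments with the shapes/dtypes you plan to use
compiled = {}

def compile_in_background():
    # Compile ahead of time without executing f, and stash the executable.
    compiled["f"] = jax.jit(f).lower(example_x).compile()

worker = threading.Thread(target=compile_in_background)
worker.start()
# ... the main thread keeps doing whatever it is doing ...
worker.join()

print(compiled["f"](example_x)[:3])  # run the precompiled executable
```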