Introduce a PathBuilder #850

gunnarmorling · 2017-09-18T09:37:14Z

Hey @gsmet, this is a small experiment based on your perf improvement work. You pointed out that the allocation of nodes and paths is a problem and I agree.

We went for the immutable approach back in the day as we often had issues with paths which were referenced e.g. in a constraint violation and then be altered later on in the course of subsequent validation of other constraints. I think the best way forward would be to separate mutable and immutable path and node types. The engine would in general work with the mutable representations and only "freeze out" an immutable reference when needed (constraint violation creation, storing processed path).

As a quick attempt, this PR is doing this very hackishly for paths, could you check whether it already makes a noticable difference? I'd expect a bigger difference when doing the same for nodes, but an early check may be useful to see whether it's in the right direction or not.

our current snapshot

cascade to

We can directly set the element in the list. It avoids some list resizing.

only have the default group Also optimize a bit the advanced case with groups.

default violation Also optimize a bit the concatenation of the default violation and the custom ones to avoid creating a list too small and resize it each time.

A lot of NodeImpls are created during the build of the path and some of them are simply discarded as they are "modified" (NodeImpl are immutable so we create a new) further away. Thus we better avoid building the hashCode each time.

Otherwise, we "compare" the whole path several times while comparing a path.

It avoids copying the list when not strictly necessary

It avoids a lot of initialization/resizing and in the end it's more efficient.

So we don't need to do it twice. Note that this change uncovers the fact that in ConstraintValidatorContext, calling atKey() or atIndex() makes the node iterable. It was already the case before and I think it's acceptable. It brings its own performance improvements as it avoids initializing 1 NodeImpl and creating 1 copy of the Path.

list with one element It doesn't seem necessary to consider more elements as the list will be copied when new nodes will be added.

correctly set in all constructors

Only property and container node exposes it to users.

gunnarmorling · 2017-09-20T06:52:37Z

Hey @gsmet, what should we do about this one? Would you like to pursue the mutable/immutable split for paths and nodes at this time? If not, you could at least integrate my first commit and we should be good to go once you've addressed that one remark about a hash code.

On the split above, instead of using Stack for mutable paths, we also may consider to make this a linked list of dedicated node implementations which should help with reducing memory allocation.

gsmet and others added 21 commits September 18, 2017 09:09

HV-1480 Add a hv-6.0 entry to be able to compare the latest stable with

f970b29

our current snapshot

HV-1480 Add a benchmark validating a bean containing a lot of beans to

27f566a

cascade to

HV-1480 Avoid removing and adding element to the node list

d3672d3

We can directly set the element in the list. It avoids some list resizing.

HV-1480 Copy the hashCode in the copy constructor

e3b444e

HV-1480 Avoid initializing lists and maps in the common case where we

5398159

only have the default group Also optimize a bit the advanced case with groups.

HV-1480 Avoid creating a list in the common case when we only have the

780af71

default violation Also optimize a bit the concatenation of the default violation and the custom ones to avoid creating a list too small and resize it each time.

HV-1480 Avoid computing the hashCode if not necessary

e76f701

A lot of NodeImpls are created during the build of the path and some of them are simply discarded as they are "modified" (NodeImpl are immutable so we create a new) further away. Thus we better avoid building the hashCode each time.

HV-1480 Don't take into account the parent in hashCode and equals

c11fdeb

Otherwise, we "compare" the whole path several times while comparing a path.

HV-1480 Implement a copy on write strategy for the node list

ed8f3b5

It avoids copying the list when not strictly necessary

HV-1480 Centralize the processed works in one single set

7c00b44

It avoids a lot of initialization/resizing and in the end it's more efficient.

HV-1480 We expect at least one node in the path so let's initialize the

2e66bec

list with one element It doesn't seem necessary to consider more elements as the list will be copied when new nodes will be added.

HV-1480 Even if not strictly necessary, the leaf node should be

16c19e5

correctly set in all constructors

HV-1480 Reduce the number of iterations in the other benchmarks

1acf669

HV-1480 Add the new benchmark to the default benchmarks

324b56e

HV-1480 Some more optimizations suggested by Sanne

b82169c

HV-1480 Add another benchmark

d2ea4b8

Mark a test as candidate for the TCK

0e0d825

HV-1480 Only set the property value if required

3b79cdd

Only property and container node exposes it to users.

HV-1480 Getting reference to parent node in a simpler way

bf7b5c0

WIP

da77077

gunnarmorling force-pushed the HV-1480 branch from aa68993 to da77077 Compare September 18, 2017 10:36

gsmet added the On ice while we think about it label Oct 4, 2017

gsmet changed the title ~~HV-1480~~ Introduce a PathBuilder Oct 4, 2017

Base automatically changed from master to main March 19, 2021 08:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce a PathBuilder #850

Introduce a PathBuilder #850

gunnarmorling commented Sep 18, 2017

gunnarmorling commented Sep 20, 2017

Introduce a PathBuilder #850

Are you sure you want to change the base?

Introduce a PathBuilder #850

Conversation

gunnarmorling commented Sep 18, 2017

gunnarmorling commented Sep 20, 2017