Detect infinite recursions in `nickel doc` #2055

yannham · 2024-09-27T12:02:00Z

Closes #1967.

This commit fixes an issue where legit recursive configurations cause infinite recursion when trying to extract their documentation. Indeed, nickel doc calls to eval_record_spine, which is unable to detect loops that spans multiple evaluation steps (an evaluation step is evaluating a field to a WHNF).

This commit adds a lock and unlock method to the interface of thunks, which amounts to blackholing, and use them in the implementation of eval_record_spine such that the thunks of the parents of the field that we are currently evaluating - the active parents - are always locked. If there is a cycle, this will raise an infinite recursion error, that we ignore - we can still print the unevaluated value for the user - but will prevent the evaluation from going on forever.

This technique should be used to fix other similar issues with force and deep_seq, but this is left for future work.

yannham · 2024-09-27T12:03:14Z

core/src/eval/cache/lazy.rs

    /// Generate an update frame from this thunk and set the state to `Blackholed`. Return an
    /// error if the thunk was already black-holed.
    pub fn mk_update_frame(&mut self) -> Result<ThunkUpdateFrame, BlackholedError> {
-        if self.data.borrow().state == ThunkState::Blackholed {
+        let mut data_ref = self.data.borrow_mut();


The changes in this function are cosmetic (avoid two calls to borrow()/borrow_mut() in a row: just borrow mut from the beginning)

yannham · 2024-09-27T12:04:44Z

core/src/eval/cache/lazy.rs

-            }
-        } else {
-            Ok(None)
+        // If the thunk is already evaluated, we don't return an update frame.


The changes are a bit cosmetic, but not only: previously, if idx.should_update() was false, we would never look at the state of the thunk and always proceed with evaluation. This is actually not correct, because if the thunk is blackholed, we should raise an infinite recursion error. For eval record spine in particular, the parents thunk are in weak head normal form (they are records), so should_update() is false and without this additional change, the detection of the infinite cycle fails.

I guess this code path is no longer triggered in eval_record_spine? But maybe it makes sense to do this anyway...

Yes, I suspect the previous code could miss some infinite recursion errors as well because of that.

github-actions · 2024-09-27T12:10:41Z

Bencher Report

Branch	2055/merge
Testbed	ubuntu-latest

⚠️ WARNING: The following Measure does not have a Threshold. Without a Threshold, no Alerts will ever be generated!
Latency
Click here to create a new Threshold
For more information, see the Threshold documentation.
To only post results if a Threshold exists, set the --ci-only-thresholds CLI flag.

Click to view all benchmark results

Benchmark	Latency	nanoseconds (ns)
fibonacci 10	📈 view plot ⚠️ NO THRESHOLD	495,290.00
pidigits 100	📈 view plot ⚠️ NO THRESHOLD	3,231,100.00
product 30	📈 view plot ⚠️ NO THRESHOLD	847,830.00
scalar 10	📈 view plot ⚠️ NO THRESHOLD	1,537,800.00
sum 30	📈 view plot ⚠️ NO THRESHOLD	840,870.00

🐰 View full continuous benchmarking report in Bencher

jneem · 2024-09-27T14:17:09Z

I think the recursion check has some false positives: with this PR, the input:

{
  outer = {
    mid = { inner | doc "this is inner" },
    z | doc "this is z" = outer.mid,
  }
}

doesn't generate documentation for outer.z.inner. But if I replace the value of z by mid instead of outer.mid then it does generate documentation for outer.z.inner.

yannham · 2024-09-27T15:08:40Z

I wonder if we can do much better than that, at least using the thunk machinery, because when z refers to the thunk outer, we can't really decide if this use is problematic or not. It depends if another field of outer is then accessed, and even then, just seeing mid doesn't tell us if there's an issue or not, as we could have something like:

{
  outer = {
    mid = { inner = outer.z | doc "this is inner" },
    z | doc "this is z" = outer.mid,
  }
}

Which is looping. So either we think it's ok to not print documentation in this case because the structure of the configuration is a bit intricate - do we expect configuration from which we extract doc to exhibit this sort of shape? I honestly don't know.

Or we can lower the detection power and accept to have more false negatives, but it becomes very ad-hoc. For example, we could only detect unguarded recursion, that is when a field directly hosts a naked variable as in z = outer.

Another possible solution is to try to count the number of times we unwrap the same thunk, and instead to halt at one, halt at some other higher empirical value (with the idea that if you recurse 10 times in the same thunk, there is a great change that this is an infinite loop). It has to be baked in the interpreter though, because eval_record_spine only has access to the thunks stored directly at the level of fields, but in z = outer.mid, only the evaluator has the knowledge that outer has been used at some point.

yannham · 2024-09-27T15:10:32Z

What makes me wonder if this example is really a good working example is that you would rather refer to mid directly as a sibling here, and this is handled fine by the implementation of this PR.

However, while it might be ok for documentation, the false positives preclude the usage of this technique for deep sequing and forcing, I believe.

yannham · 2024-09-27T15:17:44Z

Maybe one last possible approach, which is I believe what you hinted at in #1815 (comment), would be to detect regular trees, that is when a contract or a field evaluates to a parent thunk directly, rather than this PR that is more sensitive and detects mere usage. I would indeed require an additional stack/set of active thunks, but shouldn't be very hard to implement either. Le met give it a try

yannham · 2024-09-30T07:08:50Z

@jneem I implemented the last proposal, which seems to handle both your example and the original repro. I'm not sure it's a good idea to do that for deep_seq, as it has a cost - albeit small. I don't know if we want to pay that price for each and every deep_seq. I think the trade-off is different for nickel doc, where we can afford a bit more machinery to avoid infinite recursion.

In any case, it's read for a review.

jneem

Looks good, except that I think there are some left-overs from the previous version.

I haven't thought too much about the semantics, but if performance is a concern during normal eval then I think the state could be moved into the thunks instead of having an external hash set: give each thunk a bit that says whether or not it's active.

jneem · 2024-10-02T07:34:54Z

core/src/eval/cache/lazy.rs

+            data_ref.state = ThunkState::Evaluated;
+        }
+    }
+


This lock/unlock are no longer used, right?

jneem · 2024-10-02T07:40:45Z

core/src/eval/cache/lazy.rs

-            }
-        } else {
-            Ok(None)
+        // If the thunk is already evaluated, we don't return an update frame.


I guess this code path is no longer triggered in eval_record_spine? But maybe it makes sense to do this anyway...

yannham · 2024-10-02T08:25:46Z

I haven't thought too much about the semantics, but if performance is a concern during normal eval then I think the state could be moved into the thunks instead of having an external hash set: give each thunk a bit that says whether or not it's active.

This is a very good point; we could just "mix" the two solutions to get the best of both worlds (no false positive and fast check). For now I'm going to proceed with the current version, but it's good to keep in mind for the deep_seq issue.

yannham · 2024-10-02T09:19:57Z

@jneem Sorry I changed my mind 😛 as the lock and unlock methods were left over, I thought it would not be much effort to implement your last suggestion. Given alignment and padding, adding a bit for locking in ThunkData doesn't change the size of any datastructure, and will let us implement the detection for deep seq easily.

The only drawback is that cleaning up after an eval error is a bit more involved (the external hashset was just dropped upon error, but here I reckon we have to make sure the thunk's state is properly reset, or it could interfere with future evaluation e.g. when deep_seqing in the REPL). It's still reasonable IMHO.

The PR could deserve a new review though.

This commit fixes an issue where legit recursive configurations cause infinite recursion when trying to extract their documentation. Indeed, `nickel doc` calls to `eval_record_spine`, which is unable to detect loops that span multiple evaluation step (an evaluation step is evaluating one field to a weak head normal form). This commit changes `eval_record_spine` and related methods to lock (in practice just flip a bit in the thunk's state) the active thunks, that is the thunks of parents of the field that we are currently evaluating. If we ever come across a thunk that is already locked, the field is left unevaluated, avoiding infinite recursion.

core/src/program.rs

yannham commented Sep 27, 2024

View reviewed changes

github-actions bot temporarily deployed to pull request September 27, 2024 12:04 Inactive

yannham commented Sep 27, 2024

View reviewed changes

yannham requested a review from jneem September 27, 2024 12:22

yannham force-pushed the fix/doc-infinite-recursion branch from a0b0568 to cb65d79 Compare September 30, 2024 06:42

github-actions bot temporarily deployed to pull request September 30, 2024 06:44 Inactive

jneem approved these changes Oct 2, 2024

View reviewed changes

yannham mentioned this pull request Oct 2, 2024

Infinite loop (and memory usage) #1815

Open

yannham force-pushed the fix/doc-infinite-recursion branch from cb65d79 to f774618 Compare October 2, 2024 09:16

github-actions bot temporarily deployed to pull request October 2, 2024 09:18 Inactive

yannham force-pushed the fix/doc-infinite-recursion branch from f774618 to ee0d914 Compare October 2, 2024 09:44

github-actions bot temporarily deployed to pull request October 2, 2024 09:46 Inactive

jneem reviewed Oct 2, 2024

View reviewed changes

core/src/program.rs Outdated Show resolved Hide resolved

jneem approved these changes Oct 2, 2024

View reviewed changes

core/src/program.rs Show resolved Hide resolved

yannham added 2 commits October 2, 2024 15:06

Fix clippy warning

4e9463b

Fix typo in code comment

055312e

yannham enabled auto-merge October 2, 2024 13:07

github-actions bot temporarily deployed to pull request October 2, 2024 13:09 Inactive

yannham added this pull request to the merge queue Oct 2, 2024

Merged via the queue into master with commit 0b251e8 Oct 2, 2024
7 checks passed

yannham deleted the fix/doc-infinite-recursion branch October 2, 2024 13:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detect infinite recursions in `nickel doc` #2055

Detect infinite recursions in `nickel doc` #2055

yannham commented Sep 27, 2024

yannham Sep 27, 2024

yannham Sep 27, 2024

jneem Oct 2, 2024

yannham Oct 2, 2024

github-actions bot commented Sep 27, 2024 •

edited

Loading

jneem commented Sep 27, 2024

yannham commented Sep 27, 2024

yannham commented Sep 27, 2024 •

edited

Loading

yannham commented Sep 27, 2024

yannham commented Sep 30, 2024

jneem left a comment

jneem Oct 2, 2024

jneem Oct 2, 2024

yannham commented Oct 2, 2024 •

edited

Loading

yannham commented Oct 2, 2024 •

edited

Loading

Detect infinite recursions in nickel doc #2055

Detect infinite recursions in nickel doc #2055

Conversation

yannham commented Sep 27, 2024

yannham Sep 27, 2024

Choose a reason for hiding this comment

yannham Sep 27, 2024

Choose a reason for hiding this comment

jneem Oct 2, 2024

Choose a reason for hiding this comment

yannham Oct 2, 2024

Choose a reason for hiding this comment

github-actions bot commented Sep 27, 2024 • edited Loading

Bencher Report

jneem commented Sep 27, 2024

yannham commented Sep 27, 2024

yannham commented Sep 27, 2024 • edited Loading

yannham commented Sep 27, 2024

yannham commented Sep 30, 2024

jneem left a comment

Choose a reason for hiding this comment

jneem Oct 2, 2024

Choose a reason for hiding this comment

jneem Oct 2, 2024

Choose a reason for hiding this comment

yannham commented Oct 2, 2024 • edited Loading

yannham commented Oct 2, 2024 • edited Loading

Detect infinite recursions in `nickel doc` #2055

Detect infinite recursions in `nickel doc` #2055

github-actions bot commented Sep 27, 2024 •

edited

Loading

yannham commented Sep 27, 2024 •

edited

Loading

yannham commented Oct 2, 2024 •

edited

Loading

yannham commented Oct 2, 2024 •

edited

Loading