-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
added bazel workspace lsp support #51
base: main
Are you sure you want to change the base?
added bazel workspace lsp support #51
Conversation
Hi @DanielEliad! Thank you for your pull request and welcome to our community. Action RequiredIn order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you. ProcessIn order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA. Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks! |
Hey @DanielEliad , thanks so much for the PR. We're still actively developing this, so I'm surprised to see people finding it! I haven't dug through all the way yet (it's after quitting time on Friday here ;) ), but I do want to say at the outset, we're not really looking to put bazel specific logic inside of this language server directly, so we wouldn't be able to use this as-is. We have multiple projects that use this (build systems, some other tooling) that have different needs around what they can interpret, how path resolution is done, etc, and the starlark part of this needs to remain reasonably generic. (Unless it's changed, I think the spec also still doesn't really define what a "load" is, so we want to keep that API boundary too so projects can do their own thing). I would say, though, it should be reasonably straight forward to implement what you had mentioned as an impl of LspContext that's something more standalone. That's what I've done, personally, for our internal projects, and it's worked well. Let me know if that could work for you, and that's for giving things a look! |
Thank you for taking a look at it :) in any case thank you for working on this repo it's awesome 👍 |
Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks! |
1 similar comment
Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks! |
Thanks @DanielEliad for the contribution! Definitely a super cool thing, CC @aherrmann who was considering doing something similar, and should definitely check this out. Looking at what this patch does:
Then for the Bazel specific logic, there are three options for where it goes:
I'm not too fussed, but my inclination would be to start at option 1 (it's not huge), move to option 2 when it gets bigger, and if it ever takes off properly, move to option 3. While I wouldn't be keen to have Bazel logic in the starlark-rust library itself, having it in the binary we ship alongside is less of a problem, since its a more lump of somewhat useful things, doesn't have to be precisely the standard or widely reusable. At the moment we are fairly rapidly iterating on the external API for the LSP server, so jumping straight to option 3 has costs in terms of breaking stuff fairly often, which would be a bit sad. |
Hey sounds great! I think the best thing to do is combine 2. & 3. And if you would ever want a different binary for the bazel lsp we could create it and either copy the some of the lsp logic or put in a common crate that both binaries will depend on. Thank you for your response! :) |
bf88954
to
bc34622
Compare
I might suggest holding off on doing too much restructuring. There are a few of us who maintain this crate, and there's no guarantee we'll agree. We should be talking tomorrow so can give you an answer then. |
Thanks for the ping @ndmitchell, this looks very interesting! Also pinging @k1nkreet, who was looking into this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
BUILD target resolution
load(SOME_PATH, TARGET_NAME)
redirects to aBUILD/BUILD.bazel
file in that directory and to the target with the nameTARGET_NAME
I'm not sure I understand that point. In Bazel, the load
statement is used to load symbols defined in .bzl
files. E.g. load("//some:file.bzl", "some_symbol")
. How does this relate to targets defined in BUILD
files.
starlark/bin/eval.rs
Outdated
let malformed_err = ResolveLoadError::PathMalformed(path_buf.clone()); | ||
let mut split_parts = path.trim_start_match("//").split(':'); | ||
let package = split_parts.next().ok_or(malformed_err.clone())?; | ||
let target = split_parts.next().ok_or(malformed_err.clone())?; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not familiar with this code-base, yet. So, take my comments with a grain of salt. But, I see a lot of label parsing logic repeated throughout. Perhaps it would be better to factor it out into one function producing a structured label representation?
Bazel supports a number of short-hands, e.g. @foo
is short for @foo//:foo
, or @foo//bar
is short for @foo//bar:bar
, or baz
is short for :baz
. I think it'll be easier to get these right if the label parsing logic is located in one place and not spread out.
EDIT Just saw, the next commit does just that.
starlark/bin/eval.rs
Outdated
Ok(info | ||
.output_base | ||
.join("external") | ||
.join(repository) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note, this logic will probably no longer work once bzlmod
becomes the norm (Bazel >=6.0) and repositories are stored under their canonical repository name.
starlark/bin/eval.rs
Outdated
if build.exists() { | ||
Some(build) | ||
} else if build_bazel.exists() { | ||
Some(build_bazel) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Bazel will prefer BUILD.bazel
over BUILD
if both exist. So, IIUC the order should be swapped.
bazel/src/label.rs
Outdated
let target = split_parts.next()?; | ||
Some((package, target)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note, //foo
can be used as a short-hand for //foo:foo
,
and relative labels can skip the :
, e.g. foo
instead of :foo
.
|
||
fn split_repository_package_target(label: &str) -> Option<(&str, &str, &str)> { | ||
let mut split_parts = label.split("//"); | ||
let repository = split_parts.next()?; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And @foo
as a short-hand for @foo//:foo
.
Hey :) For now I could check for all permutations of a name this is not completely hermetic but there seem to be a lot of issues under the bazel repository that want this information to be exported somehow. |
…ading //package:target we now load // as the workspace the file is in right now
…pository name.<version>
@Wyverald can help maybe? |
Hey @DanielEliad , sorry, I was out for a bit and am just catching up. So, I do want to detangle things a bit, and also I guess get my opinion in writing here (@stepancheg might want to too :) )
My first inclination was "No. Nope nope nope, generic all the way". However, making someone depend on things externally also means we can't change the APIs as willy-nilly as we can now. And I'm lazy, and don't want to have to deal with that. So, I think I'd be fine keeping this in the repo, but only for the starlark bin. Bazel logic should never be in the library portion. I'd also rather it were kept in a The only other concern I'd have, is if there's going to be a lot of iteration on the bazel api side of things. If there's potentially going to be a lot of churn there, I'd honestly rather it were kept in another repo (and #52 seems to suggest that there would at least be some codegen tools, etc to create the bazel builtins interface). Not that we don't love you folks, but a higher volume of pull requests for a piece of the project that we're not even really supporting does become a bit of a support load after a while. But, this is just one person's thoughts, I'll let the rest of the team chime in :) |
My initial thought was in the repo is fine. But as I see the iteration/discussion between three different people on what this Bazel stuff should do (which I love to see!), I worry that we are putting non-Bazel people in the loop as the ultimate arbiters of what gets committed, and I worry we're going to suck at that. A separate repo would let the Bazel interested parties have more freedom to do what makes sense there, without us slowing you down. Those are my thoughts, rather than a final answer though! |
One possibility is to extend I. e.
LSP binary will call This script:
Or maybe instead of starlark, This way no need to write rust code to create your custom LSP setup for Bazel (or whoever else needs it). |
}); | ||
match location_result { | ||
// if the location result was successful | ||
// only return a location link if a location was found | ||
Ok(location) => location.and_then(|target_range| { | ||
Some(LocationLink { | ||
origin_selection_range: Some(source.into()), | ||
target_uri: url.clone().try_into().ok()?, | ||
target_range, | ||
target_selection_range: target_range, | ||
}) | ||
}), | ||
// if the location result was an error | ||
// try to at least go to the file | ||
_ => Some(LocationLink { | ||
origin_selection_range: Some(source.into()), | ||
target_uri: url.try_into()?, | ||
target_range: Range::default(), | ||
target_selection_range: Range::default(), | ||
}), | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm actually a little curious how you were hitting this issue. The first unwrap_or_default() gets the default range in the case of an error, and the second one gets the default range in the case of a None
. I actually had a test written up to make sure we caught the problem, and it doesn't actually repro.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because right now what it'll do for loads
, is, if it can't find the symbol, it'll jump to where it is in the load statement, and you can get jump to the file by doing a gotodefinition on the file portion of the load. I could kind of argue either way on that, tbh? Like, if the file doesn't parse, or the symbol doesn't exist, wanting to know that it's supposed to be loaded. Popping a "random" file up might be confusing. But I could also see an argument in the other direction, that you at least want to get to the file that it's in.
I think I'll probably go with adding a little bit of user-facing error reporting, and then making this jump to the file in the load statement (which I think is the only place this should be an actual issue)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
because of how the String Literal definition location is built I first have to redirect to the url of the BUILD file and only then I get the ast
that I can give to the location_finder to find the exact range in the file.
Like looking for target :my_target
will redirect to the url for the closest BUILD
file and then calculate the ast and call location_finder to get the location of my_target
in that file. If it doesn't exist there location_finder returns Ok(None)
.
because resolve_string_literal doesn't have access to the ast and can't calculate it I have to "assume" a target with that name exists parse the file and then decide what the range is.
because of bazel label ambiguity we can try to goto definition on "manual" in tag = ["manual" ]
and then get sent to the closest BUILD file.
So in case the location_finder couldn't find the range I read that as the resolve_string_literal guess was wrong.
If the LspContext had a way of calculating the ast the logic would be a lot simpler
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, we don't have the context do the calculation because it can't be assumed to have the most up to date file contents. (The server maintains contents of files that are open and edited, but not necessarily saved).
I'm still a tad confused, tbh, about this behavior. Is the point with the "manual" bit that you wouldn't want the BUILD file to be visited, or... ? Because right now, it should open whatever BUILD file you return from the first pass of resolve_string_literal
, whether Some or None is returned from the location finder.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah I would like it so say can't find definition instead of taking me to the nearest BUILD file,
the location_finder returns Ok(None) if it cant find the target (in this case "manual") in the build file so I don't redirect to the BUILD file
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, we could probably make that an enum and make it more configurable. Because our projects do want that behavior :) Will try to knock something up in the morning.
@stepancheg I'd be down with something like that (further details to be hashed out in an Issue if we go that way :) ). It'd also I think work with @laurentlb 's concerns in #52 . Agreed a simpler interface for users who have non-starlark-rust interpreters might be desirable :) |
@stepancheg nice idea! I imagine that most users of the script will find Starlark too limiting (e.g. they'll want to read and parse a proto file), so pregenerating it as JSON might be simpler for the user, and probably simpler for us. |
To clarify, though path resolution inherently tends to require some programmatic logic, so you'd still have to have a starlark file. Just, it might get to have access to the pregenerated json from whatever script you invoke (which e.g. might parse through protobufs, etc) :) But for "builtin" deifnitions, yeah, JSON would be nice, since we already have a |
Thank you all for taking the time to look at this!
I would need to generate a definition location for Maybe the quickest solution here would be to depend on this crate from a different repo and implementing What do you think? |
Currently the Starlark AST is (deliberately!) not exported. The reason was that certain aspects of the AST (e.g. where we box etc) are driven by somewhat ad-hoc concerns, and thus it isn't a good stable API. Would there be a subset of operations/queries on the AST that would be sufficient? I kind of suspect that exporting the entire AST sooner or later is going to be necessary, so if this is the change that makes it necessary, that's not a problem either. |
I forgot about resolving target names. Then the resolver script should accept a list of string literals. It becomes complicated.
That was the idea.
Yes, simpler API with only as little dependencies on starlark internals would be better. |
So in terms of the API I would need from the ast, I think the minimum now is: as the trait So what I'll do is convert this pr into making I'll keep working on it and get back to you with the refactor soon :) |
So getting back to this, and in conjunction with #68, there are three broad pieces:
Let me know if anything seems missing. |
@MaartenStaa - does your PR's cover this? Anything remaining here we should seek to be merging? |
Hi :)
I found this project after looking for an lsp server for working on bazel projects,
I added some load path resolution features to make it be able to handle bazel workspaces and remote repositories.
I added:
load(SOME_PATH, TARGET_NAME)
redirects to aBUILD/BUILD.bazel
file in that directory and to the target with the nameTARGET_NAME
This PR also supports bzlmod and regular workspaces,
local_path_override
and--override_repository
Thank you for taking the time to look at it :)