Add metrics gathering #284
Conversation
- scripts run on md and rst files
- scripts run when there are no files
- some junk data in .sphinx is excluded
- jq dependency is removed
- vale dependency is installed for readability metrics
- allmetrics entry added to list of make commands
- draft implementation of pre/code metric
- update allmetrics
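For illustration, a minimal sketch of what a metrics-gathering script along these lines might look like; the paths, metric names and `metrics.yaml` layout are assumptions for the example, not this PR's actual implementation:

```bash
#!/usr/bin/env bash
# Hypothetical metrics sketch: counts source files and words, then records
# the results in metrics.yaml. Paths and metric names are illustrative only.
set -euo pipefail

SOURCE_DIR="${1:-.}"
METRICS_FILE="metrics.yaml"

# Collect Markdown and reStructuredText sources, excluding Sphinx build junk.
mapfile -t files < <(find "$SOURCE_DIR" -type f \( -name '*.md' -o -name '*.rst' \) \
  -not -path '*/.sphinx/*' -not -path '*/_build/*')

file_count="${#files[@]}"
word_count=0

# Guard against the "no files" case so wc is never called with zero arguments.
if [ "$file_count" -gt 0 ]; then
  word_count="$(cat "${files[@]}" | wc -w | tr -d ' ')"
fi

# Print to the terminal and write metrics.yaml using plain shell only (no jq).
{
  echo "source_files: $file_count"
  echo "source_words: $word_count"
} | tee "$METRICS_FILE"
```

In the PR itself this kind of script sits behind a `make allmetrics` target; the sketch only shows the general shape (plain shell, no jq, and a guard for the no-files case).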
@SecondSkoll could you take a quick look at this one? Especially the part about storing results in a /metrics/ directory. If you don't see any problem with that approach, I'll go ahead and use the same thing for storing spelling check and Vale results.
I haven't reviewed this because, while it seems perfectly functional, I disagree with the fundamental concept of putting test results into the repository. I haven't seen the spec outlining the testing, and I don't have any context for testing performed by competitors. I don't have full context for what this intends to be, but I believe it is to be able to track these metrics over time. I don't think we will get anything valuable out of that because of how significantly documentation changes over time. Each cycle, I would expect every documentation set to change significantly (content, dashboard commitments, etc.). Those changes would make raw data like this unusable if you tried to draw any sort of trend over time with more than two data points.

I would personally prefer just an action that runs on a PR and on main, and then you would be able to manually compare results over time (you can set workflow artefacts to last 400 days - you could do that for a project and then go in and document things and provide the contextual information that would be needed to actually use this data).

I don't think this is appropriate for spelling tests at all, and I have similar concerns for Vale-based style guide tests. I don't really understand what the testing in this PR aims to do, and what problem it is trying to address (other than maybe the readability score).
Storing output in the repo isn't ideal, but we currently have nowhere else to put it and are somewhat limited by the commandment that everything has to be able to be run from the Makefile 🤷. Ideally these jobs would run from GH and store results elsewhere, but that isn't going to happen any time soon.
There is the option of running jobs as GitHub Actions and then storing their results as GitHub artifacts (I have used that option in the other Vale-related PR that is currently open). The artifacts are stored for 90 days by default and can be downloaded any time before that, using the link that GH provides within the action's results. Will that help?
I think a larger discussion needs to be had about the value of these metrics and how they can/should be used. I understand you need to gather data to be able to process it and synthesise something valuable from it, but from what I can see right now it's likely the data we're gathering will inform the outcome rather than the desired outcome informing the data gathering.

I do think that storing these tests as an artefact would be better than storing them in the repo (and you could do integration tests by comparing to the previous artefacts - and you could also make the pipelines trigger on a schedule so you never lose the most recent ones). But I would also keep in mind that you can go back to any commit and download all the files to run these tests against if you want to.

I don't want to block this, but I feel strongly about the documentation metrics being purposeful - and the purpose of these tests hasn't been explained to me (again, other than readability - which I also think shouldn't be Flesch-Kincaid). I also feel strongly that storing these artefacts in the repository is a bad idea.
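If the artifact route were taken, comparing a PR's metrics against the most recent run on main could look roughly like the sketch below. The workflow file name (`metrics.yml`) and artifact name (`metrics`) are hypothetical, and the sketch assumes the GitHub CLI is available:

```bash
#!/usr/bin/env bash
# Hypothetical comparison against a previous run's artifact, as suggested above.
# Assumes a workflow already uploads metrics.yaml as an artifact named "metrics";
# the workflow and artifact names here are illustrative, not from this PR.
set -euo pipefail

# Find the latest successful run of the (assumed) metrics workflow on main.
run_id="$(gh run list --workflow metrics.yml --branch main --status success \
  --limit 1 --json databaseId --jq '.[0].databaseId')"

# Download its "metrics" artifact into a scratch directory.
gh run download "$run_id" --name metrics --dir previous-metrics

# Compare the previous snapshot with the current one.
diff previous-metrics/metrics.yaml metrics.yaml || true
```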
Thanks for all the input, everyone. There is definitely room in the future for some discussion about which metrics might be preferred and how they might be analysed. Update:
While I still feel ARI is a better readability metric for technical documentation, I think this is overall much better.
It's probably worth mentioning that Vale has a native readability style, which might have been easier than doing this manually - and will make it easier to swap if we want to.
Thanks @SecondSkoll. I appreciate your time on this and your thoughtful remarks, which will need to be considered in future iterations. There's nothing stopping us from swapping in ARI later on, or considering multiple readability metrics. I personally don't have strong views on that and would be interested to hear about the relative merits. ARI seems to rely only on counts of words, sentences and characters, so it would be easy to implement in bash without invoking Vale at all (if that's what we wanted).

On the Vale readability styles: we were aware of them, but because they included several outputs we weren't interested in, and also didn't include some things that we wanted, we opted to use the metrics Vale outputs natively (via ls-metrics). I would have preferred not to use Vale at all because it requires a venv and slows the script down; it was the last metric to be implemented and the rest didn't require Vale. In future we should consider which approach offers us the most flexibility to adjust the metrics as well as the best capacity to maintain them.
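Since ARI only needs character, word and sentence counts, a plain-bash version is indeed short. The following is only a sketch of that idea (with naive sentence detection and an illustrative script name), not code from this PR:

```bash
#!/usr/bin/env bash
# Rough ARI (Automated Readability Index) sketch for a single text file.
# ARI = 4.71 * (characters/words) + 0.5 * (words/sentences) - 21.43
# Sentence detection here is naive (counts . ! ?), so treat results as approximate.
set -euo pipefail

file="${1:?usage: ari.sh <file>}"

chars="$(tr -cd '[:alnum:]' < "$file" | wc -c)"
words="$(wc -w < "$file")"
sentences="$(tr -cd '.!?' < "$file" | wc -c)"

# Avoid dividing by zero on empty or sentence-less files.
if [ "$words" -eq 0 ] || [ "$sentences" -eq 0 ]; then
  echo "ari: not enough text to score" >&2
  exit 1
fi

awk -v c="$chars" -v w="$words" -v s="$sentences" \
  'BEGIN { printf "ARI: %.2f\n", 4.71 * (c / w) + 0.5 * (w / s) - 21.43 }'
```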
I'm going to merge this in, as I would like to have a couple of days of code freeze to update the extension version with all the functionality in main, so we can start using that as a default.
Thanks. Sorry for the delay in checking in on this, have been ill :'(
- A `/metrics/` directory for metrics scripts and `metrics.yaml`
- Metrics gathered from source (`.md`, `.rst`) and build (`.html`) files
- A make command `allmetrics` that outputs metrics on the terminal and updates `metrics.yaml`
DOCPR-880
DOCPR-881
DOCPR-882
DOCPR-883
DOCPR-885