-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add in serviceability, release doc & re-arrange README and doc md files #282
Add in serviceability, release doc & re-arrange README and doc md files #282
Conversation
Codecov ReportPatch coverage:
Additional details and impacted files@@ Coverage Diff @@
## main #282 +/- ##
==========================================
- Coverage 84.61% 83.91% -0.71%
==========================================
Files 27 31 +4
Lines 3530 4259 +729
==========================================
+ Hits 2987 3574 +587
- Misses 402 516 +114
- Partials 141 169 +28
☔ View full report in Codecov by Sentry. |
@@ -0,0 +1,58 @@ | |||
# Serviceability |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Correct me if I'm wrong but I assume the target audience is 1) the HAS developer writing code that needs to emit serviceable logs and 2) the SRE that needs to know where to look for logs and troubleshoot. If this is the case, let's update the doc with this info
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What kind of info are we looking for on 1 and 2 in your suggestion..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just an overview blurb describing the purpose and intent of this doc. When I read this doc without any context, I see it can serve two purposes 1. From a developer perspective for someone who is unfamiliar with instrumenting logs and what is supposed to go into them, the Understanding the Logs
topic can be useful. 2) from the troubleshooting perspective for a developer/SRE. Understanding where to look for the logs and what to do with them is also useful.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@kim-tsao I can update this PR now that we have more info on getting access doc
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated!
docs/serviceability.md
Outdated
View the application-service controller logs by tailing the manager container log of the controller manager pod. Example, | ||
|
||
``` | ||
oc logs -f application-service-application-service-controller-manager -c manager |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Depending on where this cluster is i.e. local cluster, staging or prod, we will need to understand how we can get access in order to run this command but I think the preferred approach for a managed service is to pull the logs from a centralized log store like Cloudwatch or Splunk since the retention period is longer. Since the details are unknown for now, we can clarify and say Deploy on a Local Cluster
with another section or sub-section for Deployed on Managed Clusters
as TBD
``` | ||
|
||
### Understanding the Logs | ||
Each application-service controller logs their reconcile logic to the manager. The log message format is generally of format |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should also link to this ADR once it's been merged. It explains the expected log conventions. Maybe add a placeholder to link to the AppStudio log conventions ADR so it serves as a reminder
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The ADR has been mentioned in the doc
docs/serviceability.md
Outdated
|
||
## Debugging | ||
|
||
- Insert break points at the controller functions to debug unit tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This only explains debugging unit tests which is pretty narrow if there is a real problem. For managed services, we would typically get a copy of the logs, scan it for errors and then search the code repo with the error message snippets to see where the problem may be coming from but we need to be careful since some messages may be concatenated so searching on a unique word may be better.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I put in more info on how to debug from the code.
docs/serviceability.md
Outdated
- When debugging unit and integration tests, remember that the mock clients used are hosting dummy data and mocking the api call and returns | ||
|
||
## Common Problems | ||
- A Github Personal Access token is required as the application-service controller requires the token for pushing the resources to the GitOps repository |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are you suggesting users use their own github PAT to create gitops repos under their own user org? if so, link to these instructions https://github.com/redhat-appstudio/application-service#creating-a-github-secret-for-has. We should also follow github's suggestion to use a fine grained token instead of the classic one: https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/creating-a-personal-access-token#fine-grained-personal-access-tokens
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated
8386a98
to
655f366
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left some comments, but overall liked the changes.
README.md
Outdated
|
||
A Kubernetes operator to create and manage applications and control the lifecycle of applications. | ||
|
||
## Building & Testing | ||
This operator provides a `Makefile` to run all the usual development tasks. If you simply run `make` without any arguments, you'll get a list of available "targets". | ||
This repository is closely associated with the [application-api](https://github.com/redhat-appstudio/application-api/) repository, which contains the public APIs. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: public APIs for what?
docs/build-service.md
Outdated
@@ -0,0 +1,11 @@ | |||
# Setting up the AppStudio Build Service environment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is no longer necessary for HAS
docs/build-and-test.md
Outdated
@@ -0,0 +1,43 @@ | |||
# Build and Test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TBH, I like having build and deploy together in the same file.
docs/deploy.md
Outdated
|
||
The following section outlines the steps to deploy application-service on a physical Kubernetes cluster. | ||
|
||
## Setting up the AppStudio Build Service environment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See my other comment, I think this can be removed. It hasn’t been needed for HAS since mid last year.
docs/serviceability.md
Outdated
- When creating a `Component` from the `ComponentDetectionQuery`, remember to replace the generic application name `insert-application-name`, if the information is being used from a `ComponentDetectionQuery` status | ||
|
||
## FAQs | ||
Q. Where can I view the application-service api types? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would also suggest linking to the book of appstudio.
docs/serviceability.md
Outdated
- the log message. For example, `"msg":"Finished reconcile loop for user-tenant/devfile-sample-go-basic-development-binding-hr9nm"`. You may search for the string `Finished reconcile loop for` in the application-service repository to track down the code logic that is emitting the log. Remember to look out for resource name concatenation and/or error wrapping that may not turn up in your code search. It is advised to exclude such strings from the code search for debugging purposes | ||
|
||
## Common Problems | ||
- When deploying HAS locally or on a local cluster, a Github Personal Access Token is required as the application-service controller requires the token for pushing the resources to the GitOps repository. Please refer to the [instructions](../docs/deploy.md#creating-a-github-secret-for-application-service) in the deploy section for more information |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don’t think I saw steps for deploying HAS locally (I.e. outside of a container)?
docs/serviceability.md
Outdated
|
||
## Debugging | ||
|
||
- Insert break points at the controller functions to debug unit tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How to setup HAS with debugger (e.g. dlv in vscode) would be good in this section
oc logs -f application-service-application-service-controller-manager -c manager -n application-service | ||
``` | ||
|
||
### Deployed on a Managed Cluster |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps just leave this section out since it’s RHTAP specific
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I will probably keep it in for now, because Kim suggested we put it here in her PR review. I think we can iterate over it and move it in the future if we dont want it.
docs/serviceability.md
Outdated
``` | ||
make install | ||
make build | ||
./bin/manager |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This feels like it should go in the deploy doc? Plus I believe some environment variables (disabling webhooks and git token) need to be set before running the binary.
then just say here that the logs are available in the terminal window
docs/serviceability.md
Outdated
@@ -0,0 +1,76 @@ | |||
# Serviceability | |||
|
|||
The serviceability document aims to help local application-service developers and SRE engineers to access and service the application-service component. This document will help you understand how to access and understand the application-service logs, how to debug an application-service problem and provides a quick summary on the various questions that you might have regarding application-service. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
NIT: SRE (Site Reliability Engineer) Engineer? 😛
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
2593b4b
to
f1d3ce9
Compare
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
@johnmcollier I have addressed your PR review, PTAL! Thanks. |
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: johnmcollier, maysunfaisal The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What does this PR do?:
Which issue(s)/story(ies) does this PR fixes:
https://issues.redhat.com/browse/DEVHAS-294
PR acceptance criteria:
Unit/Functional tests
Documentation
Client Impact
How to test changes / Special notes to the reviewer:
Adds in information about logging, debugging, common problems, FAQs. Rearranges md files.
Inspired from the https://github.com/redhat-appstudio/managed-gitops repo