-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Stats Endpoint #396
Comments
Thank @MichaelLukowski to get this rolling. Maybe is it worth it also to add another bit about the "name" of the DRS Server? |
@MichaelLukowski OK to assign you as the champion? I changed you to the owner of the ticket |
@briandoconnor yes that's fine. I'll take it |
Revving this since we'll discuss at GA4GH connect. Can we do something really simple like the information that powers https://stats.gen3.org? What about expanding service_info? That seems like a natural place rather than creating a new stats endpoint? In this mockup I also included geo/cloud location information (as an array since a given DRS server may manage data across multiple locations). It would be useful to be able to have a map of DRS servers and to know general high-level stats on them. There might be interest to break down stats by geo location but maybe that's overkill. Still, I like including a location since it will help us draw a map of the DRS servers out there....
|
Do we know if others standards have used I'm trying to understand the log/lat idea. A location of a DRS Server does not really give an idea of what data is there, so it's more to have them populated on a map? If that so, would not make more sense to build the map using the location of the institutions/partners that are running the Server? |
An example of the endpoint that powers stats.gen3.org is this https://gen3.biodatacatalyst.nhlbi.nih.gov/index/_stats Simply it is just a single json record that contains "fileCount" and "totalFileSize" fields. I personally don't know of a standard that has modified the I will have a PR for this later this week. For right now I will add a new |
In the Cloud WS meeting on Aug 12th, 2024 we decided to simplify the feature described in Issue #396 for DRS release 1.5. I updated this feature branch to just include the following two variables in the service-info endpoint:
The goal is to keep the metadata as simple as possible while still providing some useful understanding of the contents of a given DRS server. |
total_file_size should not be an integer in bytes as this would only allow us to signify 2 billion bytes ~ 2GB Suggest to use string or number as the type for this field |
Per the connect discussions I believe it would be a good idea to set up a
stats
endpoint for DRS servers.This will allow us to build something like a dashboard for existing DRS servers and show the research community just how much data is being hosted on existing DRS servers.
I think simply the endpoint can just be a
GET
to/stats
and the minimal information required from this endpoint would betotal_files
andtotal_file_size
.I would love to get some discussion related to this.
The text was updated successfully, but these errors were encountered: