Vanishing messages, removed queues #813
-
This seems to be a server issue, but I am not sure where to report it.
First I ran sr3 on my home desktop, metpx version 3.00.45 with the
I verified that the received notifications matched the downloaded files. There were no errors, only a few warnings in the logs. The next attempt was retrieval of the 00Z model run on my home computer. During that time, I monitored the network traffic with
Again, there were no errors or even warnings in the log files. Are there restrictions on the server side for the amount of data a client can download? Or is something else going on?
Replies: 16 comments 20 replies
-
stating the obvious: no, that's not normal, you should have gotten the full complement every time (presumably 193 on every line.) there are restrictions related to having too high a backlog (10,000 files in the queue or something like that), but I don't think that would be the problem here. note: the configuration seems to be posted twice
-
I will try again for the 18Z model issue on my home system (faster internet, bypassing NOAA gateway).
-
At 21:10Z I have received ~1500 files. I see connections to 3 hosts: 105.189.10.90, 91 and 47. Only two are active, the .47 traffic rate is ~30 kb/s all the time. The total volume retrieved is ~0.95 GB. So it is much better than yesterday, when it timed out at about 0.8 GB.
-
Forgot about the queue. Yes, it is
-
@petersilva, would
-
Final result same format as before:
Interestingly, the earlier files are missing. Note that the total number of files for this model is about 32,000, and they all arrive within ~15 min. The 10k limit on the number of messages is too small, unless the client is on a fast network.
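As a rough back-of-the-envelope check of that claim, here is a small sketch of the drain rate a client would need to keep its backlog under the cap. It assumes messages arrive at a uniform rate over the ~15 min window and that the client consumes at a constant rate; the function name and the uniform-arrival model are illustrative assumptions, not anything taken from sr3 or the server configuration.

```python
def min_drain_rate(total_msgs, burst_seconds, queue_limit):
    """Smallest constant consumption rate (msg/s) that keeps the backlog
    at or below queue_limit, assuming uniform arrivals over the burst."""
    if total_msgs <= queue_limit:
        return 0.0  # the whole burst fits in the queue
    # The backlog peaks at the end of the burst:
    #   total_msgs - rate * burst_seconds <= queue_limit
    return (total_msgs - queue_limit) / burst_seconds


# Figures from this thread: ~32,000 messages in ~15 minutes, 10,000 cap.
rate_10k = min_drain_rate(32_000, 15 * 60, 10_000)
# Same burst against a larger cap, for comparison.
rate_50k = min_drain_rate(32_000, 15 * 60, 50_000)

print(f"10k limit: {rate_10k:.1f} msg/s, 50k limit: {rate_50k:.1f} msg/s")
```

Under these assumptions the client has to process roughly 24 messages (i.e. downloads) per second for the whole burst to stay under a 10,000 cap, whereas a cap larger than the burst itself imposes no minimum rate for a single run.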
-
Can someone check the cron jobs on dd.weather? I have a feeling that if one is more than 3 hours behind, the files get deleted at source. Then attempts to download fail, and they get queued for retry for days, and that will slow things down. Perhaps do sr3 status on your subscribers when downloading. There should be a retry queue listed. See what it looks like, and perhaps also consult the lag display.
-
To tell the truth, I never liked having the queue limit at all. We were forced to do something because we had people starting queues and leaving them there, where they would build up to millions, and slow the service for everyone... kind of an inadvertent DoS attack. The analysts on duty picked 10,000 and we thought about it a bit, but it was really just a guess on our part. It could very well be that a higher number is needed.
-
can someone clarify why this is closed? I'm not sure if it's resolved... whoever closed it, please note your reasoning. I'm worried I closed it myself with a mis-click.
-
12Z runs: they all have 183... 1 extra? at any rate the results seem fairly consistent.
While the downloads are happening, look at how much lag is reported. The more lag, the more likely stuff will get dropped.
-
Lags were huge, 1000+ s. I am pretty sure it is the queue limit issue. What I don't understand is why files were missing from the earlier forecast hours while the later periods came through OK. It looks like the queue was purged when the limit was hit.
-
I'm going to ask about raising the limit to 50,000
-
The other thing is, maybe you (@yt87) need to talk to your networking people to inquire about bandwidth between us. 1MB is kinda slow. Since getting it to my house is no problem... (meaning it exits all Canadian infrastructure) the bottleneck must be further away from us.
-
@yt87 if you send an email to ec.dps-client.ec@canada.ca then we can set you up with an account besides anonymous...
-
after internal discussions, there was agreement in principle to raise the limit to 50,000. Such a change has to go through change management, so it will probably take a couple of weeks before it happens.
-
Isn't it weird that if you close a discussion, it doesn't show up in the Discussions list?
I believe when the queue limit is reached, the oldest messages are purged from the queue and the 10,000 most recently published will remain.
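To illustrate why that behaviour would explain the missing early forecast hours, here is a toy model of a drop-oldest queue. It is only a sketch: it uses Python's `collections.deque` as a stand-in for the broker queue, assumes the worst case where the client consumes nothing during the burst, and takes the 32,000-message run size quoted earlier in the thread. It does not reproduce the server's actual configuration.

```python
from collections import deque

QUEUE_LIMIT = 10_000   # cap mentioned in this thread
TOTAL_MSGS = 32_000    # approximate number of messages per model run

# deque(maxlen=N) silently discards the oldest item when a new one is
# appended to a full queue, which mimics a drop-oldest overflow policy.
queue = deque(maxlen=QUEUE_LIMIT)
for seq in range(TOTAL_MSGS):   # seq 0 stands for the earliest message
    queue.append(seq)

dropped = TOTAL_MSGS - len(queue)
print(f"oldest surviving message: {queue[0]}")
print(f"newest surviving message: {queue[-1]}")
print(f"dropped from the front:   {dropped}")
```

In this worst case everything published before message 22,000 is gone and only the tail of the run survives, which matches the pattern of early files missing while later ones arrive. A real client drains the queue while messages arrive, so the actual loss would be smaller.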
You could further tweak the subtopics a bit to reduce the number of messages that get put into your queue. Since you only want every third hour, you could try