Include Content-Length response on getData with single file #155

paulmillar · 2024-02-07T09:50:58Z

The getData API call returns files' data. The response may be in the form of compressed data, archived (zip file) or as simply the file's data (when requesting the data of a single file).

Currently, for content larger than 16 KiB, all responses to getData are missing the Content-Length header and the response entity ("the data") is transferred via chunked encoding.

Using chunked encoding makes sense for dynamically generated content; for example, when IDS is sending compressed or archived ("zip-ed") content. However, chunked encoding is not required when IDS is transferring a single file's data. In this latter case, IDS knows the length of data it will send, so it can include the Content-Length response header and send the file's data without using chunked encoding.

Although the current behaviour (sending a file's data using chunked encoding) is valid HTTP, it is very uncommon for a file server to do this. Personally, I don't know of any other service that uses chunked encoding when sending a file's data. To the best of my knowledge, IDS is unique in doing this.

Moreover, sending the file's data in chunked encoding causes problems for certain clients.

To give a concrete example, the File Transfer Service (FTS) queries the source service with a HEAD request, to discover the file's size. It then uses this information to verify that the complete file was transferred. If IDS is the source then it responds to FTS' HEAD request with a response that does not include the Content-Length header. FTS misinterprets this as the file has zero length, resulting in the transfer failing due to (apparent) file size mismatch.

These problems are ultimately the result of bugs, as using chunked-encoding is a legitimate HTTP response. These bugs have been reported upstream (and I intend to pursue those issues); however, fixing them will likely be a low priority activity. This is because IDS' use of chunked encoding to transfer file's content is such an outlier behaviour.

Therefore, it would be a good idea if IDS were to provide a file's data using an HTTP response that includes Content-Length and sends the data without chunked encoding.

The text was updated successfully, but these errors were encountered:

paulmillar · 2024-02-07T10:02:53Z

#151 is an early pull-request that attempted to resolve this issue. The pull-request triggered some discussion, which is (unfortunately) tied to that pull-request. From this message, there was a proposed strategy for resolving this issue, which I've rephrased below:

Add the new testing framework to allow testing of HTTP response for getData requests.
Write extra unit tests to exercise the case when IDS/Payara doesn't cache the HTTP responses (the HTTP responses are chunked encoded).
Update IDS' behaviour, so sending a single file's data is no longer chunk encoded.

paulmillar · 2024-02-07T10:05:19Z

Following this strategy, there is now a series of three patches available in the development/add-content-length-for-single-file-download branch.

paulmillar · 2024-02-07T10:06:04Z

Would it be easier to process these proposed changes by placing each patch as a separate pull-request, or have all three patches as a single pull-request?

RKrahl · 2024-02-12T10:24:58Z

Dear @paulmillar, first of all many thanks for submitting this issue and also for your persistence in pushing it forward! Sorry for the slow response, I was simply busy with other things.

However, it seems that your impatience make things more complicated than they need to be. Your very first attempt to fix this in #151 with 88f869d was just fine. Yes, I did formulate a concern that this change might result in a reply combining a Content-Length header with chunked encoding, which would be illegal. So I suggested to add a test to make sure that this does not happen, just to be on the safe side. In the meanwhile, @MLewerenzHZB added such a test in #153. This test checks that any reply from IDS either has a Content-Length header or a Transfer-Encoding: chunked, but not both. I didn't asked for any more testing than this. As it turned out, my original concern was ill-founded.

Your extended test framework is interesting, but seems to be a little oversized for this issue, in particular as it adds another dependency on a third party library. So I rather tend to leave it with the simple test from @MLewerenzHZB.

So my suggestion now is to do the following:

revert your branch in Include Content-Length response header #151 back to 88f869d,
merge master which includes the test from added response conformity check to TestingClient #153,
accept and merge the result.

If you are happy with this, I'd just do that.

paulmillar · 2024-02-12T22:21:04Z

Hi Rolf,

Sounds like a plan!

I've created a pull-request (see #156) that, I believe, matches your wishes.

paulmillar mentioned this issue Feb 7, 2024

Include Content-Length response header #151

Closed

paulmillar mentioned this issue Feb 7, 2024

Add framework to allow test-specific assertions on HTTP response. #154

Closed

RKrahl added the enhancement New feature or request label Feb 12, 2024

RKrahl added this to the 2.1.0 milestone Feb 12, 2024

RKrahl mentioned this issue Feb 13, 2024

Include Content-Length response header #156

Merged

RKrahl linked a pull request Feb 13, 2024 that will close this issue

Include Content-Length response header #156

Merged

RKrahl closed this as completed in #156 Feb 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Include Content-Length response on getData with single file #155

Include Content-Length response on getData with single file #155

paulmillar commented Feb 7, 2024

paulmillar commented Feb 7, 2024

paulmillar commented Feb 7, 2024

paulmillar commented Feb 7, 2024

RKrahl commented Feb 12, 2024 •

edited

Loading

paulmillar commented Feb 12, 2024

Include Content-Length response on getData with single file #155

Include Content-Length response on getData with single file #155

Comments

paulmillar commented Feb 7, 2024

paulmillar commented Feb 7, 2024

paulmillar commented Feb 7, 2024

paulmillar commented Feb 7, 2024

RKrahl commented Feb 12, 2024 • edited Loading

paulmillar commented Feb 12, 2024

RKrahl commented Feb 12, 2024 •

edited

Loading