The iterative batch generation code is so strange to me in Chapter 6 04_arxiv preprocessing.py. #13

jeffacode · 2016-11-02T11:50:55Z

First, for i in range(0, len(text) - self.length + 1, self.max_length // 2):. I'm sorry, but what if len(text) is actually smaller than self.length(I assume it's the max_length)? And Why would I need to do this process?

Second, assert all(len(x) == len(windows[0]) for x in windows). Why do I need to make every text the same length?

Next, the following while True. Isn't it going to loop infinitely?

Last, batch = windows[i: i + self.batch_size]. I don't think last batch generated will be the same size as previous ones in first dimension.

Hope someone could answer my questions:)

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The iterative batch generation code is so strange to me in Chapter 6 04_arxiv preprocessing.py. #13

The iterative batch generation code is so strange to me in Chapter 6 04_arxiv preprocessing.py. #13

jeffacode commented Nov 2, 2016 •

edited

Loading

The iterative batch generation code is so strange to me in Chapter 6 04_arxiv preprocessing.py. #13

The iterative batch generation code is so strange to me in Chapter 6 04_arxiv preprocessing.py. #13

Comments

jeffacode commented Nov 2, 2016 • edited Loading

jeffacode commented Nov 2, 2016 •

edited

Loading