-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error downloading eswiki dumps #19
Comments
At first sight, this is a problem related with the function that validates the MD5 checksum of downloaded files. In this way, the program can double-check that the file contents actually matches the original file on the server. For some reason, the function that calculates the MD5 checksum of the downloaded file is failing and throws the exception in your error. It's strange, since I've just parsed few weeks ago the same dump without any issue. ¿How much RAM do you have? The size of the last file is 4.6 GB and you may run out of memory when it tries to load it to calculate the MD5 checksum. |
Hello, I have 8GB RAM.
I will try to run again this process tonight after a reboot in order to have the memory as empty as possible. Thank you very much for your fast reply. |
Great, please let me know about the results. It might be difficult to replicate the bug in our systems without identifying first the possible cause behind this error. |
I ran again 3 times on my machine and I had the same problem. Later I ran it on a more powerful server and It ran that part successfully. So, I think that this issue is related to the memory as you said. |
Hello,
I'm trying to download eswiki dumps but I got the following error:
OverflowError: unbounded read returned more bytes than a Python string can hold
This is the whole trace:
Do you have any clue?
Thanks
The text was updated successfully, but these errors were encountered: