
Workaround dist.apache.org download restrictions #752

Merged

Conversation

gerlowskija
Contributor

@gerlowskija gerlowskija commented Jan 23, 2025

After a "passing" RC, the wizard has RMs download the CRDs and helm charts from a 'staging' area on dist.apache.org and then upload them to the final location. We have scripts to do this, but these were broken recently when dist.apache.org changed its robots.txt to disallow unknown "crawlers".

This commit gets our scripting working again by tweaking a wget invocation to not strictly obey the robots.txt for dist.apache.org, which likely isn't intended for restricting foundation-internal usecases such as ours.
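For context, `wget` only consults robots.txt during recursive downloads, and the standard way to skip that check is the `-e robots=off` wgetrc override. A minimal sketch of the kind of invocation the fix describes; the staging URL is a placeholder, not the exact path or flags from the commit:

```bash
# Sketch: recursively fetch a single staging directory while ignoring
# robots.txt. "-e robots=off" applies the wgetrc setting that disables
# the robots.txt check; --no-parent keeps the crawl inside the one dir.
wget --recursive --no-parent --no-host-directories \
     -e robots=off \
     "https://dist.apache.org/repos/dist/dev/<project>/<rc-dir>/"
```

Since the restriction only applies to recursive fetches, direct single-file downloads from dist.apache.org are unaffected either way.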

After a "passing" RC,the wizard has RMs download the CRDs and helm
charts from a 'staging' area on dist.apache.org and then upload them to
the final location.  We have scripts to do this, but these were broken
recently when dist.apache.org changed its robots.txt to disallow unknown
"crawlers".

This commit gets our scripting working again by tweaking a `wget`
invocation to not strictly obey the robots.txt for dist.apache.org,
which likely isn't intended for restricting foundation-internal usecases
such as ours.
@gerlowskija gerlowskija merged commit 0ffddb6 into apache:main Jan 24, 2025
1 check passed
@gerlowskija gerlowskija deleted the ignore-robots-for-single-dir-downloads branch January 24, 2025 19:06
gerlowskija added a commit that referenced this pull request Jan 24, 2025
After a "passing" RC,the wizard has RMs download the CRDs and helm
charts from a 'staging' area on dist.apache.org and then upload them to
the final location.  We have scripts to do this, but these were broken
recently when dist.apache.org changed its robots.txt to disallow unknown
"crawlers".

This commit gets our scripting working again by tweaking a `wget`
invocation to not strictly obey the robots.txt for dist.apache.org,
which likely isn't intended for restricting foundation-internal usecases
such as ours.