diff --git a/src/content/docs/guides/getting-started.md b/src/content/docs/guides/getting-started.md index e8b97ca..8b829f1 100644 --- a/src/content/docs/guides/getting-started.md +++ b/src/content/docs/guides/getting-started.md @@ -310,6 +310,6 @@ When analyzing the results from this, you can see the % of websites that use dif ![Simple JOIN Example](./type_summary_example_query2.png) -To explore more interactive examples, read the [Guided Tour](./guided-tour.md). +To explore more interactive examples, read the [Guided Tour](../guided-tour/). If you want to explore deeper you have everything you need - infrastructure, documentation, community. Enjoy exploring this data and feel free to share your results and ask questions on the [HTTP Archive Discuss section](https://discuss.httparchive.org/). diff --git a/src/content/docs/guides/guided-tour.mdx b/src/content/docs/guides/guided-tour.mdx index eb8b45f..0e00267 100644 --- a/src/content/docs/guides/guided-tour.mdx +++ b/src/content/docs/guides/guided-tour.mdx @@ -6,11 +6,11 @@ description: HTTP Archive data analysis in BigQuery The HTTP Archive contains a tremendous amount of information that can be used to understand the evolution of the web. And since the raw data is available in Google BigQuery, you can start digging into it with a minimal amount of setup! -If you are new to BigQuery, then the [Getting Started guide](./getting-started.md) will walk you through the basic setup. That guide ends with a sample query that explores MIME types from the `pages` tables. In this guide, we'll explore more of the tables and build additional queries that you can learn from. The easiest way to get started is by following along, testing some of the queries and learning from them. If you need any help then there is plenty of support available from the community at [https://discuss.httparchive.org](https://discuss.httparchive.org). +If you are new to BigQuery, then the [Getting Started guide](../getting-started/) will walk you through the basic setup. That guide ends with a sample query that explores MIME types from the `pages` tables. In this guide, we'll explore more of the tables and build additional queries that you can learn from. The easiest way to get started is by following along, testing some of the queries and learning from them. If you need any help then there is plenty of support available from the community at [https://discuss.httparchive.org](https://discuss.httparchive.org). **Prerequisites:** -- This guide assumes that you've completed the setup from the [Getting Started guide](./getting-started.md). +- This guide assumes that you've completed the setup from the [Getting Started guide](../getting-started/). - You would be safe processing extremely-large tables contained in this dataset if you follow the [minimizing query costs guide](/guides/minimizing-costs/). - It also assumes some familiarity with SQL. All of the examples provided will be using [Standard SQL](https://cloud.google.com/bigquery/docs/reference/standard-sql/). diff --git a/src/content/docs/guides/minimizing-costs.md b/src/content/docs/guides/minimizing-costs.md index dacd30c..fe41c12 100644 --- a/src/content/docs/guides/minimizing-costs.md +++ b/src/content/docs/guides/minimizing-costs.md @@ -29,6 +29,7 @@ WHERE client = 'desktop' AND rank = 1000 ``` + ::: ## Use `TABLESAMPLE` @@ -81,6 +82,7 @@ WHERE LIMIT 1 ``` + ::: ## Use the `sample_data` dataset @@ -97,4 +99,4 @@ BigQuery allows you to preview entire rows of a table without incurring a query To access the preview, click on a table name from the workspace explorer and select the **Preview** tab. -Note that generating the preview may be slow for tables with large payloads, like `response_bodies` or `pages`. Also note that the text values are truncated by default, so you will need to expand the field to get the full value. \ No newline at end of file +Note that generating the preview may be slow for tables with large payloads, like `response_bodies` or `pages`. Also note that the text values are truncated by default, so you will need to expand the field to get the full value. diff --git a/src/content/docs/guides/release-cycle.md b/src/content/docs/guides/release-cycle.md index 198e004..e696072 100644 --- a/src/content/docs/guides/release-cycle.md +++ b/src/content/docs/guides/release-cycle.md @@ -31,4 +31,4 @@ There isn't currently a way to be notified when a new crawl is available to quer The reports on the HTTP Archive website are automatically generated as soon as the BigQuery data is available. -Auxilliary reports like the [Core Web Vitals Technology Report](https://cwvtech.report/) are generated manually soon after the data becomes available. \ No newline at end of file +Auxilliary reports like the [Core Web Vitals Technology Report](https://cwvtech.report/) are generated manually soon after the data becomes available.