From 9e747c7ba16e61aab2ceac361c34546c1e48874a Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Hynek=20Kydl=C3=AD=C4=8Dek?= <39408646+hynky1999@users.noreply.github.com> Date: Mon, 22 May 2023 21:23:49 +0200 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index ad5e1fd9..641f483c 100644 --- a/README.md +++ b/README.md @@ -102,6 +102,6 @@ Note that you can use the --n_proc option to specify the number of processes to ### Advanced usage The whole project was written with modularity in mind. That means that you -can adjust the framework to your needs. To know more check see [https://hynky1999.github.io/CmonCrawl/build/html/index.html](documentation) +can adjust the framework to your needs. To know more check see [https://hynky1999.github.io/CmonCrawl/build/html/usage.html](documentation) Instead of first getting the records and then extracting them, you can do both in a distributed setting. For more info look at [CZE-NEC](https://github.com/hynky1999/Czech-News-Classification-dataset) project.