Built site for gh-pages

ZokszY · Jul 30, 2024 · 1fddea9 · 1fddea9
1 parent 4312fd4
commit 1fddea9
Show file tree

Hide file tree

Showing 17 changed files with 147 additions and 108 deletions.
diff --git a/.nojekyll b/.nojekyll
@@ -1 +1 @@
-2057fe92
+936ff0ec
diff --git a/Internship-Report.pdf b/Internship-Report.pdf
diff --git a/diagrams/Modifed_AMF_GD_YOLOv8.png b/diagrams/Modifed_AMF_GD_YOLOv8.png
diff --git a/index.html b/index.html
@@ -239,7 +239,7 @@ <h1 class="title">Internship Report</h1>
 
 <section id="abstract" class="level1 unnumbered unlisted">
 <h1 class="unnumbered unlisted">Abstract</h1>
-<p>The goal of the internship was to study the combination of LiDAR point clouds and aerial images in a deep learning model to identify individual trees, and in particular those covered by other trees. To do this, I modified a model capable of merging LiDAR and RGB data to feed it with more information about the geometry below the canopy surface. This required to create my own tree dataset, using publicly available data from the Netherlands. A few interesting results emerged, but due to the dataset being too small, I couldn’t really draw conclusions about potential improvements of this new pipeline. Therefore, this new pipeline should be evaluated on a larger dataset to precisely determine its influence on the results.</p>
+<p>The goal of the internship was to study the combination of LiDAR point clouds and aerial images in a deep learning model to identify individual trees, and in particular those covered by other trees. To do this, I modified a model capable of merging LiDAR and RGB data to feed it with more information about the geometry below the canopy surface. This required to create my own tree dataset, using publicly available data from the Netherlands. A few interesting results emerged and the model proved its ability to quickly learn to find large and medium trees, even with a small training dataset. However, this new pipeline should be evaluated on a larger dataset to precisely determine the influence of the modifications on the performance regarding small and covered trees.</p>
 <p>The source code for this report can be found <a href="https://github.com/ZokszY/Geodan-internship-report">here</a><a href="#fn1" class="footnote-ref" id="fnref1" role="doc-noteref"><sup>1</sup></a> and the online report can be found <a href="https://zokszy.github.io/Geodan-internship-report">here</a><a href="#fn2" class="footnote-ref" id="fnref2" role="doc-noteref"><sup>2</sup></a>.</p>
 
 

diff --git a/qmd-files/conclusion.html b/qmd-files/conclusion.html
@@ -211,8 +211,8 @@ <h1 class="title">Conclusion</h1>
 
 
 <p>All in all, the results of this internship are interesting and promising, even if not decisive.</p>
-<p>Regarding datasets, the new dataset that was created is promising for several reasons. First, its spatial extent can easily be extended since the raw data that is used is publicly available and covers the whole of the Netherlands. Then, its quality will also surely keep increasing in the future with the different iterations of the images and the point clouds, at the cost of small annotations modifications to update the dataset with new and cut-off trees, as well as tree growth. The diversity of trees and environments from the Netherlands is obviously not even close from what we can find all around the earth, which wouldn’t make it a great dataset to train a global model, but it hat the potential to be a perfect playground for testing new methods. Finally, the main drawback of this dataset are the spatial and temporal shifts between each type of raw data. But this shift has at least proven to be manageable by the deep learning models that were trained here. Having this shift is also interesting because counting on having perfectly aligned RGB images and point cloud is even less likely than having both of them available in the first place.</p>
-<p>Regarding the model, it is unclear whether having multiple layers of CHM really improves the results. This is because these layers would have the biggest impact in the detection of covered trees, which are a specific case that is harder than the other trees. And the training dataset was too small, the model overfitted quickly and could really reach the state when it start learning to find these harder trees. Therefore, more experiments on a bigger dataset, maybe using better augmentation techniques, would be required to get an answer. Besides that, the architecture in itself proved to provide great performance and is quickly able to learn to detect the medium and large trees. Some interesting improvements could easily be added to the model, such as the prediction of mask instead of bounding boxes, which only requires to change the detection heads, or the prediction of species. However, these changes would require the dataset to be substantially with species and precise delineations for all trees.</p>
+<p>Regarding datasets, the new dataset that was created is promising for several reasons. First, its spatial extent can easily be extended since the raw data that is used is publicly available and covers the whole of the Netherlands. Then, its quality will also surely keep increasing in the future with the different iterations of the images and the point clouds, at the cost of small annotations modifications to update the dataset with new and cut-off trees, as well as tree growth. The diversity of trees and environments from the Netherlands is obviously not even close from what can be found globally, which wouldn’t make it a great dataset to train a global model, but it has the potential to be a perfect playground for testing new methods. Finally, the main drawback of this dataset are the spatial and temporal shifts between each type of raw data. But these shifts have at least proven to be manageable by the deep learning models that were trained here. Having these shifts is also interesting because counting on having perfectly aligned RGB images and point clouds is even less likely than having both of them available in the first place.</p>
+<p>Regarding the model, it is unclear whether having multiple layers of CHM really improves the results. This is because these layers would have the biggest impact in the detection of covered trees, which are a specific case that is harder than the other trees. And since the training dataset was too small, the model overfitted quickly and could really reach the state when it start learning to find these harder trees. Therefore, more experiments on a bigger dataset, maybe using better augmentation techniques, would be required to get an answer. Besides that, the architecture in itself proved to provide great performance and is quickly able to learn to detect the medium and large trees. Some interesting improvements could easily be added to the model, such as the prediction of mask instead of bounding boxes, which only requires to change the detection heads, or the prediction of species. However, these changes would require the dataset to be substantially with species and precise delineations for all trees.</p>