Skip to content

Commit

Permalink
Built site for gh-pages
Browse files Browse the repository at this point in the history
  • Loading branch information
ZokszY committed Jul 30, 2024
1 parent 4312fd4 commit 1fddea9
Show file tree
Hide file tree
Showing 17 changed files with 147 additions and 108 deletions.
2 changes: 1 addition & 1 deletion .nojekyll
Original file line number Diff line number Diff line change
@@ -1 +1 @@
2057fe92
936ff0ec
Binary file modified Internship-Report.pdf
Binary file not shown.
Binary file modified diagrams/Modifed_AMF_GD_YOLOv8.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 1 addition & 1 deletion index.html
Original file line number Diff line number Diff line change
Expand Up @@ -239,7 +239,7 @@ <h1 class="title">Internship Report</h1>

<section id="abstract" class="level1 unnumbered unlisted">
<h1 class="unnumbered unlisted">Abstract</h1>
<p>The goal of the internship was to study the combination of LiDAR point clouds and aerial images in a deep learning model to identify individual trees, and in particular those covered by other trees. To do this, I modified a model capable of merging LiDAR and RGB data to feed it with more information about the geometry below the canopy surface. This required to create my own tree dataset, using publicly available data from the Netherlands. A few interesting results emerged, but due to the dataset being too small, I couldn’t really draw conclusions about potential improvements of this new pipeline. Therefore, this new pipeline should be evaluated on a larger dataset to precisely determine its influence on the results.</p>
<p>The goal of the internship was to study the combination of LiDAR point clouds and aerial images in a deep learning model to identify individual trees, and in particular those covered by other trees. To do this, I modified a model capable of merging LiDAR and RGB data to feed it with more information about the geometry below the canopy surface. This required to create my own tree dataset, using publicly available data from the Netherlands. A few interesting results emerged and the model proved its ability to quickly learn to find large and medium trees, even with a small training dataset. However, this new pipeline should be evaluated on a larger dataset to precisely determine the influence of the modifications on the performance regarding small and covered trees.</p>
<p>The source code for this report can be found <a href="https://github.com/ZokszY/Geodan-internship-report">here</a><a href="#fn1" class="footnote-ref" id="fnref1" role="doc-noteref"><sup>1</sup></a> and the online report can be found <a href="https://zokszy.github.io/Geodan-internship-report">here</a><a href="#fn2" class="footnote-ref" id="fnref2" role="doc-noteref"><sup>2</sup></a>.</p>


Expand Down
4 changes: 2 additions & 2 deletions qmd-files/conclusion.html
Original file line number Diff line number Diff line change
Expand Up @@ -211,8 +211,8 @@ <h1 class="title">Conclusion</h1>


<p>All in all, the results of this internship are interesting and promising, even if not decisive.</p>
<p>Regarding datasets, the new dataset that was created is promising for several reasons. First, its spatial extent can easily be extended since the raw data that is used is publicly available and covers the whole of the Netherlands. Then, its quality will also surely keep increasing in the future with the different iterations of the images and the point clouds, at the cost of small annotations modifications to update the dataset with new and cut-off trees, as well as tree growth. The diversity of trees and environments from the Netherlands is obviously not even close from what we can find all around the earth, which wouldn’t make it a great dataset to train a global model, but it hat the potential to be a perfect playground for testing new methods. Finally, the main drawback of this dataset are the spatial and temporal shifts between each type of raw data. But this shift has at least proven to be manageable by the deep learning models that were trained here. Having this shift is also interesting because counting on having perfectly aligned RGB images and point cloud is even less likely than having both of them available in the first place.</p>
<p>Regarding the model, it is unclear whether having multiple layers of CHM really improves the results. This is because these layers would have the biggest impact in the detection of covered trees, which are a specific case that is harder than the other trees. And the training dataset was too small, the model overfitted quickly and could really reach the state when it start learning to find these harder trees. Therefore, more experiments on a bigger dataset, maybe using better augmentation techniques, would be required to get an answer. Besides that, the architecture in itself proved to provide great performance and is quickly able to learn to detect the medium and large trees. Some interesting improvements could easily be added to the model, such as the prediction of mask instead of bounding boxes, which only requires to change the detection heads, or the prediction of species. However, these changes would require the dataset to be substantially with species and precise delineations for all trees.</p>
<p>Regarding datasets, the new dataset that was created is promising for several reasons. First, its spatial extent can easily be extended since the raw data that is used is publicly available and covers the whole of the Netherlands. Then, its quality will also surely keep increasing in the future with the different iterations of the images and the point clouds, at the cost of small annotations modifications to update the dataset with new and cut-off trees, as well as tree growth. The diversity of trees and environments from the Netherlands is obviously not even close from what can be found globally, which wouldn’t make it a great dataset to train a global model, but it has the potential to be a perfect playground for testing new methods. Finally, the main drawback of this dataset are the spatial and temporal shifts between each type of raw data. But these shifts have at least proven to be manageable by the deep learning models that were trained here. Having these shifts is also interesting because counting on having perfectly aligned RGB images and point clouds is even less likely than having both of them available in the first place.</p>
<p>Regarding the model, it is unclear whether having multiple layers of CHM really improves the results. This is because these layers would have the biggest impact in the detection of covered trees, which are a specific case that is harder than the other trees. And since the training dataset was too small, the model overfitted quickly and could really reach the state when it start learning to find these harder trees. Therefore, more experiments on a bigger dataset, maybe using better augmentation techniques, would be required to get an answer. Besides that, the architecture in itself proved to provide great performance and is quickly able to learn to detect the medium and large trees. Some interesting improvements could easily be added to the model, such as the prediction of mask instead of bounding boxes, which only requires to change the detection heads, or the prediction of species. However, these changes would require the dataset to be substantially with species and precise delineations for all trees.</p>



Expand Down
Loading

0 comments on commit 1fddea9

Please sign in to comment.