diff --git a/README.md b/README.md index c9758f2..6231bb0 100644 --- a/README.md +++ b/README.md @@ -1,3 +1,18 @@ -# DTW-C++ +DTW-C++ +=========================== +[![Ubuntu unit](https://github.com/Battery-Intelligence-Lab/dtw-cpp/workflows/Ubuntu%20unit/badge.svg)](https://github.com/Battery-Intelligence-Lab/dtw-cpp/actions) +[![macOS unit](https://github.com/Battery-Intelligence-Lab/dtw-cpp/workflows/macOS%20unit/badge.svg)](https://github.com/Battery-Intelligence-Lab/dtw-cpp/actions) +[![Windows unit](https://github.com/Battery-Intelligence-Lab/dtw-cpp/workflows/Windows%20unit/badge.svg)](https://github.com/Battery-Intelligence-Lab/dtw-cpp/actions) +![Website](https://img.shields.io/website?url=https%3A%2F%2FBattery-Intelligence-Lab.github.io%2Fdtw-cpp%2F) +[![codecov](https://codecov.io/gh/Battery-Intelligence-Lab/dtw-cpp/branch/main/graph/badge.svg?token=K739SRV4QG)](https://codecov.io/gh/Battery-Intelligence-Lab/dtw-cpp) -DTW-C++ is a dynamic time warping (DTW) based clustering library in C++. The user can input multiple time series (potentially of variable lengths) and the number of desired clusters if known or a range of possible cluster numbers if not known. DTW-C++ can cluster the time series using k-Medoids or mixed integer programming (MIP). k-Medoids is generally quicker but may be subject to sticking in local optima, whereas MIP can find globally optimal clusters. ANY DEPENDANCIES? + +![GitHub all releases](https://img.shields.io/github/downloads/Battery-Intelligence-Lab/dtw-cpp/total) +[![](https://img.shields.io/badge/license-BSD--3--like-5AC451.svg)](https://github.com/Battery-Intelligence-Lab/dtw-cpp/blob/main/LICENSE) + +This `readme.md` gives a summary; the detailed documentation can be found [here](https://Battery-Intelligence-Lab.github.io/dtw-cpp/). +If you are affected by the recent change to the main branch, please switch to the [dtw-cpp_v0.0.2](https://github.com/Battery-Intelligence-Lab/dtw-cpp/tree/dtwc_0_0_2) branch. + +Introduction +=========================== +DTW-C++ is a dynamic time warping (DTW) and clustering library, written in C++, for time series data. The user can input multiple time series (potentially of variable lengths), and the number of desired clusters (if known), or a range of possible cluster numbers (if the specific number is not known). DTW-C++ can cluster time series data using k-medoids or mixed integer programming (MIP); k-medoids is generally quicker, but may get stuck in local optima, whereas MIP can find globally optimal clusters.
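For readers new to DTW, the sketch below shows the textbook dynamic-programming recurrence behind the distance measure that DTW-C++ clusters on. It is a minimal, self-contained illustration written for this review; the function and variable names are assumptions and it does not use the DTW-C++ API.

```cpp
// Minimal illustration of the classic DTW recurrence (not DTW-C++ code):
// D[i][j] = cost(x_i, y_j) + min(D[i-1][j], D[i][j-1], D[i-1][j-1])
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <iostream>
#include <limits>
#include <vector>

double dtw_distance(const std::vector<double>& x, const std::vector<double>& y)
{
  const std::size_t n = x.size(), m = y.size();
  const double inf = std::numeric_limits<double>::infinity();

  // (n+1) x (m+1) accumulated-cost table; D[0][0] = 0, borders start at infinity.
  std::vector<std::vector<double>> D(n + 1, std::vector<double>(m + 1, inf));
  D[0][0] = 0.0;

  for (std::size_t i = 1; i <= n; ++i)
    for (std::size_t j = 1; j <= m; ++j) {
      const double cost = std::abs(x[i - 1] - y[j - 1]); // local distance
      D[i][j] = cost + std::min({D[i - 1][j], D[i][j - 1], D[i - 1][j - 1]});
    }
  return D[n][m]; // total alignment cost between the two series
}

int main()
{
  // Two series of different lengths; DTW still aligns them.
  const std::vector<double> a{0, 1, 2, 3, 2, 1, 0};
  const std::vector<double> b{0, 0, 1, 2, 3, 3, 2, 1, 0};
  std::cout << "DTW distance: " << dtw_distance(a, b) << '\n';
}
```

The k-medoids and MIP options described above then cluster on the matrix of such pairwise distances.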
diff --git a/joss/paper.bib b/joss/paper.bib index 6eaa4b7..4758a18 100644 --- a/joss/paper.bib +++ b/joss/paper.bib @@ -1,156 +1,3 @@ - -@article{reniers2019review, - title={Review and performance comparison of mechanical-chemical degradation models for lithium-ion batteries}, - author={Reniers, Jorn M and Mulder, Grietus and Howey, David A}, - journal={Journal of The Electrochemical Society}, - Doi = {10.1149/2.0281914jes}, - volume={166}, - number={14}, - pages={A3189}, - year={2019}, - publisher={IOP Publishing} -} - -@article{reiners2022digital, - title={Digital twin of a MWh-scale grid battery system for efficiency and degradation analysis}, - author={Reniers, Jorn M and Howey, David A}, - year={2022}, -} - - -@article{kumtepeli2020energy, - title={Energy arbitrage optimization with battery storage: 3D-MILP for electro-thermal performance and semi-empirical aging models}, - author={Kumtepeli, Volkan and Hesse, Holger C and Schimpe, Michael and Tripathi, Anshuman and Wang, Youyi and Jossen, Andreas}, - journal={IEEE Access}, - volume={8}, - pages={204325--204341}, - year={2020}, - publisher={IEEE} -} - -@inproceedings{naumann2017simses, - title={Simses: Software for techno-economic simulation of stationary energy storage systems}, - author={Naumann, Maik and Truong, Cong Nam and Schimpe, Michael and Kucevic, Daniel and Jossen, Andreas and Hesse, Holger C}, - booktitle={International ETG Congress 2017}, - pages={1--6}, - year={2017}, - organization={VDE} -} - -@article{moller2022simses, - title={SimSES: A holistic simulation framework for modeling and analyzing stationary energy storage systems}, - author={M{\"o}ller, Marc and Kucevic, Daniel and Collath, Nils and Parlikar, Anupam and Dotzauer, Petra and Tepe, Benedikt and Englberger, Stefan and Jossen, Andreas and Hesse, Holger}, - journal={Journal of Energy Storage}, - volume={49}, - pages={103743}, - year={2022}, - publisher={Elsevier} -} - -@article{tranter2022liionpack, - title={liionpack: A Python package for simulating packs of batteries with PyBaMM}, - author={Tranter, Thomas and Timms, Robert and Sulzer, Valentin and Planella, Ferran and Wiggins, Gavin and Karra, Suryanarayana and Agarwal, Priyanshu and Chopra, Saransh and Allu, Srikanth and Shearing, Paul and others}, - journal={Journal of Open Source Software}, - volume={7}, - number={70}, - year={2022}, - publisher={The Open Journal} -} - -@article{howey2019tools, - title={Tools for battery health diagnostics and prediction}, - author={Howey, David A}, - journal={The Electrochemical Society Interface}, - volume={28}, - number={1}, - pages={55}, - year={2019}, - publisher={IOP Publishing} -} - -@article{Pearson:2017, - Adsnote = {Provided by the SAO/NASA Astrophysics Data System}, - Adsurl = {http://adsabs.harvard.edu/abs/2017arXiv170304627P}, - Archiveprefix = {arXiv}, - Author = {{Pearson}, S. and {Price-Whelan}, A.~M. and {Johnston}, K.~V.}, - Eprint = {1703.04627}, - Journal = {ArXiv e-prints}, - Keywords = {Astrophysics - Astrophysics of Galaxies}, - Month = mar, - Title = {{Gaps in Globular Cluster Streams: Pal 5 and the Galactic Bar}}, - Year = 2017} - -@book{Binney:2008, - Adsnote = {Provided by the SAO/NASA Astrophysics Data System}, - Adsurl = {http://adsabs.harvard.edu/abs/2008gady.book.....B}, - Author = {{Binney}, J. 
and {Tremaine}, S.}, - Booktitle = {Galactic Dynamics: Second Edition, by James Binney and Scott Tremaine.~ISBN 978-0-691-13026-2 (HB).~Published by Princeton University Press, Princeton, NJ USA, 2008.}, - Publisher = {Princeton University Press}, - Title = {{Galactic Dynamics: Second Edition}}, - Year = 2008} - -@article{zenodo, - Abstractnote = {

Gala is a Python package for Galactic astronomy and gravitational dynamics. The bulk of the package centers around implementations of gravitational potentials, numerical integration, and nonlinear dynamics.

}, - Author = {Adrian Price-Whelan and Brigitta Sipocz and Syrtis Major and Semyeong Oh}, - Date-Modified = {2017-08-13 14:14:18 +0000}, - Doi = {10.5281/zenodo.833339}, - Month = {Jul}, - Publisher = {Zenodo}, - Title = {adrn/gala: v0.2.1}, - Year = {2017}, - Bdsk-Url-1 = {http://dx.doi.org/10.5281/zenodo.833339}} - -@ARTICLE{gaia, - author = {{Gaia Collaboration} and {Prusti}, T. and {de Bruijne}, J.~H.~J. and - {Brown}, A.~G.~A. and {Vallenari}, A. and {Babusiaux}, C. and - {Bailer-Jones}, C.~A.~L. and {Bastian}, U. and {Biermann}, M. and - {Evans}, D.~W. and et al.}, - title = "{The Gaia mission}", - journal = {\aap}, -archivePrefix = "arXiv", - eprint = {1609.04153}, - primaryClass = "astro-ph.IM", - keywords = {space vehicles: instruments, Galaxy: structure, astrometry, parallaxes, proper motions, telescopes}, - year = 2016, - month = nov, - volume = 595, - eid = {A1}, - pages = {A1}, - doi = {10.1051/0004-6361/201629272}, - adsurl = {http://adsabs.harvard.edu/abs/2016A%26A...595A...1G}, - adsnote = {Provided by the SAO/NASA Astrophysics Data System} -} - -@ARTICLE{astropy, - author = {{Astropy Collaboration} and {Robitaille}, T.~P. and {Tollerud}, E.~J. and - {Greenfield}, P. and {Droettboom}, M. and {Bray}, E. and {Aldcroft}, T. and - {Davis}, M. and {Ginsburg}, A. and {Price-Whelan}, A.~M. and - {Kerzendorf}, W.~E. and {Conley}, A. and {Crighton}, N. and - {Barbary}, K. and {Muna}, D. and {Ferguson}, H. and {Grollier}, F. and - {Parikh}, M.~M. and {Nair}, P.~H. and {Unther}, H.~M. and {Deil}, C. and - {Woillez}, J. and {Conseil}, S. and {Kramer}, R. and {Turner}, J.~E.~H. and - {Singer}, L. and {Fox}, R. and {Weaver}, B.~A. and {Zabalza}, V. and - {Edwards}, Z.~I. and {Azalee Bostroem}, K. and {Burke}, D.~J. and - {Casey}, A.~R. and {Crawford}, S.~M. and {Dencheva}, N. and - {Ely}, J. and {Jenness}, T. and {Labrie}, K. and {Lim}, P.~L. and - {Pierfederici}, F. and {Pontzen}, A. and {Ptak}, A. and {Refsdal}, B. and - {Servillat}, M. and {Streicher}, O.}, - title = "{Astropy: A community Python package for astronomy}", - journal = {\aap}, -archivePrefix = "arXiv", - eprint = {1307.6212}, - primaryClass = "astro-ph.IM", - keywords = {methods: data analysis, methods: miscellaneous, virtual observatory tools}, - year = 2013, - month = oct, - volume = 558, - eid = {A33}, - pages = {A33}, - doi = {10.1051/0004-6361/201322068}, - adsurl = {http://adsabs.harvard.edu/abs/2013A%26A...558A..33A}, - adsnote = {Provided by the SAO/NASA Astrophysics Data System} -} - @article{Aghabozorgi2015, abstract = {Clustering is a solution for classifying enormous data when there is not any early knowledge about classes. With emerging new concepts like cloud computing and big data and their vast applications in recent years, research works have been increased on unsupervised solutions like clustering algorithms to extract knowledge from this avalanche of data. Clustering time-series data has been used in diverse scientific areas to discover patterns which empower data analysts to extract valuable information from complex and massive datasets. In case of huge datasets, using supervised classification solutions is almost impossible, while clustering can solve this problem using un-supervised approaches. In this research work, the focus is on time-series data, which is one of the popular data types in clustering problems and is broadly used from gene expression data in biology to stock market analysis in finance. 
This review will expose four main components of time-series clustering and is aimed to represent an updated investigation on the trend of improvements in efficiency, quality and complexity of clustering time-series approaches during the last decade and enlighten new paths for future works.}, author = {Saeed Aghabozorgi and Ali Seyed Shirkhorshidi and Teh Ying Wah}, @@ -224,3 +71,73 @@ @misc{UCRArchive2018 month = {October}, note = {\url{https://www.cs.ucr.edu/~eamonn/time_series_data_2018/}} } + +@article{Sakoe1978, + abstract = {This paper reports on an optimum dynamic programming (DP) based time-normalization algorithm for spoken word recognition. First, a general principle of time-normalization is given using time-warping function. Then, two time-normalized distance definitions, called symmetric and asymmetric forms, are derived from the principle. These two forms are compared with each other through theoretical discussions and experimental studies. The symmetric form algorithm superiority is established. A new technique, called slope constraint, is successfully introduced, in which the warping function slope is restricted so as to improve discrimination between words in different categories. The effective slope constraint characteristic is qualitatively analyzed, and the optimum slope constraint condition is determined through experiments. The optimized algorithm is then extensively subjected to experimental comparison with various DP-algorithms, previously applied to spoken word recognition by different research groups. The experiment shows that the present algorithm gives no more than about two-thirds errors, even compared to the best conventional algorithm. © 1978 IEEE}, + author = {Hiroaki Sakoe and Seibi Chiba}, + doi = {10.1109/TASSP.1978.1163055}, + issn = {00963518}, + issue = {1}, + journal = {IEEE Transactions on Acoustics, Speech, and Signal Processing}, + pages = {43-49}, + title = {Dynamic Programming Algorithm Optimization for Spoken Word Recognition}, + volume = {26}, + year = {1978}, +} + +@article{Rajabi2020, + abstract = {Smart meters have been widely deployed in power networks since the last decade. This trend has resulted in an enormous volume of data being collected from the electricity customers. To gain benefits for various stakeholders in power systems, proper data mining techniques, such as clustering, need to be employed to extract the underlying patterns from energy consumptions. In this paper, a comparative study of different techniques for load pattern clustering is carried out. Different parameters of the methods that affect the clustering results are evaluated and the clustering algorithms are compared for two data sets. In addition, the two suitable and commonly used data size reduction techniques and feature definition/extraction methods for load pattern clustering are analysed. Furthermore, the existing studies on clustering of electricity customers are reviewed and the main results are highlighted. 
Finally, the future trends and major applications of clustering consumption patterns are outlined to inform industry practitioners and academic researchers to optimize smart meter operational use and effectiveness.}, + author = {Amin Rajabi and Mohsen Eskandari and Mojtaba Jabbari Ghadi and Li Li and Jiangfeng Zhang and Pierluigi Siano}, + doi = {10.1016/j.rser.2019.109628}, + issn = {18790690}, + journal = {Renewable and Sustainable Energy Reviews}, + keywords = {Clustering algorithms,Comparative study,Data mining,Load pattern,Smart grids,Smart meters}, + month = {3}, + publisher = {Elsevier Ltd}, + title = {A comparative study of clustering techniques for electrical load pattern segmentation}, + volume = {120}, + year = {2020}, +} + +@misc{Tavenard2020, + abstract = {tslearn is a general-purpose Python machine learning library for time series that offers tools for pre-processing and feature extraction as well as dedicated models for clustering, classification and regression. It follows scikit-learn's Application Programming Interface for transformers and estimators, allowing the use of standard pipelines and model selection tools on top of tslearn objects. It is distributed under the BSD-2-Clause license, and its source code is available at https://github.com/tslearn-team/tslearn.}, + author = {Romain Tavenard and Johann Faouzi and Gilles Vandewiele and Felix Divo and Guillaume Androz and Chester Holtz and Marie Payne and Roman Yurchak and Marc Rußwurm}, + journal = {Journal of Machine Learning Research}, + keywords = {classification,clustering,data mining,pre-processing,time series}, + pages = {1-6}, + title = {Tslearn, A Machine Learning Toolkit for Time Series Data}, + volume = {21}, + url = {https://github.com/tslearn-team/tslearn.}, + year = {2020}, +} + +@misc{Dau2018, + author = {Hoang Anh Dau and Eamonn Keogh and Kaveh Kamgar and Chin-Chia Michael Yeh and Yan Zhu and Shaghayegh Gharghabi and Chotirat Ann Ratanamahatana and Yanping and Bing Hu and Nurjahan Begum and Anthony Bagnall and Abdullah Mueen and Gustavo Batista}, + month = {10}, + title = {The UCR Time Series Classification Archive}, + url = {https://www.cs.ucr.edu/~eamonn/time_series_data_2018/}, + year = {2018}, +} + +@article{Huangfu2018, + abstract = {This paper introduces the design and implementation of two parallel dual simplex solvers for general large scale sparse linear programming problems. One approach, called PAMI, extends a relatively unknown pivoting strategy called suboptimization and exploits parallelism across multiple iterations. The other, called SIP, exploits purely single iteration parallelism by overlapping computational components when possible. Computational results show that the performance of PAMI is superior to that of the leading open-source simplex solver, and that SIP complements PAMI in achieving speedup when PAMI results in slowdown. One of the authors has implemented the techniques underlying PAMI within the FICO Xpress simplex solver and this paper presents computational results demonstrating their value. In developing the first parallel revised simplex solver of general utility, this work represents a significant achievement in computational optimization.}, + author = {Q. Huangfu and J. A.J. 
Hall}, + doi = {10.1007/s12532-017-0130-5}, + issn = {18672957}, + issue = {1}, + journal = {Mathematical Programming Computation}, + keywords = {Linear programming,Parallel computing,Revised simplex method}, + month = {3}, + pages = {119-142}, + publisher = {Springer Verlag}, + title = {Parallelizing the dual revised simplex method}, + volume = {10}, + year = {2018}, +} + +@misc{gurobi, + author = {{Gurobi Optimization, LLC}}, + title = {{Gurobi Optimizer Reference Manual}}, + year = 2023, + url = "https://www.gurobi.com" +} diff --git a/joss/paper.md b/joss/paper.md index f459e88..eae4fcb 100644 --- a/joss/paper.md +++ b/joss/paper.md @@ -32,9 +32,9 @@ We present an approach for computationally efficient dynamic time warping (DTW) # Statement of need -Clustering time series is becoming increasingly popular as data availability increases; however as the data avilability increases, so does the complexity of the clustering problem. Most time series clustering objectives currently depend on dimension reduction techniques or finding features from the time series which can induce bias into the clustering [@Aghabozorgi2015]. Dynamic time warping [@Sakoe1978DynamicRecognition] is a well-known technique for manipulating time series to enable comparisons between datasets, using local warping (stretching or compressing along the time axis) of the elements within each time series to find an optimal alignment between series. This emphasises the similarity of the shapes of the respective time series, rather than the exact alignment of specific features. Unfortunately, DTW does not scale well in computational speed as the length and number of time series to be compared increases---the computational complexity grows quadratically with the total number of data points. This complexity is a barrier to DTW being widely implemented in time series clustering [@Rajabi2020ASegmentation]. ``DTW-C++`` is written to handle large time series datasets, working on the raw data rather than reduced dimension data or selected features from the time series, across the various applications. +Clustering time series is becoming increasingly popular as data availability increases; however, as availability grows, so does the complexity of the clustering problem. Most time series clustering objectives currently depend on dimension reduction techniques or finding features from the time series, which can induce bias into the clustering [@Aghabozorgi2015]. Dynamic time warping [@Sakoe1978] is a well-known technique for manipulating time series to enable comparisons between datasets, using local warping (stretching or compressing along the time axis) of the elements within each time series to find an optimal alignment between series. This emphasises the similarity of the shapes of the respective time series, rather than the exact alignment of specific features. Unfortunately, DTW does not scale well in computational speed as the length and number of time series to be compared increases---the computational complexity grows quadratically with the total number of data points. This complexity is a barrier to DTW being widely implemented in time series clustering [@Rajabi2020]. ``DTW-C++`` is written to handle large time series datasets, working on the raw data rather than reduced dimension data or selected features from the time series, across the various applications.
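As a rough illustration of the quadratic scaling described above (generic reasoning, not figures from the paper): clustering $p$ series of typical length $n$ requires every pairwise DTW distance, so the total work grows as

\begin{equation}
\underbrace{\tfrac{1}{2}p(p-1)}_{\text{pairwise comparisons}} \; \times \; \underbrace{O(nm)}_{\text{one DTW table, } m \approx n} \;=\; O\!\left((pn)^{2}\right),
\end{equation}

i.e. quadratic in the total number of data points $pn$.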
-While there are other packages available for time series clustering using DTW, namely [@Petitjean2011] and [@meert2020wannesm], ``DTW-C++`` offers signficant imporvements in both speed and memory use, allowing larger datasets to be clustered. This is done by task level parallelisation, allowing multiple pairwise comparsions between time series to be evaluated simulataneously, as well as more efficient memory management by solving the DTW distance using only the preceding vector rather than storing the entire warping matrix. This means that the warping path between each time series is not stored, but this is not required for the clustering process - only the final cost is needed. In addition, MIP is preferable to other DTW clustering packages which use k-based methods for clustering, as k-based methods are suseptible to sticking in local optima. MIP finds the global optimum in most cases, and in the rare event that the global optimum is not found, the gap between the best solution found and the global optimum is given. +While there are other packages available for time series clustering using DTW, namely \texttt{DTAIDistance} [@meert2020wannesm] and \texttt{TSlearn} [@Tavenard2020], ``DTW-C++`` offers significant improvements in both speed and memory use, allowing larger datasets to be clustered. This is done by task-level parallelisation, allowing multiple pairwise comparisons between time series to be evaluated simultaneously, as well as more efficient memory management by solving the DTW distance using only the preceding vector rather than storing the entire warping matrix. This means that the warping path between each pair of time series is not stored; the path is not required for the clustering process, as only the final cost is needed. In addition, MIP is preferable to other DTW clustering packages which use k-based methods for clustering, as k-based methods are susceptible to getting stuck in local optima. MIP finds the global optimum in most cases, and in the rare event that the global optimum is not found, the gap between the best solution found and the global optimum is given. Time series clustering applications range from finding consumption patterns in energy, to detecting brain activity in medical applications, to discovering patterns in stock price trends in the finance industry. The target audience for this software can therefore range across multiple disciplines; it is intended for any user with a requirement for time-series clustering. @@ -43,7 +43,7 @@ Time series clustering applications range from energy to find consumption patter The current functionality of the software is as follows: -* Calculate DTW pairwise distances between time series, using a vector based approach to reduce memory use. There is also the option to use a Sakoe-Chiba band to restrict warping in the DTW distance calculation [@Sakoe1978DynamicRecognition]. This speeds up the computation time as well as being a useful constraint for some time series clustering scenarios (e.g., if an event must occur within a certain time window to be considered similar). +* Calculate DTW pairwise distances between time series, using a vector-based approach to reduce memory use. There is also the option to use a Sakoe-Chiba band to restrict warping in the DTW distance calculation [@Sakoe1978]. This speeds up the computation and is also a useful constraint for some time series clustering scenarios (e.g., if an event must occur within a certain time window to be considered similar).
 * Produce a distance matrix containing all pairwise comparisons between each time series in the dataset. * Split all time series into a predefined number of clusters, with a representative centroid time series for each cluster. This can be done using MIP or k-medoids clustering, depending on user choice. * Output the clustering cost, which is the sum of distances between every time series within each cluster and its cluster centroid. @@ -90,7 +90,7 @@ The DTW distance $C_{x,y}$ is found for each pairwise comparison. As shown in \r Using this matrix, ($D$), the time series can be split into ($k$) separate clusters with integer programming. The problem formulation begins with a binary square matrix $A^{p\times p}$, where $A_{ij}=1$ if time series ($j$) is a member of the $i$th cluster centroid, and 0 otherwise, as shown in \autoref{fig:A_matrix}. -![Example output from the clustering process, where an entry of 1 indicates that time series $j$ belongs to cluster with centroid $i$. \label{fig:A_matrix}](../media/cluster_matrix_formation4.svg) +![Example output from the clustering process, where an entry of 1 indicates that time series $j$ belongs to the cluster with centroid $i$. \label{fig:A_matrix}](../media/cluster_matrix_formation4.svg){ width=70% } As each centroid has to be in its own cluster, non-zero diagonal entries in $A$ represent centroids. In summary, the following constraints apply: @@ -118,13 +118,13 @@ The optimisation problem to solve, subject to the above constraints, is \begin{equation} A^\star = \min_{A} \sum_i \sum_j D_{ij} \times A_{ij}. \end{equation} -After solving this integer program, the non-zero diagonal entries of ($A$) represent the centroids, and the non-zero elements in the corresponding columns in ($A$) represent the members of that cluster. In the example in \autoref{fig:A_matrix}, the clusters are time series 1, **2**, 5 and 3, **4** with the bold time series being the centroids. +This integer program is solved using Gurobi [@gurobi] or HiGHS [@Huangfu2018]. After solving, the non-zero diagonal entries of ($A$) represent the centroids, and the non-zero elements in the corresponding columns of ($A$) represent the members of that cluster. In the example in \autoref{fig:A_matrix}, the clusters are time series 1, **2**, 5 and 3, **4**, with the bold time series being the centroids. Finding global optimality can increase the computation time, depending on the number of time series within the dataset and the DTW distances. Therefore, there is also a built-in option to cluster using k-medoids, as used in other packages such as \texttt{DTAIDistance} [@meert2020wannesm]. The k-medoids method is often quicker as it is an iterative approach; however, it is subject to getting stuck in local optima. The results in the next section show the timing and memory performance of both MIP clustering and k-medoids clustering using \texttt{DTW-C++} compared to other packages. # Comparison -We compared our approach with two other DTW clustering packages, \texttt{DTAIDistance} [@Meert2020Dtaidistance] and \texttt{TSlearn} [@Tavenard2020TslearnData]. The datasets used for the comparison are from the UCR Time Series Classification Archive [@Dau2018TheArchive], and consist of 128 time series datasets with up to 16,800 data series of lengths up to 2,844. The full results can be found in the Appendix. Benchmarking against \texttt{TSlearn} was stopped after the first 22 datasets because the results were consistently over 20 times slower than \texttt{DTW-C++}. 
\autoref{tab} shows the results for datasets downselected to have a number of time series ($N$) greater than 100 and a length of each time series greater than 500 points. This is because \texttt{DTW-C++} is aimed at larger datasets where the speed improvements are more relevant. +We compared our approach with two other DTW clustering packages, \texttt{DTAIDistance} [@meert2020wannesm] and \texttt{TSlearn} [@Tavenard2020]. The datasets used for the comparison are from the UCR Time Series Classification Archive [@Dau2018], and consist of 128 time series datasets with up to 16,800 data series of lengths up to 2,844. The full results can be found in the Appendix. Benchmarking against \texttt{TSlearn} was stopped after the first 22 datasets because the results were consistently over 20 times slower than \texttt{DTW-C++}. \autoref{tab} shows the results for datasets downselected to have a number of time series ($N$) greater than 100 and a length of each time series greater than 500 points. This is because \texttt{DTW-C++} is aimed at larger datasets where the speed improvements are more relevant.
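To make the memory-management and warping-window points above concrete, the "preceding vector" idea and the Sakoe-Chiba band can be sketched as follows. This is an independent illustration written for this review; the function name, the absolute-difference local cost, and the band handling are assumptions, not the DTW-C++ implementation.

```cpp
// Illustrative DTW using only the previous row of the cost table (O(m) memory
// instead of O(n*m)), plus an optional Sakoe-Chiba band of half-width `band`.
// Not taken from the DTW-C++ source; names and details are assumptions.
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <limits>
#include <utility>
#include <vector>

double dtw_banded(const std::vector<double>& x, const std::vector<double>& y,
                  long band = -1) // band < 0 means "no warping constraint"
{
  const std::size_t n = x.size(), m = y.size();
  const double inf = std::numeric_limits<double>::infinity();

  std::vector<double> prev(m + 1, inf), curr(m + 1, inf);
  prev[0] = 0.0; // D[0][0] = 0; all other border cells stay infinite

  for (std::size_t i = 1; i <= n; ++i) {
    std::fill(curr.begin(), curr.end(), inf);

    // With a band, only cells satisfying |i - j| <= band are evaluated.
    // For series of different lengths the band must be at least |n - m|.
    std::size_t j_lo = 1, j_hi = m;
    if (band >= 0) {
      const std::size_t w = static_cast<std::size_t>(band);
      j_lo = (i > w) ? i - w : 1;
      j_hi = std::min(m, i + w);
    }

    for (std::size_t j = j_lo; j <= j_hi; ++j) {
      const double cost = std::abs(x[i - 1] - y[j - 1]);
      curr[j] = cost + std::min({prev[j], curr[j - 1], prev[j - 1]});
    }
    std::swap(prev, curr); // the current row becomes the "preceding vector"
  }
  return prev[m]; // after the final swap, prev holds the last row of the table
}
```

Because only two rows are ever kept, the warping path itself cannot be recovered, which matches the point above that only the final cost is needed for clustering.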
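The constraint block referenced in the formulation section is not reproduced in this diff. For orientation only, a standard k-medoids integer-programming formulation consistent with the prose description (each series in exactly one cluster, exactly $k$ centroids, membership only to selected centroids) is the following hedged reconstruction, not an excerpt from the paper:

\begin{align}
\min_{A_{ij} \in \{0,1\}} \quad & \sum_{i=1}^{p} \sum_{j=1}^{p} D_{ij} A_{ij} \\
\text{subject to} \quad & \sum_{i=1}^{p} A_{ij} = 1 \quad \forall j, \\
& \sum_{i=1}^{p} A_{ii} = k, \\
& A_{ij} \le A_{ii} \quad \forall i, j.
\end{align}

In this form the objective matches the equation for $A^\star$ given above, and the diagonal entries $A_{ii}$ play the centroid-indicator role described in the text.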