Main purpose

The project aims to study the use of the feature space latent representation for the purpose of network monitoring. We trained different machine learning models to learn a latent representation that can be used both to detect similar devices and detect anomalies. The learning and deploying process is unsupervised and can be run using raw traffic pcaps.

Usage on new data using pretrained models

Download the checkpoint and place it in a new directory called checkpoints in the project root. Follow the instructions in lab/ to create the dataset from raw pcaps files and place .pkl files into dataset/. Open the jupyter notebook notebooks/VisualizationTool.ipynb to visualize the dataset. Use T-SNEVisualizer.ipynb to visualize the model output.

Project structure

/
│   lab.requirements.txt: pip packages required to start the lab
│   requirements.txt: pip packages required to start the training/prediction 
│
└───notebooks: 
│   │   CICIDS2017_T-SNE_Visualizer.ipynb: visualize and study the model predictions
│   │   CICIDS2017_VisualizationTool.ipynb: visualizes the dataset extracted
│   │   ... (the other notebooks can be ignored and may be removed by future commits)
│   
└───folder2
    |   cicids2017.py: grid search model selection and evaluation
    │   data_generator.py: extracts the data from ntopng backend database
    │   Seq2Seq.py: class modelling sequence to sequence autoencoders
    |   AnchoredTs2Vec.py: class modelling triplet loss based models
    |   ntopng_constants.py: mainly used to set ntopng features to be used by the models
    |   Callbacks.py: training callbacks
    |   AnomalyDetector.py: defines the class used to build contexts from dataframes(WindowedDataGenerator)
                            and the base class of our models implementing point-wise application features

Name		Name	Last commit message	Last commit date
Latest commit History 270 Commits
lab		lab
notebooks		notebooks
src		src
test		test
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
lab.requirements.txt		lab.requirements.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Main purpose

Usage on new data using pretrained models

Project structure

About

Releases

Packages

Languages

License

samuelesabella/Detecting-network-anomalies-using-the-feature-space-latent-representation

Folders and files

Latest commit

History

Repository files navigation

Main purpose

Usage on new data using pretrained models

Project structure

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages