Skip to content

Fall2018-project5-sec1proj5-grp5 created by GitHub Classroom

Notifications You must be signed in to change notification settings

TZstatsADS/Fall2018-project5-sec1-grp5

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ADS Project 5:

Term: Fall 2018

Collaborative

  • Project summary In this project, we applied Alternating Least Square algorithm on Spark platform to provide movie recommendations to clients. We obtained data from 'Movielens', it has over 27 million ratings and more than 58 thousand movies. MovieLens

  • Main Idea Collaborative

  • General Idea Collaborative

  • Basic Algorithm (Alternating Least Square) Collaborative The running time for our model is 97.435 second.

  • Example In this recomendation system, clients are allowed to input their concerned movies and corresponding ratings. For example, there is a user 0.

Collaborative

The running time for retrain our model is 97.435 second.

This is the actual recommendation for user 0. Collaborative

  • Performance The RMSE for our model is 0.8188. (Similar competitions were hosted by Netflix and their best RMSE is 0.9514.

  • Conclusion Taking advantage of cloud computing. Model has the capability of adding new user/rating and re-train in a very short period of time. Going forward, with more computational power, we can further reduce the computing time and hopefully close to real time

  • Enviornment:

  • emr - 5.19.0, Spark 2.3.2 on Hadoop 2.8.5 YARN with Ganglia 3.7.2 and Zeppelin 0.8.0

  • master: m5.4xlarge

  • slaves: 3 * m5.4xlarge

Contribution statement: (default) All team members are major contributors in this project. All team members approve our work presented in this GitHub repository including this contributions statement.

Following suggestions by RICH FITZJOHN (@richfitz). This folder is orgarnized as follows.

proj/
├── lib/
├── data/
├── doc/
├── figs/
└── output/

Please see each subfolder for a README file.

About

Fall2018-project5-sec1proj5-grp5 created by GitHub Classroom

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •