GitHub - sukalpomitra/AccelerometerDataAnalysis: Raw to tidy data using R code (http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones)

title	subtitle	author	mode
ReadMe		Sukalpo Mitra	selfcontained

The instruction list

Step 1 - take the zip downloaded from coursera, and extract it.
Step 2 - Copy the folder UCI HAR Dataset with all its content and paste it in your R working directory
Step 3 - Open up R version 3.1.2 and install the dplyr package by typing install.packages("dplyr")
Step 4 - Load the package by typing library(dplyr)
Step 5 - Source the run_analysis function by typing source(filelocation) where filelocation is the location where the run_analysis.R is kept along with the filename. For e.g:- if run_analysis.R is kept at C drive then file location should be "C:/run_analysis.R"
Step 6 - Run run_analysis function by typing run_analysis()

Explanation of the code

Step 1 - The program checks whether the directory UCI HAR Dataset directory exists in the working directory and whether X_train.txt,y_train.txt,subject_train.txt files exist under the train folder in UCI HAR Dataset directory. It also checks whether X_test.txt,y_test.txt,subject_test.txt files exist under the test folder in UCI HAR Dataset directory. If not the program stops and shows the validation error message.
Step 2 - Once the validation passes all the files are first read and the data are stored as data frames.
Step 3 - The data in y_train.txt is added as a new column to the data read from X_train.txt. The column is named as activity
Step 4 - The data in subject_train.txt is added as a new column to the data read from X_train.txt. The column is named as subject.
Step 5 - The data in y_test.txt is added as a new column to the data read from X_test.txt. The column is named as activity
Step 6 - The data in subject_test.txt is added as a new column to the data read from X_test.txt. The column is named as subject
Step 7 - Both the datasets read from X_test.txt and X_train.txt are then joined together
Step 8 - The columns having the mean and standar deviation of the measures are then extracted from the merged dataset
Step 9 - The activity column is then transformed to a factor using levels and labels from activity_labels.txt
Step 10 - The columns of the merged dataset are then given meaningful names
Step 11 - The merged dataset is then grouped by Activity and Subject column
Step 12 - The grouped by dataset is then summarized by calculating averages on the measure columns and written to a txt file called "tidydataset.txt" and is placed in the working directory.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
CodeBook.md		CodeBook.md
ReadMe.md		ReadMe.md
run_analysis.R		run_analysis.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The instruction list

Explanation of the code

About

Releases

Packages

Languages

sukalpomitra/AccelerometerDataAnalysis

Folders and files

Latest commit

History

Repository files navigation

The instruction list

Explanation of the code

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages