Friday, October 3, 2014

R Statistical Software Basics – Descriptive Statistics - Skewness

The practice sheet titled StatisticMarks Data.csv downloaded from Link. – Download Sheet

About the Data Sheet - The data in this sheet is related to marks scored by 100 Students in a Statistical Test.

Based on the data, we will use R Software Statistical functions to analyze the descriptive statistics.

In the Data Sheet, we have Data from A2:A101, A1 being the header of the Data. I have stored the StatisticMarks.csv file in Working Directory on my Desktop.

setwd("C:/Users/Rajesh Prabhakar/Desktop/R")

For inputting or reading Data from “StatisticMarks.csv” file, R Command would be

StatMarks=read.csv("StatisticMarks.csv")

Skewness
Describe asymmetry from the normal distribution in a set of statistical data. Skewness can come in the form of "negative skewness" or "positive skewness", depending on whether data points are skewed to the left (negative skew) or to the right (positive skew) of the data average.
In R Statistical Software, Skewness is represented by function “skewness”. 

For skewness Load e1071 package or PerformanceAnalytics Package

skewness(   )

In the Data Sheet, we have Data from A2:A101, A1 being the header of the Data titled StatisticsMarks. 

skewness(StatMarks$StatisticsMarks)

StatMarks is the name of the variable in which we stored the data followed by $ sign and column header of the Data i.e. StatisticsMarks.

Remember the title of the column should be exactly same including the large caps & small caps or else it will give error.

In R the file names, column headers and row headers should exactly match the same or else the function will give errors

The result of this function in R Console is
skewness(StatMarks$StatisticsMarks)
[1] -0.8221308

Result is negatively skewed.

No comments:

Post a Comment