04 January 2018

Data Sains

BY Amirzie

When you start working with a new dataset, the first thing you should do is look at it! What is the format of the data? What are the dimensions? What are the variable names? How are the variables stored? Are there missing data? Are there any flaws in the data?

The plants dataset is stored in a data frame. It is the default class for data read into R using functions like read.csv() and read.table().
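As a quick check of that claim, here is a minimal sketch that writes a tiny CSV to a temporary file and reads it back with read.csv(). The file contents and the name `toy` are made up for illustration and have nothing to do with the real plants data.

```r
# Hypothetical stand-in data: a two-row CSV written to a temporary file
tmp <- tempfile(fileext = ".csv")
writeLines(c("name,height", "fern,30", "moss,2"), tmp)

# read.csv() returns its result as a data frame
toy <- read.csv(tmp)
toy_class <- class(toy)
print(toy_class)

# Clean up the temporary file
unlink(tmp)
```

Calling class() on whatever you have just read in is a cheap way to confirm what kind of object you are holding before you go any further.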

Since the dataset is stored in a data frame, we know it is rectangular. In other words, it has two dimensions (rows and columns) and fits neatly into a table or spreadsheet. Use dim(plants) to see exactly how many rows and columns we're dealing with.


The first number you see (5166) is the number of rows (observations) and the second number (10) is the number of columns (variables).
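If you don't have the plants data at hand, the same checks can be tried on any data frame. The sketch below uses a small made-up data frame (the name `toy` and its columns are assumptions for illustration only) to show how dim() relates to nrow() and ncol(), along with a couple of other base R functions that answer the opening questions.

```r
# Hypothetical toy data frame, not the real plants data
toy <- data.frame(
  species   = c("fern", "moss", "oak"),
  height_cm = c(30, 2, NA)
)

# dim() returns rows first, then columns
dims <- dim(toy)
print(dims)

# nrow() and ncol() give the same two numbers separately
n_rows <- nrow(toy)
n_cols <- ncol(toy)

# Variable names and a count of missing values
var_names <- names(toy)
n_missing <- sum(is.na(toy))

# Compact overview of how each variable is stored
str(toy)
```

Here dims[1] matches nrow(toy) and dims[2] matches ncol(toy), which is the same reading as the 5166 and 10 reported for plants.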

