Details

  1. Due by the start of next class.
  2. Turn-in through the link on Blackboard.
  3. Use diamonds data.

Part 1: PROC CONTENTS

PROC CONTENTS is a useful procedure to understand the structure of your data. It is very similar to the str() function in R.

Run PROC CONTENTS on the data. Screenshot the image.

  1. What type of data is the variable carat and price?

Part 2: PROC MEANS

Run PROC MEANS on the data only for the variables carat and price. Use the default output.

  1. What is the average, minimum, and maximum carat size in the data?
  2. What is the average, minimum, and maximum diamond price in the data?

Part 3: PROC UNIVARIATE

Produce a historgram for carat and diamond.

  1. Are the data normally distributed?

Note: X-Y analyses require data to be normally distributed. We will normalize the data later in this class.

Submission:

Your submission should include:

  1. a Word document with the PROC CONTENTS screenshot and answers to the questions above

Upload to Blackboard!