To Projects

CSC 423 -- Lab 1

Use the ExamSco and NIST Examples to help you write your SAS and/or R code. The blood pressure of fourteen persons is measured, both in standing and supine position. The blood pressure readings are recorded in the datasets blood-pressure1.txt and blood-pressure2.txt. The SAS and R scripts perform the following tasks:

  1. Create datasets named bp1 and bp2 that contain the data from the input files blood-pressure1.txt and blood-pressure2.txt, respectively. If you are using SAS, define labels for the variables in each dataset.
     
  2. Print each dataset in 1.
     
  3. For bp1, obtain these univariate statistics:
     

    If you are using SAS, compute these statistics using both proc means and proc univariate.
    If you are using R, use these functions: mean, sd, quantile, and IQR.
     
    Note: if you perform the percentile calculations in both SAS and R, you will see that different percentile values are obtained.  Don't worry about these differences. SAS has 5 different percentile calculation methods; R has 9 quantile calculation methods.
     
  4. (Optional) If you are using SAS, repeat the analyses in Problem 3 using bp2.  Skip this problem if you are using R.
     
  5. Using bp1, create three histograms of the Standing and Supine columns, one histogram with the default width, one histogram with bins larger than the default (less bins than the default), and one histogram with bins smaller than the default (more bins than the default). Note that the default histogram has bins with midpoints at 60, 180, 300, 420, 540 for Standing, so you could use 0 to 600 for the range of the histogram.
     
  6. Using bp1, create normal plots for the Standing and Supine columns.
     
  7. Using bp2, create side-by-side boxplots for Press using the independent variable Pos.
     
  8. Find 95% confidence intervals of the true pressures of the populations represented in bp1.
     
  9. Delete any bad observations from bp1.  Then perform a paired two-sample t-test using the "cleaned" dataset bp1.
     
  10. Delete any bad observations from bp2.  Then perform an independent two-sample t-test using the "cleaned" dataset bp1.
     

Tips and Hints for Lab1

  1. Set your Windows File Explorer to show file extensions. See the very bottom announcement on the Announcements Page.
     
  2. SAS: use labels for your variables to increase the readability of the output. A label is an expanded, human readable explanation of a variable. For example:
    Labels will also be applied to graphs.
  3. R: use the cat function to provide explanations in your output. cat is more flexible function than print for printing captions and explanations. You can include new line tokens \n in what is printed for example:
  4. SAS: when using proc univariate to produce histograms or normal plots, you can suppress the usual printed output with the noprint option if you only want the plots.
     
  5. R: when creating graphs, use the xlab and ylab arguments to customize the x and y axis labels.
     
  6. R: use the t.ttest function to obtain a confidence interval for the population mean.
     
  7. SAS: use the bp1 dataset with one column for each treatment group for the paired-sample t-test; use the bp2 dataset with a categorical variable that labels the treatment groups for the independent two-sample t-test.
     
  8. R: use the bp1 dataset for both the paired-sample t-test and the independent two-sample t-test.
     
  9. SAS: to export the graphs to include in your Word document for Project 1, right click on the graph in the Results Viewer and select Save Picture.
     
  10. R: to create a PNG file with a single graph that can be included in your Word document for Project 1, use the R functions png and dev.off. For example: