SAS Examples

Home
Course Schedule
SAS examples
SAS instructions
SAS Handouts
Lecture Notes
COL Course website

Note: To run the SAS code files, you should save them on your computer as .sas files.

 

Week 10 - Comparison of multiple groups - GLM procedure.

Plant waste: The data refer to five suppliers of the Levi-Strauss clothing manufacturing plant in Albuquerque. The firm's quality control department collects weekly data on percentage waste (run-up) relative to what can be achieved by computer layouts of patterns on cloth. It is possible to have negative values, which indicate that the plant employees beat the computer in controlling waste. Under questions, are the difference among the 5 plants. SAS code plantwaste.sas - Data set: plantwaste.txt.

 

Week 9 - Comparison of multiple groups - ANOVA model.

Heart rate data: How is heart rate affected by physical activity? The more work your body is doing the higher your heart rate will be. An experiment was conducted in the fall of 1993 to explore the nature of the relationship between a person's heart rate and the frequency at which that person stepped up and down on steps of various heights. Both frequency of stepping and height of the step were expected to affect heart rate. See lecture notes. SAS code heartrate.sas - Data set: stepping.txt.

 

Flow rates data: The data were collected in a study comparing the variability in air velocity through clean-room air filters to the variability stated in the purchaser’s specifications (see lecture handout). SAS code flowrate.sas - Data set: flowrate.txt.

 

Week 7 - Model selection in multiple regression.

Car fuel consumption - The data were collected in 1975 on 32 brands of automobiles. The data are on gas mileage (miles/gallon), motor displacement (cubic in), horse power (ft-lb), torque (ft-lb), compression ratio, rear axle ratio, carburetor (barrels), number of transmission speeds, overall length (in), width (in), weight (lbs) and type of transmission (1-automatic, 2=manual). he goal of the analysis is to determine which variables are important for predicting the gasoline mileage performance of the vehicles (in miles per gallon). SAS program mileage.sas - Data file mileagesas.txt

 

Week 6 - More on multiple regression - non-linearity and collinearity.

Performance of field sales representatives. Data in Chapter 13 of OTT. 

Data were collected on 512 sales representatives and  include

PROFIT = net profit margin for all orders placed through the representatives.

AREA = area of the district in thousands of square miles

POPN = millions of people in the district

OUTLETS = number of outlets in the district

COMMIS = 1 for full-commission representatives and 0 for partially salaried representative. SAS program sales.sas - Data file sales.txt

 

Movie opening data: Data were collected on 32 movies released between 1997-1998. The data are on the variables: Movie = Title of the movie

Opening = Gross receipts for the weekend after the movie was released (in millions of dollars) Budget = The total budget for the movie (in millions of dollars), Star = Whether or not the movie has a superstar, Release = Whether or not the movie was released in the summer. SAS program movie.sas - Data file movie.txt

 

Week 5 and 6 - Multiple regression.

CPU usage: A study was conducted to examine what factors affect the CPU usage. A set of 38 processes written in a programming language was considered. For each program, data were collected on the CPU usage (time) in seconds of time, the number of lines (line) in thousands generated by the program execution, the number of programs to complete an application, the number of devices. SAS program cpu.sas - Data file cpudat.txt

 

World Oil production: Data are on the world oil production in millions of barrels. SAS program oilprod.sas - Data file oilprod.txt

 

Week 4  - Straight line regression.

The tower of Pisa: The following data represents the lean of the tower of Pisa from 1975 to 1987. 

   

Year 75  76 77 78 79 80 81 82 83 84 85 85 87
Lean  642  644 656 667 673 688 696 698  713 717 725 742 757

    

The lean is the distance between where  a point at the top of the tower is and where it would be if the tower were straight.  The units for the lean are tenths of a millimeter above 2.9 meters.  So 642 is short for 2.9642 meters. SAS program: pisa.sas.

 

Diamond size: The source of the data is a full page advertisement placed in the Straits Times newspaper issue of February 29, 1992, by a Singapore-based retailer of diamond jewelry. The variables are  the size of the diamond in carats (1 carat = .2 gram) and the price of ladies’ rings (single diamond stone) in Singapore dollars. Data Set: diamond.txt   SAS Program: diamond.sas

 

Week 3 - Non parametric tests on averages and two-sample tests.

 

Mortgage refusal rate: Data on refusal rates for white and minority applicants. SAS code mortgage_dep.sas - Data set: mortgage_refusal_rate.txt.

 

Flow rates data: The data were collected in a study comparing the variability in air velocity through clean-room air filters to the variability stated in the purchaser’s specifications (see lecture handout). SAS code flowrate.sas - Data set: flowrate.txt.

 

Recycling waste data (page 243 in Ott): data from a random sample of 25 households, the weekly weight of recyclable material is recorded (see lecture handout). SAS code waste.sas.

 

Keyboard data (2): Data are the completion time (in sec.) for standard data entry tasks using two different types of keyboards. The data points (y1,y2) are pairs of times required to complete a specific task performed by the same individual using keyboard 1 and keyboard 2, respectively. SAS code keybrd2.sas.

 

Week 2 - One sample tests on averages.

Keyboard data: Data are the input times of a standard data entry task with a new type of keyboard. SAS program file: keyboard.sas.

 

Web-application data - A web developer company creates a search engine for a travel-dot com company. Data on the time needed to the application to process  each flight search. Data in averweb.txt and SAS program file: averweb.sas.

 

Time between failures - Data on the time (in hours) between system failures were collected during a study on machine performance that involved 39 similar processes. Data set in faildata.txt, SAS program file: faildata.sas.