Note:
To run the SAS code files, you should save them on your computer as .sas
files.
Week
10 - Comparison of multiple groups - GLM procedure.
Plant
waste: The data refer to five suppliers of the Levi-Strauss clothing
manufacturing plant in Albuquerque. The firm's quality control department
collects weekly data on percentage waste (run-up) relative to what can be
achieved by computer layouts of patterns on cloth. It is possible to have
negative values, which indicate that the plant employees beat the computer
in controlling waste. Under questions, are the difference among the 5
plants. SAS code
plantwaste.sas
- Data set: plantwaste.txt.
Week
9 - Comparison of multiple groups - ANOVA model.
Heart
rate data: How is heart rate affected by physical activity? The more work your body is doing the higher your heart rate will be. An experiment was conducted in the fall of 1993 to explore the nature of the relationship between a person's heart rate and the frequency at which that person stepped up and down on steps of various heights. Both frequency of stepping and height of the step were expected to affect heart rate.
See lecture notes. SAS code
heartrate.sas
- Data set: stepping.txt.
Flow rates data: The
data were collected in a study comparing the variability in air velocity
through clean-room air filters to the variability stated in the
purchaser’s specifications (see lecture handout). SAS code
flowrate.sas
- Data set: flowrate.txt.
Week
7 - Model selection in multiple regression.
Car
fuel consumption - The data were collected in 1975 on 32 brands of
automobiles. The
data are on gas
mileage (miles/gallon), motor displacement (cubic in), horse power
(ft-lb), torque (ft-lb), compression ratio, rear axle ratio, carburetor
(barrels), number of transmission speeds, overall length (in), width (in),
weight (lbs) and type of transmission (1-automatic, 2=manual). he
goal of the analysis is to determine which variables are important for
predicting the gasoline mileage performance of the vehicles (in miles per
gallon). SAS
program
mileage.sas -
Data file
mileagesas.txt
Week
6 - More on multiple regression - non-linearity and collinearity.
Performance
of field sales representatives. Data in Chapter 13 of OTT.
Data
were collected on 512 sales representatives and include
PROFIT = net profit margin for
all orders placed through the representatives.
AREA = area of the district in
thousands of square miles
POPN = millions of people in
the district
OUTLETS = number of outlets in
the district
COMMIS = 1 for
full-commission representatives and 0 for partially salaried
representative. SAS
program
sales.sas -
Data file
sales.txt
Movie
opening data: Data were collected on 32 movies released between
1997-1998. The data are on the variables: Movie = Title of the movie
Opening
= Gross receipts for the weekend after the movie was released (in millions
of dollars) Budget = The total budget for the movie (in millions of
dollars), Star = Whether or not the movie has a superstar, Release
= Whether or not the movie was released in the summer. SAS
program
movie.sas -
Data file
movie.txt
Week 5
and 6
- Multiple regression.
CPU usage: A
study was conducted to examine what factors affect the CPU usage. A set of
38 processes written in a programming language was considered. For each
program, data were collected on the CPU usage (time)
in seconds of time, the number of lines (line)
in thousands generated by the program execution, the number of programs to
complete an application, the number of devices. SAS
program
cpu.sas
-
Data file
cpudat.txt
World Oil production: Data are on the world oil production in
millions of barrels. SAS
program
oilprod.sas
-
Data file
oilprod.txt
Week 4 - Straight line regression.
The tower of Pisa: The
following data represents the lean of the tower of Pisa from 1975 to 1987.