Working with your group, go to the DASL homepage and choose a data set:Conduct the following analysis for each chosen variable of your data set: (15 points)
- Click on "All Methods"
- Choose from either the "Correlation" or "Regression" link
- Choose at least two variables that might have an interesting relationship
Conduct the following analysis for a single variable of your data set: (15 points)
- Graph the histogram
- Indicate whether the histogram is symmetric or skewed
- Report the appropriate center and spread
Conduct the following bivariate analysis of your data set: (25 points)
- Use the normal approximation to calculate the bottom 5% and the top 10% of the data.
- Compute three confidence intervals (90%, 95% and 99%) for the population mean
- Test the significance of the claim that the population average is less than the median value of the data (link to hypothesis-testing calculator). Assume alpha equals 0.05. Report your results.
- Draw a scatter plot of your data.
- Calculate the regression model for your data.
- Determine if there are any outliers in your data. If there are, remove them from the data and reapeat steps 1 and 2 above.
All groups will present their projects during the final exam time (2:45 - 5:00) on Friday November 18, 2005.
Guidelines:
Each group will create
- a spreadsheet showing the data analysis
- a short report summarizing the results of your analysis.
- a web page presenting the results of your analysis.
The web page should contain a non-technical summary of your project with a description of the results of your analysis. It should be intelligible to a person who does not know regression analysis. Suppose you are talking to your boss who does not know statistics or to a friend who is not familiar with statistical terminology. Finally, provide some plausible explanation that involves the story behind the data set and is consistent with your results.
The web page will have links to both the data analysis spreadsheet and the short report. Also, if there are external web pages relevant to your project topic the please add links.
There are no rules on the way the page should look. You can use your esthetical skills to add colors and pictures to the page. Be careful not to overdo with colors, though!
Technical Tips:
- The web page presenting your data will be called project.htm
- Using MS Word and MS Excel, it is easy to save documents as web pages: use the File|Save As.. menu option and then choose "Web Page" from the drop down list in the Save as type: box.
- FTP the files and all related files (picture, background picture if you have any etc) to your students.depaul.edu and place them in a csc239 directory.
- Set the correct permissions for files (644) and directories (711 or 755) either via FTP or by telneting.
- Check out your page and test all the links at http://students.depaul.edu/~yourlogin/csc239/project.htm
- Keep in mind that every time you make changes to the page you have to re-FTP it!
General grading scheme:
Group grade:Individual grade:
- the data analysis and the written report will count 55% of the group project grade.
- the web page with working links will count 25% of the group project grade.
- the presentation will count 20% of the group project grade.
- submit a document that chronicles your contributions to the project as well as the contributions of the other group member(s). This will count 80% of the individual grade.
- the presentation will count 20% of the individual grade.
One student will post the web page on his or her shrike account. One group member will send their final projects as three attachments (the web page, the report and the spreadsheet) in an email to cmiller@cti.depaul.edu. Also indicate the URL of the Web page. Each student will submit an individual report using the online submission Web site.