SOCI 6015
Final Analysis Paper
Instructions: For this analysis paper, conduct a multivariate regression analysis and write a paper describing your findings. You should be writing this as though you were presenting your findings in a formal research article. Remember that although collaboration on the assignment is acceptable, the essay needs to be distinctly your own work.
Your paper should consist of the same three sections as in the previous labs, plus a hypothesis section and a required appendix. You should include all of the following sections (accompanied by properly formatted tables): 1) Hypotheses, 2) Data, 3) Methods, 4) Results, and 5) Appendix.
1. Preparing Your Data
- Select the following ten variables for your analysis. You will need an interval-ratio dependent variable, and the following independent variables: two
- Clean your Data
- Recode any missing data
- Reverse code variables and create dummy variables as necessary.
- Check your coding with cross-tabulations (append the SPSS output)
- Construct an additive index (scale) variable. It must be computed from at least three other similar variables (either nominal or ordinal, as long as they all have the same category scheme).
- Data Transformations
- Use the ladder of powers to examine alternate transformations both
- Create a table of the transformations and discuss which transformations may be useful and why.
- Interaction Term
- Compute an interaction term from two of your existing variables to be included in one of the intermediate models in your analysis (not the final model).
appropriately coded dichotomous nominal variable, two appropriately coded multi-category nominal variables, two ordinal variables, interval-ratio variables, and one index (a variable measuring a latent concept comprised of several existing variables).
interval-ratio variables. Make sure to use the following: x3, x2, identity,v, ln(x) 1/v, 1/x, 1/ x2, 1/x3.
A1. Paper Sections
- Hypotheses
- Discuss your empirical expectations for the independent variables that you are interested in and formally state a hypothesis for each (you must have at least four).
- Data Section
- Discuss the specifics of the dataset, including the particular details of the year you are using (e.g., the size of the sample, the non-response rate, a description of the sampling frame, type of probability sample, how the data was collected, etc.).
- Compute a table (Table 1)of the Kolmogorov-Smirnov tests for each transformation of both of the two interval-ratio variables. Make sure to include the test statistic and probability value of each transformation. Remember that you don’t need to actually include a transformed variable in your final analysis, but you do need to discuss the advantages and disadvantages of doing so (both theoretically and empirically), and how you would select an appropriate transformation.
- Construct a table (Table 2)of the Variance Inflation Factors (VIF) for all of the independent interval-ratio and ordinal variables. You don’t need to compute the VIF for the nominal variables, but these variables still need to be estimated in the models you use to compute the r-squared value of the interval-ratio and ordinal variables. Interpret the table and diagnose any potential multi-collinearity issues.
- Construct a table of descriptive statistics (Table 3)and use it to explain your variables (how they variables are coded in the analysis and what the descriptive statistics of each mean) to your reader in detail. Make sure to discuss the mean of the variables (or the percentage for the dummy variables).
- Explain your variables and how they are currently coded (not the steps you took to recode) to your reader in detail.
- Discuss how you dealt with missing data.
- Methods
- Discuss the assumptions of the OLS regression model and assess how well your model meets these assumptions.
- Discuss your analytic strategy (what you are going to do in the following analysis and why).
- Discuss the logic you are using to add variables to the analysis (you need to have at least four separate models).
- Results Section.
- Construct a correlation table (Table 4)of all of the variables in the model using Pearson’s r as the measure of association (since SPSS can’t compute polychoric correlations even though they would be the most appropriate to use here). Discuss the table and diagnose any potential multicollinearity problems.
- Compute a table (Table 5)of your multivariate regression analysis. Make sure the table can stand-alone, and that it includes the r-squared valued and N at the bottom of each column.
- Make sure to also include the interaction term in in one of the models (but not the final model) and discuss whether it is significant or not.
- Discuss model fit using r-squared.
- Interpret the significant coefficients in the final model.
- Interpret any changes in significance or effect size as it relates to mediating/intervening variables.
- Evaluate your hypotheses.
- Appendix
- Attach all of the cross-tabulations you produce when you recode your variables (these can be imported directly from the SPSS output).
Conduct a multivariate regression analysis. Compute at least four separate models (all shown on the same table), beginning with the primary variable of interest, and adding additional groups of variables in each model. At least three of the independent variables need to be significant in the final model.