- Always save a clean version of your data that you do not touch (especially if accessing your data is difficult) / Never save over your only data file with recoded variables
- Always use syntax, so you have a record of what you are doing
- Keep your syntax nicely organized
- Use notes liberally in your syntax, in SPSS and STATA this means starting a line with *
- Organize syntax into sections, e.g.
Category: <span>Statistics</span>
Any statistics you calculate requires knowing what the level of measurement is, which means that you gotta know this to get very far with statistics. So what it is? I might say that level of measurement refers to the degree of quantification of a variable.… Read More “Levels of Measurement”
FREQUENCY TABLES
Frequency tables are kind of what they sound like: tables that show the frequency (or how many times) a particular response is given. These are particularly useful for summarizing your data when you have a lot of cases. It’s hard to see what’s going on when all you have are the individual responses each person gives, so frequency tables help us to organize those responses into some kind of meaningful order.… Read More “Frequency Tables”
Calculating a mean or a standard deviation is not something done all that often, given that you can only calculate such statistics with interval or ratio level variables and most such variables have too many values to put into a frequency table that will be informative beyond what raw data would look like.… Read More “Calculating a mean or standard deviation from a frequency table”
When doing social research, what we care about most are populations, but what we have access to are samples. Samples are subsets of a population that we use to make estimates about populations. For example, if I wanted to know about American attitudes about the president, I couldn’t just pick up the phone and call the 327+ million American people and say “hey, gurrrl, what do you think about the prez?”… Read More “Samples vs. Populations”
Funny story… so, a “friend” of mine may have a bit of an obsession with a particular app that allows a person to decorate rooms. It’s silly and leads this friend to zone out more than she probably should, but it’s one of her guilty pleasures.… Read More “Beware of online consumer surveys”
Carrying out statistical analyses by hand can only take you so far. Usually, we need computers to help us manage large amounts of data. One software program that is used by a lot of people is SPSS. There are many other programs you could use to manage data, such as STATA, R, SAS, or even Excel.… Read More “Pros & Cons of SPSS”
Step 1. Open SPSS on your computer
The first screen that’ll pop up asks what data you want to open. I usually just ‘x’ out of this screen or hit cancel.
Step 2. You’ll end up on a blank spreadsheet. To open the data:
1) click “file” in the top left hand corner, then
2) Open, then
3) Data
Note: Other types of files you are likely to need are “syntax” where you can type in code and “output” where you will be getting your results.… Read More “Opening Data in SPSS”
Before setting out to analyse or recode any variables, one needs to know what’s going on with the variables of interest. A few ways to do this that I’m going to show you a few ways to do this by calculating/creating:
- Frequency Tables
- Descriptive Statistics
- Graphs
Before doing any of these steps, make sure you have your data open first.… Read More “Exploring Data in SPSS”
After you’ve opened your data in SPSS and explored your data, you will likely want to recode some or all of your variables. Perhaps you want to change marital status so that married people and those in common-law relationships are counted as the same.… Read More “Recoding Variables in SPSS”
If you want to calculate any statistics split into the different categories of a variable, you can do this easily in SPSS. All you need to do is the following:
- Click on “Data” from the pull-down menus
- Click on “split file” found almost all the way at the bottom of the list of commands
- Toggle the circle at the top that says “Compare groups”
- You will now be allowed to move the variable you want to split your data by, from the box on the left into the “Groups Based on:” box on the right .
Measures of central tendency–such as the mean, median, and mode–tell us about the general trend of the cases in our sample. Do the responses fall closer to one end of the distribution or another? Are people generally high or low; young or old; or whatever might be our variable.… Read More “Calculating Measures of Central Tendency in SPSS”
Statistical estimation is the technique of making inferences (or estimates) about a population based on sample data. Because we cannot know precisely what is happening with a population, we can only make our best informed assessments from what we know of our random sample.… Read More “Calculating Confidence Intervals in SPSS”
Calculating a t-test in SPSS in order to compare differences between means and/or proportions
What you’ll need variable-wise:
For comparing mean differences between groups: a dichotomous (binary) independent variable and an interval-ratio dependent variable.
For comparing differences in proportions between groups: a dichotomous (binary) independent variable and a dummied dependent variable (i.e.… Read More “Calculating a t-test in SPSS”
Analysis of Variance (ANOVA) is a statistical test used to assess where differences in means between three or more groups are statistically significantly different. Statistical significance refers to the probability that the relationships observed in our sample, comes from a null distribution.… Read More “ANOVA in SPSS”
Chi-Square in SPSS
Calculating a chi-square statistic in SPSS is quite simple, as long as you have two categorical variables. The much harder part is in the interpretation (which we’ll get to way down below). In the example here, we are using 2016 Census (Canada) data with the following variables:
- School Attendance, asked in the survey as “At any time since September 2015, has this person attended a school, college, CEGEP, or university?”

There are three main types of files used in Stata:
Data File: Filename.dta
Syntax File: Filename.do
Output file: Filename.log
The data file is where all the data live. The syntax is where you tell STATA what to do with your data.… Read More “The Basics of STATA”