Friday, May 29, 2015

Maternity Experiences Survey PUMF

I wanted to clarify that there isn’t a PUMF for the Maternity Experiences Survey (, but I just wanted to confirm that.


That is correct, a PUMF is not available for the MES. There are publications and data tables available from the Public Health Agency website.

For more information about the Maternity Experiences Survey, consult the Public Health Agency of Canada website <>. The masterfile is also available in the RDCs <>.

If interested in a custom tab, please advise and I can put you in touch with subject matter.

Thursday, May 28, 2015

Updated Products: APS PUMF 2012

Following a client request relating to the APS 2012 Public Use Microdata File (PUMF), the Stata program APS_bootstrap.dct was created to read the flat data file APS_bootstrap.txt into a usable bootstrap weight file in Stata, and the two Stata programs and were created to label the variables in the bootstrap weight file (either English or French). In addition, two SAS programs APS_bootstrap_lbe.SAS and APS_bootstrap_lbf.SAS were created to label variables in the bootstrap weight file in SAS.

These new Stata and SAS programs were not initially created and included with the other APS 2012 PUMF files, but the client request brought to our attention that they should have been part of the APS 2012 PUMF package.

EFT: /MAD_DLI/Root/other-products/Aboriginal Peoples Survey-aps/2012/doc/syntax

Wednesday, May 27, 2015

New Files on Statistics Canada Nesstar

We are pleased to inform you that the following are now available on the Statistics Canada Nesstar Webview site ( ).


Residential Telephone Service Survey, 2013
National Graduates Survey 2013 (class of 2009/2010)
National Household Survey, 2011 (Hierarchical File)
Canadian Tobacco, Alcohol and Drugs Survey, 2013
And more to come!...

Anyone can view the metadata for these files on the Statistics Canada Nesstar site. Only DLI Contacts however will be able to access the data files on the site (PUMFs). If you are a DLI Contact and would like to access and manipulate data files on the Statistics Canada Nesstar site you will need to request a password. Please note that the PUMF data files available through Nesstar are subject to the PUMF Licence agreement.

To access the microdata housed in the Research Data Centres (RDCs), researchers must submit a project proposal to the Social Sciences and Humanities Research Council (SSHRC) and Statistics Canada.

Tuesday, May 26, 2015

DLI License in Electronic Format


Where can I find the DLI License? I can find information about the DLI License on the DLI site, but I can't find the actual license and I'd like to be able to direct people to an electronic copy.


At the present time we do not have an accessible electronic version of the licence on our website. We are revising the website content this summer and will be looking into creating an accessible version of the licence.

NPHS Sense of Coherence


A researcher is looking at the “Sense of coherence” information found in the National Population Health Survey. This seems to appear in cycles 1, 3, and 8. Since cycles 1 and 3 have public use microdata files, we were able to have a look at this data. However, the researcher has many questions about this, which I am not able to explain. Specifically:

“I would like clarifications to the NPHS data on Psychological Wellbeing, specifically the 13 item Sense of Coherence (SOC) scale. My understanding based on Antonovsky's original 13 item SOC scale is that the questions are scored on a Likert type scale ranging from 1 to 7 with scores ranging from 1-91.

How did you obtain your minimum, maximum, and average scores with the Cycle 1 and 3?

Why are your scores ranging from 4-78 (cycle 1) and 0-78 (Cycle 3).

Specifically, what constitute a lower, higher, and average scores from the cycles (1 and 3)”


Please see response from subject matter:

I have attached some background info. Unfortunately there is no longer a team for NPHS. We do not have the specific answers for the questions, however the following information should help the researcher.

The attached articles, (the first 2 pdfs) named ‘A Healthy Outlook’ and ‘Healthy Aging’, examined sense of coherence. Also attached (3rdpdf) is the Cycle 8 derived variable documentation for Sense of Coherence Scale (ST_nDH1).

The Cycle 1 to 3 derived variable documentation excerpt is shown below:

19.3 Sense of Coherence Scale
Cycle 3 Name: PY_8DH1
Cycle 2 Name: N/A
Cycle 1 Name: PY_4DH1
Based on sum of PY_n_H1 to PY_n_H13
Min = 0 , Max = 78

The 13-item version of the sense of coherence scale developed by Antonovsky was used in the NPHS. It denotes the extent to which individuals perceive events as comprehensible, manageable and meaningful.

The concept of manageability is addressed in questions Q3, Q4, Q8 and Q10. Items Q1, Q9, Q11 and Q13 measure meaningfulness and items Q2, Q5, Q6, Q7, Q12 are related to the comprehensibility dimension.

Higher scores indicate a stronger sense of coherence.

Score was reversed for questions PY_n_H1, H2, H3, H8 and H13.

GSS 23 (2009) and Domestic Abuse: Coverage of Current Versus Ex-Spouses/Partners


A student is looking for domestic abuse by type of spouse/partner in the household. We have a basic question about this - does this make sense given the available variables? From what we can see, the variables that cover abuse (physical/sexual, or emotional/financial) pertain to ex-spouses/partners, whereas the victims are often not living with a current spouse/partner.

Are we missing substantial abuse variables (physical/sexual, or emotional/financial) that relate to current spouses/partners rather than ex’s? If not, could we go back to older cycles for this?


To answer the question below, the student should be looking at both current and ex spouse/partner (which is available in Cycle 18 and 23).

Here is a breakdown of 6 derived variables that are related to ex/current abuse (depending on what student means by ”domestic abuse”, they might want to look at multiple abuse categories). From what I can see from previous StatCan articles, if they are looking at what is called “Spousal/ex-Spousal Violence”, then they are most likely restricting the sample to PRVIOL = 1. However, if you would like to look at all forms of abuse (including emotional/financial, you will want to include CREMFIN/EXEMFIN, and SPABUSE/EXABUSE in your analysis.

***Note - It is important to not lump them into one category because the timelines differ (e.g., emotional/financial appears to be covered across a lifetime, whereas physical/sexual is covered in the last 5 years).

(Current spouse/partner abuse variables)

CREMFIN (emotional/financial abuse) by current spouse/partner (EFP_Q210 to EFP_Q270) (during lifetime)

SPABUSE (physical/sexual violence) by current spouse/partner (PSP_Q110 to PSP_Q200) (past 5 years)

CRVIOL (physical/sexual violence by current) à can specify different time frames - past 12 months, etc.

(Ex spouse/partner abuse variables)

EXEMFIN (emotional/financial abuse) by ex-spouse/partner (EFX_Q210 to EFX_Q270) (during lifetime)

EXABUSE (physical/sexual violence) by ex-spouse/partner (PSX_Q110 to PSX_Q200)

EXVIOL (physical/sexual violence by ex) à can specify different time frames - past 12 months, etc.

(Both ex and current)

PRVIOL (physical/sexual violence by current or ex-spouse/partner à can specify different time frames - past 12 months, etc.

Not sure if student is interested in looking at proportion of abuse by spousal/partner. If that is the case, you could do something like this...restrict main file to “if marstat in (1,2)” (common-law or married) and then restrict to “if prviol = 1” (physical/sexual violence)

OR if you wanted to see what proportion of abuse was ex-spouse/partner vs. current-spouse/partner, you could restrict main file “if prviol = 1”, then cross by crviol (current) and exviol (ex).

Monday, May 25, 2015

Place of Work - Communting Flow Tables for 2011


A researcher is mapping potential flu epidemic scenarios and would like to use commuting to work distances at the CT level. In 2006, there were both standard products (97-561-XCB2006) and custom products (97c00) from DLI that covered place of work - commuting flows to Census Tracts. Are there similar 2011 NHS tables, or is this possible through custom tabs?


From the NHS, we have commuting flows only at the census subdivision level.

99-012-032 Commuting Flow - Census Subdivisions: Sex (3) for the Employed Labour Force Aged 15 Years and Over Having a Usual Place of Work <>

The CT commuting flows would only be available on a custom basis.  

Thursday, May 21, 2015

Indexing/Abstracting of "Insights on Canadian Society" in the CBCA Complete Database

As you’re probably aware, the following three Statistics Canada periodicals (containing descriptive commentary with a mix of tables, graphs and pie charts) ceased in 2012: Canadian Social Trends, Canadian Economic Observer and Perspectives on Labour and Income. (They are all indexed in CBCA Complete.) They are continued by Insights on Canadian Society. I’ve just been informed by ProQuest that indexing/abstracting of this title will be available in the CBCA Complete database within the next two weeks.


Government Finance Statistics


I have a researcher looking for the same information available in CANSIM 385-0001, but for more recent years than 2009. There is a note in the table saying:

“Note: 2007/2008 revised estimates and 2008/2009 estimates are the final reference years for which government revenues and expenditures statistics are available on a Financial Management System (FMS) basis. Statistics Canada will be adopting the International Monetary Fund accounting standard for government, called Government Finance Statistics, in 2014 starting with the reference year 2008/2009.”

So more recent data is available in 385-0032, however, not at the same level of specificity, such as breaking down spending into health, social services, education, etc.

Is this information available elsewhere for 2010 onward?


There are several dimensions in table 385-0001, some are not addressed in the new CANSIM tables (at this point) and others are addressed in more than one table.

In other words, my answer would change depending on what dimensions the client is looking for exactly.

The new government data is covered in tables 385-0034 up to table 385-0040 – if what is looking for is released at this point, it would be there.

Wednesday, May 20, 2015

Pilot with CIHI Extended! New files from the DADs

I am pleased to announce that the DLI has extended the pilot with the Canadian Institute for Health Information (CIHI) for another fiscal year.

New samples files from the Discharge Abstract Database (DAD) Research Analytic Files for 2013-2014 are now available on the DLI EFT!


Any questions, do not hesitate to contact me! If you have questions regarding the specific file, I will forward along accordingly to CIHI.

Tuesday, May 19, 2015

Spatial and Numeric Data Services (SANDS) Update

Spatial and Numeric Data Services (SANDS) has new web pages! 

Please check them out here: <> NOTE: The URL is NEW, too! Please change any links you may have to them accordingly.

We welcome your feedback. If you see things that need changing or, if you have suggestions to improve the pages, please send them to me.

1981 Census Division Map of Alberta

I wonder whether any of you may have a map of the 1981 Alberta census divisions on you?A student I am working with has to determine the census division for Fort McMurray in 1981. It is now in division 16, however, according to CANSIM table 051-0031, there were only 15 census divisions in Alberta in 1981.


In 1981 Fort McMurray was in Census Division # 12.

Thursday, May 14, 2015

Release Date Updates for GSS 2013


A user here is wondering when in May and June the cycle 27 General Social Surveys will be available – is it possible to get an updated release date?


I followed up with Subject Matter and received an update that there has been a delay from the previously anticipated dates:

The GSS Social Identity component is scheduled for release this summer (we’re aiming for late June-early July) and the Giving, Volunteering and Participating is scheduled for release in the fall of 2015.

Divorce Rates in Canada


Could you point me to the latest divorce rates in Canada. I could only find 2008 data.


CANSIM tables pertaining to divorce rates were derived from the Vital Statistics - Divorce Database, which is no longer active. For more information, see the survey page found here


The most recent data available: Data release - July 13, 2011 (2008 is the last year for which these data were collected.)

Wednesday, May 13, 2015

New ICS study

Participation in extracurricular activities and high school completion among off-reserve First Nations people, 2012

by Paula Arriagada

Insights on Canadian Society

This study reports on predictors of high school completion by age 18 among First Nations males and females aged 18 to 24 living off reserve, with a particular focus on extracurricular activities (participation in sports, arts and clubs). The results are based on data from the 2012 Aboriginal Peoples Survey (APS).

To access the study released today:

Usage Stats for Training Repository


I'm Wondering about whether some usage statistics are available for the training repository:

1) How often have files in the Training Repository been downloaded, either in total or by file?

2) How many searches are conducted?


They are using Google Analytics to report on it.

I can report that in 2014, there were a total of 35 new entries into the repository. In 2014, there were 1,676 sessions of the repository which accounted for 7,867 page views. It is in the Bi-annual report, which is available to DLI contacts, under section 1.2



How I can provide a researcher with the KLEMS database? I found an earlier message saying it is CANSIM but I don't think that would be everything? (CANSIM 383-0021 and 383-0032) I also know that over time the data went from 4 digit NAICS to 3-digit NAICS which caused a problem for a different researcher.


The database KLEMS is now available in CANSIM tables 383-0021 and 383-0032. Table 383-032 replaces 383-0022. The data was distributed the 23 of April 2014. The author division has also supplied the DLI with the following table, please see attached.

Tuesday, May 12, 2015

Canadian Internet Use Survey (CIUS) 2014 PUMF?


Will there a PUMF for the CIUS? And if so, is there an expected release date?


Note the most recent collection period for CIUS is for 2012. I also check the surveys currently in collection and CIUS is not listed <>

I received the following confirmation from subject matter –

At this point, we do not have any funding to run another iteration of the Canadian Internet Use Survey. We are currently pursuing options to run another iteration but nothing is in place yet – I do not think we would have anything in the field until RY2016 at this point since a redesign would also be necessary to reflect the changes in technology and priorities.

We would plan to include the creation of a PUMF in any dissemination plans for a future CIUS.

Monday, May 11, 2015

Updated Products - Labour Force Survey (LFS) April 2015

Labour Force Survey (LFS) – April 2015

LFS data for April 2015 are now available on the EFT site.

The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.

The LFS guide:

Eft: /MAD_DLI/Root/other-products/Labour Force Survey - lfs/1976-2013/data/

Friday, May 8, 2015

Survey of Emergency Preparedness and Resilience in Canada


I have a faculty member interested to know if a PUMF is planned for the ‘Survey of Emergency Preparedness and Resilience in Canada’. I can see that the page says ‘data release October 2015’ but I’d like confirmation on if that means a PUMF, or not. <>.


I confirmed with subject matter that there will be an analytical report published, but at this time there are no plans for create standard tables or a PUMF. The file will also be accessible through the Research Data Centres.

CDC Violence Against Children Surveys (VACS)

I read about these surveys in a newsletter and thought they were interesting enough (in a profoundly sad way) to share with others.

Violence Against Children Surveys (VACS)

Survey on Financing and Growth of Small and Medium Enterprises


Are there any data publicly available from Survey on Financing and Growth of Small and Medium Enterprises? The data is supposed to be released in Fall 2015, can we find out what product is expected to be released?


I do not see any standard tables on the statcan website. I noted on the survey page that Statistics Canada is conducting this survey on behalf of a consortium led by Industry Canada. The data obtained from this survey will be used by both the public and private sectors. Industry Canada will use this information to study the availability of financing to SMEs and to recommend policy changes to assist businesses.

Entry in the Daily

The following report is on Industry Canada's website
Summary of the Survey on Financing and Growth of Small and Medium Enterprises, 2011

Pertaining to data available:

Request an instant electronic copy of the results:
2011 Survey in PDF (2.6 MB, 39 pages), or
2011 Survey in Excel format (1.5 MB)
Survey Information:
2011 Survey Questionnaire

2011 Methodology Report

ICO Installation


I’ve had a question from a professor who’s using ICO for a research project. He’s encountering intermittent difficulties with the installation of the software; specifically, the installer isn’t compatible with newer versions of Windows. He has an old machine that he can use if necessary, but would prefer to avoid it. Has anyone else had or heard of similar issues?


What year is your researcher using? As of 2001, its to be used with Windows XP, 7, 8 as long as have MS Access. I do not believe it works with Windows 7 (v. 1998 to 2010). 2010 Q4 onwards works with XP and Windows 7.

There is also an ICO product review at the DLI ontario training, which can be found on the DLI training repository <>.

Thursday, May 7, 2015

CCHS 2012 - Middle East Immigrants


I have a student who is looking for a variable that matches a question in the CCHS 2012 Questionnaire. She is looking to identify immigrants from the Middle East and is specifically interested in SDC_Q4C which asks if the respondent belongs to one or more of the racial or cultural groups on the list. I couldn’t find this variable or a similar variable anywhere in the dataset and most other responses to cultural questions only have responses of White or Visible Minority.

I’m getting the impression that the only variable that describe cultural origin is SDCGCGT, Cultural or racial origin, and it does not necessarily specify the cultural group or relate to immigration. SDCFIMM, immigration flag, only seems to answer yes or no and doesn’t specify a region. Am I correct assuming we can’t isolate data on immigrants from specific areas in CCHS 2012?


Questions that are asked on a survey questionnaire are not necessary reflected on the Public version of the microdata file.

Each Public Use Microdata File (PUMF) is based on a corresponding master data file. The modifications performed by Statistics Canada before the PUMF is released ensure that the risk of breaching confidentiality has been removed. Since the results of any analysis performed do not have to be scrutinized before they are released, the file is considered "public."

Modifications made to master files to convert them to PUMFs may include: collapsing of variables (e.g., age groups instead of individual years of age); collapsing variables into one variable (e.g., multiple language questions collapsed into one language variable for analysis); suppressing variables (although the variable is part of the master file, it will not show up in the public file); and removing outliers (removing cases that are extremes - often used with income).DLI Survival Guide: <>

The variables related to the SDC module of the questionnaire that are present on the public microdata file are:

2012: Annual Component
SDC: Socio-demographic characteristics
Country of birth - Canada/other
Knowledge of official languages
First official language spoken - (D)
Owned, Rented
Language(s) spoken at home
Immigrant - (F)
Length/time in Canada since imm. - (D,G)
Cultural or racial origin - (D, G)
Variable SDCGCGT: Cultural or racial origin - (D, G) , as you noted, is a derived variable from -


Based on SDC_7A, SDC_7B, SDC_7C, SDC_7D, SDC_7E, SDC_7F, SDC_7G, SDC_7H, SDC_7I, SDC_7J, SDC_7K, SDC_7L, SDC_7M, SDC_43A, SDC_43B, SDC_43C, SDC_43D, SDC_43E, SDC_43F, SDC_43G, SDC_43H, SDC_43I, SDC_43J, SDC_43K, SDC_43L, SDC_43M.

I checked the metadata for the masterfile available in Nesstar <> and noted the following variables were available on the masterfile.
Inline image 1
access to the masterfiles is through the Research Data Centre
For more information:

or your researcher can request a custom tab:

Custom tabulations for all Health Statistics Division Surveys are also available on a cost recovery basis. For more information, contact Health Statistics Division Client Services Unit at

LFS Question

I have downloaded quite a number of files the Labour Force Survey for a researcher. In the data directory, there are folders with names like "". When I unzip them the files are .prn files. As I know very little about SPSS, could someone please tell me what these files are and how to use them in SPSS or if they are even for SPSS?


The .prn files are ASCII text data files. You need to edit the SPSS syntax file (the .sps one) to point to the data file location, and then you can run the syntax within SPSS to load the data. Afterwards you can save it as a .sav file or export into any number of different format.

You can find rather more detailed instructions here (
It’s for ICPSR data, but the principle is the same):


These are the data files. In the doc folder I believe is where you will find the SPSS syntax to create the file. I recommend consulting the DLI Survival guide, which helps address how to work with the syntax file. 

Wednesday, May 6, 2015

May 6, 2015 RELEASE: Health region data now included within the 2011 Census and NHS Profiles

As part of the ongoing partnership with Health Statistics Division, the Census Program Dissemination Project is pleased to announce that the 2011 Census Profile and the 2011 National Household Survey (NHS) Profile have been updated to include census and NHS data for the 121 health regions in Canada plus an additional 14 Local Health Integration Networks (LHINs) in Ontario.

For both profile products, the place name, postal code and geographic code searches have been updated to now display health regions in the results. The health region names and boundaries are those that came into effect in December 2013. Historically, census data for health regions were also included within the 2001 and 2006 Census Community profiles.

Friday, May 1, 2015

RTRA questions

We have some questions regarding RTRA. A lot of the questions below reflect the concerns of smaller institutions which are concerned with equity of access to these data regardless of university size and budget.
1. How much does it cost for a university to subscribe?

Minimum $5K, attached word document explains the set up!

2. Can two universities partner together for a RTRA subscription (similar to a RDC / Branch RDC model)?
We could probably work out a scenario where this would work but there would be several constraints. The RTRA service has 2 contracts, the first with institution that pays for the service, the other is with the individual user. For a partnership to work there would have to be some documentation between the institution that pays for the service and the users from the partner institution. This could get complicated though.

3. How many “seats” for RTRA users are acquired for this payment (one “hub/focal point” and how many “floating” licenses)?

Details in the word document

4. Is it possible to get fewer “seats” for a reduced payment?

No, it really isn’t feasible for us to lower the cost of the RTRA. Even if we managed to sign up more institutions with a lower cost, we wouldn’t be able to support the users or the IT infrastructure. Essentially we would be shooting ourselves in the foot.

5. How is billing done (e.g., fiscal year billing, calendar year billing, billing from time of subscription to next year, prorated billing to end of fiscal year, etc.)?

Fiscal year billing, however, we will prorate the cost of the RTRA for institutions signing up later in the year.

6. Is there any reduction in subsequent years, since the “hub/focal point” would presumably remain constant?

No, the cost of the service will not be lowered in subsequent years.

Other questions unrelated to pricing:

1. I don’t recall hearing about a research application process in the past. How would that work for a library focal point which wished to support and fulfill data requests from graduate or undergraduate students, or from faculty or staff (e.g., university institutional planning or president’s office) who simply wanted tables produced for them?

There is no research application process. We ask each of the RTRA users to let us know why they need access to the system and data but there is no evaluation process or criteria. The application form is used to collect information about the user so that we can create their account.

2. We were told that users needed to specify which datasets they intended to access. One of the selling points in a presentation was that a user didn’t need to do so – having obtained access to RTRA, they were free to use whatever files they needed. Has this changed (and why, as it makes the service far less attractive)?

In the user application we ask the RTRA user to be precise in terms of which data sets they would like to access. They can indicate all available if they choose. There is no change in price if users choose to access one or all of the data sets in the RTRA.

New ICS study

Changes in debt and assets of Canadian families, 1999 to 2012

by Sharanjit Uppal and Sébastien LaRochelle-Côté

Insights on Canadian Society

This paper examines changes in debt, assets and net worth among Canadian families with debt over the period 1999 to 2012, by selected family characteristics. It also examines the extent to which two key ratios of indebtedness, the debt-to-income ratio and the debt-to-asset ratio, varied over the period.

To access the study:

New ICS study

Grandparents living with their grandchildren

by Anne Milan, Nadine Laflamme and Irene Wong (Demography Division)

Insights on Canadian Society

Even though most grandparents live in separate households from their adult children and grandchildren, sometimes the grandparent and grandchild generations live together. This article provides information on the number of grandparents who are in this particular situation, along with their living arrangements and their ethnocultural and sociodemographic characteristics.

To access the study: