Thursday, December 29, 2016

Revised Bootstrap weights files for the CTADS 2015 PUMF

Please note that Statistics Canada is currently working on revised Bootstrap weights files for the CTADS 2015 PUMF.

The new files should be ready near the end of January 2017.

Friday, December 23, 2016

Survey Methodology (Statistics Canada's scientific journal)

Survey Methodology

Statistics Canada recently released Volume 42, Number 2 (December 2016) of its scientific journal Survey Methodology on its website. This highly recognized, peer reviewed journal allows researchers, statisticians, mathematicians and methodologists from around the world to share research in the field of survey techniques and their practical applications.

The December 2016 online issue of Survey Methodology is now available. This issue contains six papers, two short notes and one corrigendum.

Regular papers
Tests for evaluating nonresponse bias in surveys
(Sharon L. Lohr, Minsun K. Riddles and David Morganstein)

A cautionary note on Clark Winsorization
(Mary H. Mulry, Broderick E. Oliver, Stephen J. Kaputa and Katherine J. Thompson)
Short notes

A note on the concept of invariance in two-phase sampling designs
(Jean-François Beaumont and David Haziza)


Note to readers

Published since 1975, Survey Methodology has been a dependable reference point entirely dedicated to the latest advances in the field of survey techniques and methodology used around the world. To increase access to this scientific research and for environmental reasons, Statistics Canada publishes the journal free of charge on its website.

Historical papers from Survey Methodology are now available online. Electronic copies of any paper published since December 1981 (Volume 7-2) can be obtained here. Papers from Volume 1 to Volume 7-1 can also be obtained by contacting us at

How to submit an article for publication

Authors are invited to submit their articles to the journal’s Editor by email. Articles must be submitted in English or French, preferably in Word with MathType for the mathematical expressions. For instruction on formatting, please consult the guidelines for manuscripts.

All papers are refereed. However, authors retain full responsibility for the contents of their papers, and opinions expressed are not necessarily those of the journal's Editorial Board or of Statistics Canada.

Beware of hackers and fake websites. The journal Survey Methodology will never ask for a publication fee, nor any financial information from authors. Pirate website fraudulently representing the journal have recently been identified. Be careful. Do not hesitate to contact the editor in chief directly

Subscribing is easy and free

Would you like to receive an automatic alert by email when new issues of Survey Methodology are available on Statistics Canada’s website? Go to

Login or Register.
Once inside the My StatCan portal, select ‘Email notifications.’
Under the ‘Publications’ tab, choose the subject ‘Statistical Methods,’ then add the item ‘Survey Methodology (12-001-X)’.

For more information, visit or contact us.

Don’t hesitate to share this email with anyone who might be interested in Survey Methodology.

Note: You are receiving this email because you have previously shown interest in Statistics Canada data, reports, analysis or conferences.

Thursday, December 22, 2016

Today's Release - HES 2015

Households and the Environment Survey (HES)

HES data for 2015 are now available on the EFT site.

The Households and the Environment Survey (HES) measures the environmental practices and behaviours of Canadian households that relate to the condition of our air, water and soils. The survey was also designed to collect data to develop and improve three key environmental indicators: air quality, water quality and greenhouse gas emissions.

Eft: /MAD_PUMF_FMDG_DAM/Root/3881_HES_EME/2015/

Wednesday, December 21, 2016

CSDs by MIZ classification codes

Is anyone able to point me to how to find the MIZ classification code for all CSDs (those outside CMAs and CAs) in Canada (2011)? 

Before I re-install Geosuite, is this product able to attach the MIZ category codes? Or, does this require a custom tabulation request?

I just checked GeoSuite 2001, which I downloaded to answer a question, and it includes a SACType field ( 

 So, I'm assuming that the more recent GeoSuites also include this data.

2016 Census Public Use Microdata Files (PUMFS) and Research Data Centre File Release Dates

To those inquiring about the release schedule for 2016 Census Public Use Microdata Files (PUMFS) and Research Data Centre File, the release dates are as follows:

1. 2016 Census Long Form Census RDC File April 30, 2018
2. 2016 Census Individuals PUMF December 18, 2018
3. 2016 Census Hierarchical PUMF June 18, 2019

PCCF and PCCF+ Analysis

Does anyone know if the PCCF and PCCF+ files can be analyzed using R?

PCCF is an ASCII text file and can be read by any statistical program environment, including R.

PCCF+ consists of many files processed by a SAS program that would require a major re-write to use in R, although the output from the SAS program includes ASCII text files that could be used directly in R.

Friday, December 16, 2016

2016 Census microdata

Given the current schedule, what is the best estimate of when one would either be able to order a custom tab from the 2016 Census of Individuals, or get access to the master file via the RDC?

Here is the link to the 2016 Census Project release schedule:

Subject matter has confirmed that custom tabulations will be available after these release dates. The first data release is scheduled for February 8, 2017, and will be population and dwelling counts.

In regards to your question on accessing the masterfile via the RDC, various factor such as the size of data files, urgency of the data, documentation and vetting rules availability affect the turnaround time for Masterfiles. The average time could be between 2 days to 2 weeks for a survey that has all factors mentioned.

Wednesday, December 14, 2016

DLI Training Repository: DLI webinar on the SPSD/M!

The session materials from the DLI webinar on the SPSD/M is now available on the DLI Training Repository:

For more information on the SPSD/M, please see

Tuesday, December 13, 2016

Enumeration area maps for historical censuses

what are the odds of finding enumeration area (EA) maps for the 1961 and 1971 censuses?

Specifically, I'm looking to determine which rural fringe EAs correspond to the areas later amalgamated into Timmins but which were not considered part of the CA/major urban centre in each of those census years.

It would be great to inventory paper and digital maps, it is on our radar for next year as part of the older census boundaries project. Some good news though, you may soon be able to get 1971 Census EAs in digital boundary form. Statistics Canada will be releasing this in the New Year as part of a data restoration project Giuseppe Filoso is working on (cc’d on this message).

For now, you can see some maps available through the Internet Archive, and population centre points for EAs in the GeoPortal, but I’m not sure if they cover the rural areas around Timmins.

1961 Census Reference Maps (Major urban areas)

1971 Census Geographic Attribute File (EAs population centre points)

The Statistical Registers and Geography Division has confirmed that they are working on a project to convert and update historical data holdings. It’s a work in progress and is currently prioritized by requests.

Thursday, December 8, 2016

CIS (Canadian Income Survey) Available as a PUMF?


A grad student here seeking to obtain a PUMF for the CIS (Canadian Income Survey). It appears the CIS is done alongside the LFS and has occurred annually since 2012.

I’ve check NESSTAR and the DLI-EFT and didn’t come across CIS. It does appears that there are data tables from CIS included in CANSIM but I just wanted to check to see if I’ve missed any available PUMF for it.


“The release estimate at this point is either JAN or FEB 2017” for the CIS PUMF."

Wednesday, December 7, 2016

Percentage of urban areas in Montreal, Toronto and Vancouver CMAs

I have been asked if I could provide the percentage of urban areas (in square km) for the Montreal, Vancouver and Toronto CMAs. My understanding is that this would translate, in terms of the 2011 Census geography, in the area covered by Population Centres within those CMAs.

I’m not sure if those numbers are readily available, but I thought I could calculate them using GeoSuite 2011 by:
  • Generating a list of POPCTR areas for each CMA, making sure to exclude any rural area (POPCTRRAclass = 1)
  • Adding up the values of the POPCTRRAarea field for all population centres
  • Dividing that result by the area of the CMA
Can someone confirm that is an accurate method for calculating the % of the area covered by urban/population centres within a CMA?

GEO has provided the following response:

“We talked with subject matter on this. The methodology presented by the client is reasonable. The two things to remember are:
  • PopCentres do not overlap CMA boundaries
  • There may be multiple PopCentres of different types (Primary, Secondary, Fringe) within a CMA
So, removing the Rural Area PopCentres and calculating land areas with the remaining seems to be a valid methodology.”

Intellectual Disability Data Request

The researcher is looking for data sets that link intellectual disabilities (not just mental health) and substance use treatment?

The Canadian Community Health Survey has a Healthy Aging component (C2008-2009). Specifically it has the variable CR1_01A, which asks “During the past 12 months, did you receive short-term or long-term professional assistance at home ...?”. The questionnaire further clarifies that professional assistance, means help from paid workers or volunteer organizations, which include assistance received because of physical, mental or cognitive health problems or limitations or because of aging. Furthermore it does deal with alcohol and drug abuse, specifically variables CIH_2, CIH_3, CIH_8L, which asks respondents about ways to improve their health, with options such as receiving medical treatment.

Tuesday, December 6, 2016

Social Policy Simulation Database and Model (SPSD/M) V22.3 is now available

The DLI is pleased to announce that version 22.3 of the Social Policy Simulation Database and Model (SPSD/M) is now available. For an overview of the SPSD/M Database and Model product, see here.

This is a model update – the database remains unchanged and remains based on the 2010 survey data from the Survey of Labour and Income Dynamics (SLID). As part of your license agreement for version 22.0, you are entitled to this model update. Depending on how you received version 22.0 of the SPSD/M, you will either be sent a CD-ROM or an email indicating how to download the file through the Electronic File Transfer facility.

This update includes the following:

• New parameters for the Working Income Tax Benefit (WITB) have been included. As these parameters have provincial dimensions, the parameters for Quebec, Alberta, and British Columbia are no longer necessary.
• The Quebec Health Contribution is eliminated as of January 1, 2017.
• The Canada Child Benefit is indexed as of July 1, 2020.
• The calculation of the amount of the Canada Child Benefit when the calendar year flag is on (FCBENCAL) has been corrected.
• A new parameter has been included for the non-refundable tax credit rate for tuition fees in the calculation of provincial taxes.

Please inform the DLI team if there are any issues.

Friday, December 2, 2016

New files on Statistics Canada Nesstar

We are pleased to inform you that the following are now available on the Statistics Canada Nesstar WebView site (

Labour Force Survey (LFS), 2016 – October
Labour Force Survey (LFS), 2016 – November
Employment Insurance Coverage Survey (EICS), 2014

And more to come!

To access the microdata housed in the Research Data Centres (RDCs), researchers must submit a project proposal to the Social Sciences and Humanities Research Council (SSHRC) and Statistics Canada.
Please contact me if you have any questions or comments.

Today's Release - LFS - November 2016

Labour Force Survey (LFS) – November 2016

LFS data for November 2016 are now available on the EFT site.

This public use microdata file contains non-aggregated data for a wide variety of variables collected from the Labour Force Survey (LFS). The LFS collects monthly information on the labour market activities of Canada's working age population. This product is for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products
The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.

Eft: /MAD_PUMF_FMDG_DAM/Root/3107_LFS_EPA/1976-2016/data/


We are pleased to inform you that the following are now available on the Statistics Canada Nesstar WebView site (

Labour Force Survey (LFS), 2016 – November


Wednesday, November 30, 2016

LFS Microdata

Is it possible to access LFS microdata pre-1976.
If so, how far back can we go? As far as 1945?

“Regrettably, the microdata files are solely available going back to 1976, inclusively; nothing before that.”

Household Pets

I have some researchers looking for information on household pets both nationally and in Sarnia/Lambton. They would like to know:

- Stats related to what kind of pets people have and how many in each household
- Age related statistics of who is likely to adopt a pet
- Household expenditures on pets, pet supplies, and veterinary services broken down by age group

I was able to find information on the expenditures so far but I’m coming close to the deadline and am still searching for the rest.

I found this on the Does Statistics Canada collect this information? page that may be of some assistance:

“Pets, cats, dogs, stray animals, animal cruelty
Statistics Canada does not collect data on the number of pets (including cats and dogs) or stray animals in Canada. We do not collect data on animal cruelty or animal testing.

Statistics Canada has information on the following:
  • pet expenses as a part of household spending
  • livestock/farm animals
  • aquaculture/fish farming
  • ethical consumption
Some resources include
2011 Census of Agriculture
ARCHIVED – Spending patterns in Canada (See Tables 1, 2 and 4)
ARCHIVED – Ethical consumption

See also
For more information, see the following subjects:
Livestock and aquaculture
Food and nutrition
Natural resources

Unfortunately, you may be out of luck, as far as STC goes. On the page
it states "Statistics Canada does not collect data on the number of pets (including cats and dogs) or stray animals in Canada."

This might be of use to you:
Consumer Corner: Canadian Pet Market Outlook, 2014$department/deptdocs.nsf/all/sis14914

Files with pet information (not going down to the Sarnia/Lambton unless go to the RDC, and then only MAYBE):
General Social Survey, 2010 [Canada]: Cycle 24, Time-Stress and Well-Being. Main file: 4 edition

Household Internet Use Survey, 2003 (and 2002 and 2001) [Canada]


National Population Health Survey, 1994-1995 [Canada]: General file (and Health and Supplements files)

What kind of pet - grouped

Tuesday, November 29, 2016

Gasoline consumption (in litres) for the Northwest Territories

A user is looking for gasoline consumption in litres by province/territory, which they found in table 405-0002 They need these statistics for 1996 to 2005 for all provinces & territories. However, we notice that the Northwest Territories series was terminated in 1998.

Do you know of any other source for gasoline consumption data for the Northwest Territories?


“For the Gasoline survey (Cansim#405-0002), it is the combined series NWT+NU that were terminated in 1998, since the creation of NU territory in 1999, data are split for NWT and NU, and are available on Cansim 405-0002.

Another source they can consult is RESD Cansim#128-0017: Energy use, final demand: RESD uses AEND for the final demand breakdown (Annual End use of Refined Petroleum Products: survey #2168: this survey is not published separately on Cansim ); most of the data on RESD are either confidential or not available. In RESD (AEND) that will be the sales by refiner/major distributor that will be reflected: AEND is called “Annual End use of RPP” but it is from the refiner/major distributor surveyed side: what they sold to the different industries which not what is actually consumed by those industries.

When comparing MRPP domestic sales of motor gasoline to Gasoline survey gross sales of gasoline : MRPP sales are lower because some motor gasoline sales might be done through secondary distributors (those distributors are not surveyed under MRPP). On the other hand MRPP diesel sales are higher than diesel sales reported in the Gasoline survey because Gasoline survey include only Taxable diesel sales to NWT customers while MRPP estimates include net domestic sales to NWT, higher diesel estimates for MRPP are caused by the inventories held (fuel purchased but not sold) by fuel distributors in the NWT.”

Friday, November 25, 2016

Proxy access to WDS and DLI Nesstar

Is it possible to configure proxy access to the WDS and Nesstar, or is access by IP recognition only?

Yes you can set up a proxy access.

These would be the URLs to include in your proxy settings:




Do we have access to SPSD/M through the DLI? If so, what's the process for gaining access?

In October, through individual communication send to DLI Contacts, the Data Liberation Initiative (DLI) team was pleased to announce that the Social Policy Simulation Database and Model (SPSD/M) is now available through the DLI. For an overview of the SPSD/M product, see here.

Membership in the DLI is subject to the DLI Licence Agreement. The SPSD/M will be governed under a separate licence agreement as an appendix to the DLI Licence Agreement. These documents were circulated in personalized messages to DLI Contact email addresses.

If you are interested in obtaining access to this product, we ask that the library director (or the authorized signatory) sign the agreement and return a copy to the DLI. We will accept a scanned signed (electronic) version of the Appendix IV licence agreement for the SPSD/M.

Access to the product will be granted under a new safe in the DLI’s electronic file transfer (EFT) safe.

Geography for Disability Data


A faculty member here is interested in mapping disability information for southern Ontario (ideally, the GTA). What's the smallest level of geography they can hope for? The metadata for the Canadian Survey on Disability is not available, so I can't check there; the mater file for CCHS lists 2006 DA as a data element, but I can't tell if that applies to every other variable.


"It would have to be clarified, depending on which variables the client wants, it may be possible. If they want overall disability rate is it possible to cover the GTA, that should be fine. The issue is always the size of the sample specific to your level of geography and data attributes as well as how many cross tabulations you are doing."

Thursday, November 24, 2016

Survey of Approaches to Educational Planning 2013

I just had a query about the Survey of Approaches to Educational Planning (SAEP) 2013 ( I realize that the survey is available via the RDC, but are there any plans to make a PUMF available?

We have received the following response from subject matter:

"At present, we do not have any plan to make the survey data available via PUMF as there has been no budgetary provision made for it. We will keep you inform of any management decision regarding the provision of a SAEP PUMF."

Wednesday, November 23, 2016

Financial Data

I have a user who found industry Canada’s, financial performance data resource

They loved the data they found there, but it only cover recent years (2014)
They are now looking for similar historical data. More precisely, they are seeking information such as gross margins, net margins from 1997 to 2009 in respect of companies with revenues between $5M and $20M for the following NAICS codes:

Codes SCIAN :

Where might I go to find this?

I believe the Financial Performance Indicator files are on the EFT site: /MAD_DLI_IDD_DAM/Root/other_autres/2510_FPICB_IPFEC. 

 Also on the WDS under "Business and Financial". I haven't used these for a while. They were (are?) CDs, and I think you will have to install them on your machine.

Tuesday, November 22, 2016


The Canadian Alcohol and Drug Usage Monitoring Survey (CADUMS) has an metadata entry in NESSTAR but not data to download. I can not find the survey on the EFTP site either.

I have a users looking for the 2011 CADUMS pump and possibly other years as well. So is it possible to get this PUMP (and a SAS file for it)?

Thank you for bringing this to our attention. I see that previously only the metadata was published. We will work on updating these files so that users can also download the data.

We have already republished 2011. We will add 2012 CADUMS to our priority list.

Upon further investigation, I see that the survey was conducted by Health Canada, which is why it would not be located on the DLI EFT site.

The original files are found on the mirror site hosted at the University of Alberta here:

Monday, November 21, 2016

Updated Products: EICS 2014 and 2015 V2

Please note the updated products listed below and the path to access them via the EFT site.

Employment Insurance Coverage Survey (EICS) 2014 and 2015

EICS data for 2014 and 2015 is now available on the EFT site.

The Employment Insurance Coverage Survey provides a meaningful picture of who does or does not have access to EI benefits among the jobless and those in a situation of underemployment. The survey also covers access to maternity and parental benefits.

Eft: /MAD_PUMF_FMGD_DAM/Root/4428_EICS_ECAE/2014
Eft: /MAD_PUMF_FMGD_DAM/Root/4428_EICS_ECAE/2015

Saturday, November 19, 2016

RDC analyst access to documentation

The RDC analyst here asked me for the documentation for the NLSCY synthetic files from cycles 5-8. She has a researcher who would like the documentation to prepare code outside the RDC and take it in. I was surprised that an RDC analyst would be asking for this on behalf of a researcher rather than directing the researcher to me and I suggested that if the researcher is affiliated, that he/she should get in touch with me. The response I got back made it seem like the analyst might be preparing the code for the researcher. In re-reading her two e-mails, perhaps she was just using the first person to refer to her own experience before she was an RDC analyst as an example, but assuming she does want to get the documentation for herself I wanted to double check.

As you might imagine I'm not concerned about the level of service she provides to her researchers in general, but I am wondering  whether there have been any changes in how we deliver DLI services on a campus that also has an RDC. In the olden days there seemed to be a pretty tall wall between DLI and RDC data (I'm not talking about documentation here - just data files). Is it still okay to give the documentation to the analyst or because these are synthetic files do I have to confirm that the researcher is affiliated even for the codebooks, etc?

It's very likely that the researcher will be affiliated and that the analyst is simply being really nice in providing assistance to the researcher, but it's also possible that he or she might be from outside the institution because the RDC does allow non-institution researchers. As such, I'm reluctant simply to point to SDA without having determined a little more about the situation. If she ends up wanting the data as well I can remind her of the DLI conditions of use with the synthetic data (authorized user/authorized use), although I expect she's aware of this.

There are about a thousand other nuances to this, but my main question is whether I am allowed to grab the documentation for the analyst just like I might for any pumf and any other researcher (not whether there might be a better way, although I'm also interested in this).

I feel like the way I've posed my questions/comments makes it seem like I'm being territorial, but this is not the case. I really just don't know what the protocol is in this case. I kind of have it in my head that the RDC analysts have everything at their fingertips and therefore it comes as a surprise when I find out that this mightn't be the case.

In regards to “whether I am allowed to grab the documentation for the analyst just like I might for any PUMF…” it’s suggested that you ask the analyst to have the researcher contact you directly. That way you can determine if the researcher is eligible for the services of the DLI. The synthetic files are not available in the RDC so the researcher should seek them out on her own, or through your library as a DLI member. She can produce code and bring it in the RDC provided her research proposal has been approved.

To further add, a DLI librarian should not be asked by the analyst for RDC materials. An analyst is to direct the researcher to the DLI if DLI services are required for their analysis. Also, DLI material is not placed on RDC servers, therefore, there should be no need to even ask DLI to grab documentation.

Wednesday, November 16, 2016

EICS 2015

Employment Insurance Coverage Survey (EICS) 2015

EICS data for 2015 is now available on the EFT site.

The Employment Insurance Coverage Survey provides a meaningful picture of who does or does not have access to EI benefits among the jobless and those in a situation of underemployment. The survey also covers access to maternity and parental benefits.

Eft: /MAD_PUMF_FMGD_DAM/Root/4428_EICS_ECAE/2015

Monday, November 14, 2016

Machinery and equipment price indexes

An Economics researcher is looking for the background sampling framework and the underlying detailed data for the Machinery and equipment price indexes, see IMDB page:

The commodity (commodity 30) in question is Non-electric furnaces and heating equipment.

His first question is whether the price indexes for this commodity include the residential sector at all (in full or in part)?

If residential furnace prices are covered, then his second question is availability of the most detailed data, that is:

1) prices over time for natural gas furnaces by
2) energy efficiency rating (AFUE) of the furnace, and this may be dreaming in technicolour, but is not necessary
3) why the customer replaced their furnace?


“No, the residential sector is not part of MEPI. The index provides quarterly estimates of price changes for machinery and equipment purchased by industries in Canada.”

Thursday, November 10, 2016

Prison Farms

Are there any statistics available on what/who makes up the prison farm population in Canada, both recent and historically? I have found some publications on the Correctional Services website but not statistics.

Followup Response
I had an extremely similar question two years ago!  I'll dig around for what I found, and share with you that info.

Depending on the province -and the actual stats you are after, - the Annual Reports from each ministry or department of Corrections or Prisons or Attornies General, can give you some neat info, too!

Interesting question! I admit I had to look up what a prison farm was. After some searching, I came across the following reference on the statcan website:

Statistics On Penal Institutions, Pardons, and Parole (Series Z173-226)
General note
There have been several distinct types of penal and detentive institutions in Canada: penitentiaries; provincial prisons; other provincial institutions such as reformatories, industrial farms, and training schools for juveniles; and municipal jails. With the evolution of penal and correctional philosophy, nearly all reformatories and industrial farms and training schools have gradually been phased out in recent years. Since municipal jails serve mainly as temporary detention centres rather than as places where convicted persons serve their sentences, the principal penal institutions are penitentiaries and prisons.

Under the BNA Act, the federal Government of Canada has exclusive responsibility for establishing, maintaining, and managing penitentiaries. Penitentiaries are the penal institutions where convicted persons sentenced to two years' imprisonment or more serve their terms. Provincial prisons are under the jurisdiction of their provincial governments; persons sentenced to less than two years' imprisonment serve their terms in these institutions. (Since 1972, however, there have been federal-provincial transfer agreements under which persons with sentences of less than two years may in certain circumstances serve their term in a penitentiary and some persons with sentences of two years or more may serve their terms in provincial prisons.)

Statistics on provincial prisons (or jails) and other penal institutions have been published since 1918 by Statistics Canada in the annual reports, Statistics of Criminal and Other Offences until 1955, and in Correctional Statistics thereafter. A program for uniform reporting of statistical data specifically on training schools was inaugurated in 1963, with 1964 being the first complete year for which data were available. Publication of statistics continued until 1974; data were accumulated for several years following without being published, and the program was ultimately terminated.

That being said, it would seem that perhaps the ministries collect this information.

LFS through the DLI - Economic Regions

I note the geography for the LFS through DLI[Nesstar] is limited to province and the 3 largest CMAs, however CANSIM includes a table [282-0124] with geography for the ERs.

I thought this would have been addressed before so I did look at the DLI archive and training repository, but find nothing addressing the fact that the LFS in Nesstar does not includes a breakdown by ER.

That is correct, in the Public Use Microdata File (PUMF) (the file on nesstar), the geographic variables are limited to the province and the 3 largest CMAs. Included in the PUMF you also have a wealth of other variables to choose from. Microdata enables a user more control over the variables offered for analysis. Whereas, the tables in CANSIM have limited variables for which a user can manipulate.

The variables in an aggregate data file (table on CANSIM) do not lend themselves to generating cross-tabulations of individuals since the initial unit of observation has been replaced by time (eg: time series data), geography (eg: census / geography) or a social construct (eg: cause of death – HID).

Modifications are performed to a PUMFs to anonymise the file, including combining variables, limiting categories, removing variables (example with Geography), etc, that will reduce the risk of users identifying a respondent.

More information can be found in section 4. Data in the DLI Survival Guide.

Wednesday, November 9, 2016

CTADS 2015

Canadian Tobacco, Alcohol and Drugs Survey (CTADS) 2015

CTADS data for 2015 are now available on the EFT site.

As of 2015, new content, covering alcohol use as well as prescription and non-prescription drug use, was added to the Canadian Tobacco Use Monitoring Survey. Therefore, the survey is now entitled Canadian Tobacco, Alcohol and Drugs Survey (CTADS).

The overall objective of CTADS is to provide regular and reliable data on tobacco, alcohol and drug use and related issues, with the primary focus on 15- to 24-year-olds.

Definitions, data sources and methods: survey number 4440.

The Canadian Tobacco, Alcohol and Drugs Survey (CTADS) is a biennial general population survey of tobacco, alcohol and drug use among Canadians aged 15 years and older. It replaces the Canadian Tobacco Use Monitoring Survey (CTUMS), which was conducted from 1999 to 2012, and the Canadian Alcohol and Drug Use Monitoring Survey (CADUMS), which was conducted from 2008 to 2012. The CTADS merged the core tobacco content from CTUMS and the core drug and alcohol content from CADUMS, resulting in more efficient data collection and providing a tool to monitor and compare the use of multiple products and substances with addictive properties. The CTADS is conducted by Statistics Canada on behalf of Health Canada.

Understanding Canadian trends in tobacco, alcohol and drug use is vital to the effective development, implementation and evaluation of national and provincial health strategies, policies and programs.


Monday, November 7, 2016

Concussion statistics

I have a student looking for the number of sports-related concussions sustained by 18-24 year olds (university-aged) in Canada. We’re finding concussion numbers (not specifically sports-related) or sports-related injuries (not specifically concussions), and we cannot find the age group she wants. I think the Canadian Community Health Survey would have this data, but would something this specific be a custom tab?

I did find some CIHI data that might be close enough, but I wanted to check with others in case I’m missing something.

From subject-matter:
Since this is DLI, I assume the student is using the PUMF. For 18-24, they could use DHHGAGE=3 or 4 (18-19 and 20-24). However, for injuries, the type of injury that represents concussions has been grouped with another category due to potential disclosure risks on certain categories. If they look at INJG05=8,this will represent concussion, brain injury or other internal injury. And for sports, they’d have to look at INJG092=1 (injured while doing sports or exercise).

If they don’t want internal injuries grouped in, then it would have to be a custom tab. In that case, you’d use INJ_05=8 (concussion or brain injury) and INJ_09=1 (injured while doing sports or exercise).

One important thing to explain to the client is that the INJ variables represents the most serious injury reported by the respondent, not all injuries in Canada. Each of those people who reported a different injury could have also had a concussion, but since it was not the most serious of the year, we don’t have data for it. So the INJ content gives you a rough idea of the injury rate.

From production:
If the client wishes to have a custom tabulation, we recommend using a two years estimate (2013-2014) as the sample size for a single year is quite small and reduces the reliability of the data.

Cost estimate : $189.00 plus applicable taxes
Delivery time: 10-15 business days.

Friday, November 4, 2016

Notification of changes to the Harmonized Commodity Descriptions and Coding System (HS)

We’ve recently made changes to descriptions of import and export commodities in the Harmonized Commodity Descriptions and Coding System (HS). The purpose of these changes is to mirror international standards and clarify commodity descriptions. No changes to the HS codes themselves have been made at this time.

Furthermore, we would also like to advise our clients of the upcoming 2017 World Customs Organization (WCO) 6-digit Harmonized System updates. Updates up to and including the 6-digit HS are made every 5 years by the WCO with the next scheduled update set for January 2017. The structure of the classification, the codes, and the descriptions will be made in order to reflect international standards and requirements. It should also be noted that Statistics Canada has reduced the number of low value import and export HS codes. These changes will be reflected in the March 2017 release of Canada’s merchandise trade data.

Upon request, starting in January 2017, Statistics Canada will make available a concordance between the 2012 6-digit HS and the 2017 6-digit HS classification codes, as well as concordances between the 2016 and 2017 detailed levels on Export (HS8) and Import (HS10).

LFS - October 2016

Labour Force Survey (LFS) – October 2016

LFS data for October 2016 are now available on the EFT site.

This public use microdata file contains non-aggregated data for a wide variety of variables collected from the Labour Force Survey (LFS). The LFS collects monthly information on the labour market activities of Canada's working age population. This product is for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products
The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.

Eft: /MAD_PUMF_FMDG_DAM/Root/3107_LFS_EPA/1976-2016/data/

We are pleased to inform you that the following are now available on the Statistics Canada Nesstar WebView site (

Labour Force Survey (LFS), 2016 – October

Wednesday, November 2, 2016

Annual Retail Trade Survey & Retail Store Survey Firm-Level Data

I’m working with a researcher interested in the Annual Retail Trade Survey (2447) –formerly the Annual Retail Store Survey (2446) & the Annual Survey of Manufactures and Logging (ASML) –formerly the Annual Survey of Manufacturing and Logging Industries (ASML) formerly the Annual Survey of Manufactures.

Would firm level data or microdata be available by custom tab?

Firm level data would not be available as a custom tab, as:

Statistics Canada is prohibited by law from releasing any data that would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization.

Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. As such, you will find tables that are available for the ARSS here:
Summary tables
Analysts and researchers

The list of chain is the only detailed information available (because we have waivers with the companies asking for their permission to divulge this information), no other data at firm level is available.

Here is what she is referring to
2014 Annual Retail Trade Survey – Partial List of Retail Chain Stores

International Merchandise Trade Data by Commodity Classification, 2009

There is a researcher here on campus using the "International Merchandise Trade Data by Commodity Classification, 2009". We were noticed for some rows zeroes are listed for quantity (position 33-45 in the row) and value (CDN$) (position 46-58) is not zero. It is strange for nothing to have a value. I have looked through the documentation here: But, it is not clear to me.


“CIMT is limited to HS6 level data and as such there are likely to be HS8 or HS10-level codes nested under any given HS6 code. The nested HS8 or HS10 codes may have different units of measure for each commodity they represent. Without a common unit of measure for all nested codes under a given HS6 code any quantity amounts cannot be represented at the HS6 level.”

Friday, October 28, 2016

WES Synthetic File


“A researcher is interested to know the process of generating WES synthetic files, especially for workplace surveys. Is there a document somewhere outlining that? She is trying to find out if the synthetic data have any resemblance with the master data, especially for descriptive analysis?”


Synthetic files contain dummy data and have the same record layout as the master files to enable researchers with remote access privileges to write and test their programs before sending them to Statistics Canada to be run against the actual master files.

“It is the researcher’s responsibility to ensure that their analysis programs run properly. To this end Statistics Canada provides them with synthetic files that they can use for development and testing of their programs. The synthetic files have the same format as the Master microdata files, but contain some artificial data and fewer records. The primary objective in the creation of the synthetic files is that the artificial data be consistent with the codeset and skip patterns in the questionnaire. This is important since realistic data are needed to test analysis programs properly. A secondary objective is to preserve, at least approximately, the marginal distributions of variables and the relationships between closely related variables from the Master microdata files.”

When creating a synthetic file to test programs it is necessary to have a minimum semblance of "realism". For example, one cannot test a program with a 10-variable regression model when there are only 9 observations in the dataset; another program may fail because the SDF has no observations with a given studied characteristic. But again, the data in the file is not real.

Thursday, October 27, 2016

Heat Related Deaths

(1)I am helping a graduate student who is looking for a very precise dataset which will probably require a custom tab.

She is looking for the daily number of heat-related deaths (I know there is a category for that in the Vital Statistics database) for the following dates:

· 10th-13th of July 2005,
· 15th of July to 2nd of August 2006
· 17th to 25th of July 2011

For the following areas:
· Montreal CMA
· Toronto CMA
· Province of Quebec
· Ontario
· All of Canada

The data should be broken down by:
· Sex
· Age

(1 - Revised) After talking to the researcher, she has reformulated her request hoping that data could be provided at a less granular level.

So, now, she would be is looking for the daily number of heat-related deaths (I know there is a category for that in the Vital Statistics database) for the following dates:

· July 2005
· July 2011

For the following areas:
​Montreal CMA
Province of Quebec

(1)Unfortunately the custom tab as requested is not feasible, please see response below from the Health Statistics production team:

This request is not feasible as specified. While there is no minimum for suppression in Deaths custom tabulations, the data has to be randomly rounded by 5 to abide to our confidentiality rules. In this particular case, the number of deaths linked to the ICD-10 codes provided is lower than 10, which would translate into meaningless strings of 0, 5 and 10 if we should go on with the request as specified.

(1 - Revised) From subject matter: The sample is still too small to provide any usable data. The number of deaths due to exposure to natural heat (X30) for the entire province is usually less than 10 a year. Which makes monthly data essentially random. The number of deaths due to artificial heat (Flames, fire and smoke-X00-X09) is closer to 50 annually. But monthly data is still essentially random numbers. It is even more so for the CMA of Montreal. We would not recommend to process this request considering the low numbers and the necessary random rounding.

Wednesday, October 26, 2016

CMHC Microdata


I have a researcher who is looking for information on rental prices that ideally comes with indicators of type of rental (one-bedroom, two-bedroom etc.) and location at the neighbourhood level. She indicates that rather than aggregates per neighbourhood (e.g. average rental price for Kitsilano in 2015), she wants the underlying data where the rental (or survey respondent as may be the case) is the unit of analysis.

I have pointed her to Table 6 "Private Row (Townhouse) and Apartment Average Rent by Bedroom Type" but she is interested in the underlying data if its available.

Is this available--? I realize this is a CMHC survey but they have partnered with Statistics Canada in the past.


Detailed information for the data from Canada Mortgage and Housing Corporation can be obtained from the contact below.

Status: Active
Frequency: Monthly
Record number: 7505

The data we have from CHMS is provided to statcan:

This data is obtained from Canada Mortgage and Housing Corporation and is provided subject to CMHC's End-Use License Agreement.

Tuesday, October 25, 2016

Water sources + health

I have a student researcher looking for data on access to water sources in Canada and wanting to link to health outcomes. Ideally, she would like to focus on indigenous communities; but she is really just deciding where to focus and the availability of data will influence her decision. Is it possible to obtain first nations data on water related issues at either the individual or household level?

I started looking at the Survey of Drinking Water Plants and the Households and the Environment Survey, but I’m not sure if I’m on the right track. (By the way, the links in Nesstar to the 2013 HES User Guide and the HES Data Dictionary do not work.)

Would the First Nations Data Centre have the information your student is looking for? I noticed on their data page there are some stats and visualizations about both water supply and some health indicators. If you need access to microdata there is a cost per access model. I’m not an expert on this particular resource by any means, but there is some contact information for the FNDC Coordinator on this webpage.

The Households and the Environment Survey does indeed have questions on water, however, it does not seem to contain many health outcome indicators on the file.

Have you considered the Aboriginal Peoples Survey? I came across the following table that pointed to the APS as being the source. The APS also has numerous health indicators and health status variables on the file.

In nesstar, you will be able to compare the variables for APS available on the PUMF vs the Masterfile.

The table is from the article: The Health of First Nations Living Off-Reserve, Inuit, and Métis Adults in Canada: The Impact of Socio-economic Status on Inequalities in Health

I did come across these resources:
Health Canada’s site: Drinking Water and Wastewater for First nations and inuit health

Related information links to:
Ensuring Safe Drinking Water in First Nation communities in Canada

Indian and Northern Affairs Canada (2003). National Assessment of Water and Wastewater Systems In First Nations Communities - Summary Report

Another (non-Statcan) source that might be useful is the First Nations Regional Health Survey from the First Nations Information Governance Centre (FNIGC).

​"The First Nations Regional Health Survey (FNRHS, or RHS for short) is the only First Nations-governed, national health survey in Canada. It collects information about on reserve and northern First Nations communities based on both Western and traditional understandings of health and well-being."

The survey has had several data collection cycles:

Pilot Survey: 1997
Phase 1: 2002/03
Phase 2: 2008/10
Phase 3: 2015/16

It has gathered data on water quality (see 2002/03 Questionnaire excerpt below) as well as many health-related variables.​ Some tables are freely available online from​ These seem to be frequency tables of individual variables, not cross-tabulations of multiple variables such as water quality and health variables. That might need to be ordered as a custom tabulation ($) from FNIGC. Not sure.

Sunday, October 23, 2016

National Student Survey of Student Engagement

I've had a question / request for the National Survey of Student Engagement which is not a Statistic Canada survey:

Does anyone know anything about acquiring results, preferably the most recent. (2014 survey year?)

The NSSE is administered for Canada and the US by Indiana University Bloomington. They administer the survey for the US and Canada, and this is where MacLean’s got the data from for the article that you included in your message.

You can poke around their website to find different data tools, but of note are the public report builder, and a copy of the Canadian surveys in English and French.

Wednesday, October 19, 2016

DLI Logo Images


The DLI Survival Guide table of contents indicates that there should be a downloadable version of the DLI logo somewhere, but I have yet to find it. Could it be made available, please?


We are in fact working with Dissemination to put back up the new graphic identifier.

In the interim, it is available through the DLI EFT site here

New files on Statistics Canada Nesstar

We are pleased to inform you that the following are now available on the Statistics Canada Nesstar WebView site (


Labour Force Survey (LFS), 2014 – January (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – February (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – March (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – April (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – May (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – June (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – July (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – August (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – September (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – October (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – November (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2014 – December (Rebased, 2011 Census of Population)
Labour Force Survey (LFS), 2016 – January
Labour Force Survey (LFS), 2016 – February
Labour Force Survey (LFS), 2016 – March
Labour Force Survey (LFS), 2016 – April
Labour Force Survey (LFS), 2016 – May
Labour Force Survey (LFS), 2016 – June
Labour Force Survey (LFS), 2016 – July
Labour Force Survey (LFS), 2016 – August
Labour Force Survey (LFS), 2016 – September

Public Master Files (metadata only)

Aboriginal Peoples Survey (APS), 2012
Canadian Internet Use Survey (CIUS), 2012
Employment Insurance Coverage Survey (EICS), 2013

To access the microdata housed in the Research Data Centres (RDCs), researchers must submit a project proposal to the Social Sciences and Humanities Research Council (SSHRC) and Statistics Canada.

PCCF/PCCF+ Usability Question

We have a researcher that has collected her own data where each entry has either a Postal Code or Full Address. This collected data spans from 2006 to 2013. They would like to use the NHS 2011, to get demographic information on a CSD level for this data. We know we need to use a PCCF file but, are unsure which one release to use due to the time span of the data (2006 to 2011).


Have you considered using the PCCF+ file?

I consulted a presentation, available on our Training Repository here, delivered by Health Analysis Division in 2015 regarding the differences of PCCF and PCCF+, and the presentation highlighted that:

“Consider using PCCF+ rather than PCCF-SLI if any of the following apply

4. The “vintage” of the postal codes on your file spans more than one census”
4: “Vintage” of postal codes

  • Postal codes on your file spans more than one census
  • PCCF+ assigns DA or EA from each census from 1981 through 2011
  • Useful for time-varying variables
The presentation also presents several case scenarios and which product to use – example of one of the case scenarios presented

Potential case strategies
Several geocoding scenarios are possible
1.Only postal codes available

  • Use PCCF-SLI or PCCF+ to assign geographic codes, etc…
2.Full street address available
  • Use address geocoding software (GIS)
  • Use PCCF-SLI or PCCF+ on postal code portion of address
3.Telephone numbers available
  • Reverse lookup to get postal code or address
  • Use 911 system maps to get location from address
The most recent PCCF+ is found on the EFT here

There is a read me file at the root of the safe that indicates the release date of the versions. Version 6C is the most recent.



A researcher here has 2 questions:

1. Pertaining to the PUMF for Cycle 3 of the National Longitudinal Survey of Children and Youth: the PUMF collection contains the Primary File and the 10-15 Year Olds file. It appears that the Primary File contains data collected from the Book 1 – Parent Survey questionnaire and the 10-15 years old file contains data collected from Book 2, specifically the questionnaires for the children. In Book 2 there is also a series of teacher questionnaires, but it is unclear what, if any, of that data appears in the PUMF. There are variables in the PUMF Primary File that relate to performance on academic tests for math, reading etc., which were presumably conducted by teachers, but it is very difficult to match any of that clearly to questions listed in the surveys in Book 2. Is there a clear way to determine which variables in the Primary File were collected from parents (PMKs) vs teachers? Is there additional data from the teacher surveys available through the RDCs?

2. Pertaining to the Canadian Health Survey on Children and Youth (2016 pilot) described on the Statistics Canada website. Do you know whether there are plans to release any of the pilot data? And, do you know whether this survey will be going beyond the pilot phase and if yes, what are the timelines for conducting the survey and/or releasing data, either as PUMF or via the RDC?


“We are planning to release the CHSCY pilot data to the RDC if the data quality is acceptable. We will make that determination by September 2017 at the latest. There are no plans to produce a PUMF with the pilot test data.

We have funding for three full cycles of the CHSCY. Data collection is tentatively planned for:

Cycle 1- October 2018-May 2019 (Data release ~February 2020)
Cycle 2- October 2021-May 2022 (Data release ~February 2023)
Cycle 3- October 2024-May 2025 (Data release ~February 2026)

There has been such tremendous interest in this survey that I am confident there will be funding announcements for additional cycles in the next few years. We are planning to put all CHSCY data in the RDC and RTRA system. There is no decision on whether a PUMF will be created for the full cycles. We will study the data from the first full cycle to evaluate the uniqueness of the records, required suppression and overall suitability for a PUMF before making a recommendation to the CHSCY Steering Committee. We will likely make this recommendation in spring/summer 2020 once we are more familiar with the data.

Thursday, October 13, 2016

Updated Products - ICO Q3 2016

Inter-corporate ownership (ICO) – Q3 2016

This product is a directory of corporate ownership in Canada that provides information on every individual corporation that is part of a group of commonly controlled corporations with combined assets exceeding $600 million or combined revenue exceeding $200 million. Individual corporations with debt obligations or equity owing to non-residents exceeding a net book value of $1 million are covered as well.

Ultimate corporate control is determined through a careful study of holdings by corporations, the effects of options, insider holdings, convertible shares and interlocking directorships.

The information presented is based on non-confidential returns filed by Canadian corporations under the Corporations Returns Act and on research using public sources such as Internet sites. Entries for each corporation provide both the country of control and the country of residence.


Wednesday, October 12, 2016

Marriages and Divorces by Conjugal Status


A student has found the number of married people by opposite sex and same sex from the 2011 and 2006 Census and is wondering if the number is available annually and at the provincial level. She is also wondering if data on divorce of opposite sex and same sex is available.

It appears that Justice Canada took over Vital Statistics from StatsCan and at that point, all that lovely marriage and divorce data disappeared


Thank you for your question on the number of married people by opposite sex and same sex.

These numbers are not available annually. The census, every five years, is providing numbers on these topics, and the 2016 Census data on opposite-same sex married people will be disseminated on August 2nd, 2017. 

No data is available from Statistics Canada on divorces as this program was terminated back in 2008.

For the Vital Statistics question, I can confirm this with subject matter but from what I gather, it appears that Statistics Canada underwent a strategic review in 2008. As such, Statistics Canada made the decision to discontinue the collection and dissemination of national marriage and divorce data. 

The Department of Justice Canada is responsible for collecting the divorce data through the Central Registry of Divorce Proceedings and, as such, has the divorce data ( Data on marriages are still being collected by the provincial and territorial vital statistics registrars.

Tuesday, October 11, 2016

Industrial Water Survey (IWS): any PUMF access?

I have a University of Toronto researcher who is interested in data obtained through the Industrial Water Survey (IWS).

She has looked at several CANSIM tables (I have the numbers)  that are using data from the IWS, but the tables do not provide enough detail for her either at the geographic level nor at the manufacturing subsector level. She is also looking for further breakdowns related  to water acquisition and public utilities costs.

I don’t see the IWS listed as a DLI product. Would it still be possible to get access to a IWS PUMF through DLI, or should the researcher contact Statistics Canada directly?

I received a reply from subject matter. It appears your client [has two] ways forward - Custom tabulation or application through the Canadian Centre for Data Development and Economic Research (CDER) 

If they are interested in custom tables it would help to have a clear understanding of the specific dimensions for the data request (e.g., what level of NAICS and geography) along with a list of variables. I’m assuming by the reference to the manufacturing subsector that the client is only interested in the manufacturing industry, is that correct?  I can pass this information off to subject matter for an estimate.

Should they wish to pursue access through CDER, they would be subject to CDER’s research application process and be expected access the files onsite at Statistics Canada head office. More information can be found here:

UCR Monthly Numbers


I have a researcher who is making use of the UCR but needs more up-to-date information. Is it possible to obtain monthly data--if so, how should she proceed?

Unfortunately, UCR data is not available monthly.

Response from subject matter: UCR is an annual release.

The 2015 data is here and was published in July. This data includes the entire 2015 year as the data reflected is annual. 2016 data will not be available until July 2017. Our 2015 RDC file has not yet been released.

Friday, October 7, 2016

Release - LFS - September 2016

Labour Force Survey (LFS) – September 2016

LFS data for September 2016 are now available on the EFT site.

This public use microdata file contains non-aggregated data for a wide variety of variables collected from the Labour Force Survey (LFS). The LFS collects monthly information on the labour market activities of Canada's working age population. This product is for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products
The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.

Eft: /MAD_PUMF_FMDG_DAM/Root/3107_LFS_EPA/1976-2016/data/

Thursday, October 6, 2016

PCCF SAS command file


A student is looking for a SAS command file for the latest PCCF. Is this available somewhere?


As per the release email from subject matter there is a readme file with:

SPSS code intended for users of the Postal CodeOM Conversion File (PCCF) was prepared by the Western Libraries Map and Data Centre, UWO by Vince Gray.

SAS code intended for users of the Postal CodeOM Conversion File (PCCF) was prepared by the Data Resource Centre, McLaughlin Library, University of Guelph by A. Michelle Edwards, Ph.D., MLIS.

You can find the syntax files in the following folder: /MAD_PCCF_FCCP_DAM/Root/2011/pccf-fccp/pccf_jun2013_fccp/doc/

The syntax can be used with other versions of the PCCF. As per subject matter: “all the same files exist for the current PCCF and the two previous versions.”

Wednesday, October 5, 2016

MLS information for Vancouver

A pair of students in the 4th-year Economics thesis course have settled on a topic for their year – they would like to investigate the impact of the 15% foreign buyers tax on the real estate market in Toronto. Following the methodology of a similar paper done on the Toronto real estate market a number of years ago, they have concluded that they will need data from the Multiple Listing Service (MLS) on each home sale.

Have you had experience dealing with the MLS on obtaining data of this type? 

Is there a contact to whom you can refer me? 
What is the most detailed information about the Vancouver real estate market that Statistics Canada could provide?


I've had a researcher successfully request information from the Canadian Real Estate Association (CREA), which runs the MLS and CREA does have a microsite for stats on the housing market ( and we were able to request additional data from the CREA stats team.

At that time (November last year), here were the terms of use:
Please note that our data cannot be published or redistributed. It can only be used for analytical purposes or to generate charts and graphs so long as the underlying data are not displayed and The Canadian Real Estate Association is sourced.

Ordinarily the cost would be $90 + HST but due to the Professor’s affiliation to the University there will be no charge for this his first data request. 

 Future requests will be billed at $40 base price + $5 per data series + HST. 

However due to the fact that the data cannot be redistributed the request will have to come from the professor. Confirmation that the professor will adhere to the terms of use outlined above is needed.

Here is the relevant real estate info available from statcan:
The lowest level of geography publicly available is at the CMA.

Residential Property Values
Record number: 5213

The Property Values Program produces residential property value estimates by province and territory, as well as by Census Metropolitan Areas (CMAs). This data series represents residential property values at current market price, by reference year. This includes all taxable and exempt properties.

Annual Survey of Service Industries: Real Estate Agents, Brokers, Appraisers and Other Real Estate Activities

Record number: 4706

This annual sample survey collects data required to produce economic statistics for the Real Estate Brokers industry in Canada.

Data collected from businesses are aggregated with information from other sources to produce official estimates of national and provincial economic production for this industry.

Survey estimates are made available to businesses, governments, investors, associations, and the public. The data are used to monitor industry growth, measure performance, and make comparisons to other data sources to better understand this industry.

Canada Mortgage and Housing Corporation

Record number: 7505

The Canada Mortgage and Housing Corporation (CMHC) produces a comprehensive database containing more than 14,000 housing series. From CANSIM you can download residential housing statistics on housing starts, completions, under construction and newly completed and unoccupied; vacancy rates; and mortgage information.