Thursday, December 11, 2014

Canadian Community Health Survey (CCHS) Mental Health 2012 PUMF

The Canadian Community Health Survey (CCHS) – Mental Health 2012 PUMF!

After the release of the CCHS - Mental Health data file, typographical revisions were made to the Derived Variable descriptions for Work Stress (WST).

The revisions are described on the EFT. Most changes only affect the variable descriptions and therefore the data are not impacted. However, the derived variable WSTDJST had an error in the specifications which also resulted in the variable being calculated incorrectly. Data users who are working with the WSTDJST variable are instructed to use the code provided in this Errata to correct the variable.

Eft: /MAD_DLI/Root/other-products/Canadian Community Health Survey-cchs/2012-mh-sm

We regret any inconvenience this may have caused you or your organization and thank you in advance for your understanding.

Wednesday, December 10, 2014

Updated Products: 2011 National Household Survey Public Use Microdata File

2011 National Household Survey Public Use Microdata File – Hierarchical File

This PUMF product provides access to non-aggregated data covering a sample of 1% of the Canadian households. It is a comprehensive social, demographic and economic database about Canada and its people, and contains a wealth of characteristics on the population. The file enables the study of individuals in relation to their census families, economic families and households. The geographic identifiers have been restricted to the provinces, the three territories grouped into a region called Northern Canada and selected metropolitan areas (Toronto, MontrĂ©al, Vancouver, Edmonton and Calgary) to ensure respondents’ anonymity.

This product, contains the data file (in ASCII format); user documentation and supporting information; all licence agreements; and SAS, SPSS and Stata program source codes to enable users to read the set of records. It is important to note that users will require knowledge of data manipulation packages (or software) such as SAS, SPSS or Stata to use this product.

EFT: MAD_DLI/Root/NHS_ENM/2011/PUMF-FMGD/hierarch

Monday, December 8, 2014

Transfer Payments

Question

I am currently working on a project related to Canadian Government's expenditures. I am trying to find annual data, preferably from 1976 to 2013, about government actual expenses on social policies, such as child related transfers to families, or public income support payments during periods of maternity and parental leave.

I found these tables from CANSIM (there is no data available from 1981-1988 or after 2009 when the series was terminated):

- Federal government and government sector revenue and expenditure, annual (Dollars), 1961 to 1980 <http://www5.statcan.gc.ca/cansim/a26lang=eng&retrLang=eng&id=3840022 &tabMode=dataTable&srchLan=-1&p1=-1&p2=9>

- Federal, provincial and territorial general government revenue and expenditures, for fiscal year ending March 31, annual (Dollars), 1989 to 2009 <http://www5.statcan.gc.ca/cansim /a26?lang=eng&retrLang=eng&id=3850002&tabMode=dataTable&srchLan=-1&p1=-1&p 2=9>

If you could advise anything further that would be great.

Answer

You may have noted in the CANSIM table 385-002 that the 385-XXXX series was compiled under the old Financial Management System framework. Statistics Canada is adopting the international standard Government Finance Statistics. More information on this can be found here:

- Moving from the Financial Management System to Government Finance Statistics - http://www.statcan.gc.ca/pub/13-605-x/2010001/article/11155-eng.htm.

See release notice of the GFS in the Daily - Canadian Government Finance Statistics, 2008 to 2012 (provisional):
<http://www.statcan.gc.ca/daily-quotidien/141119/dq141119a-eng.htm>

Note: The data sources, methods and concepts that underlie the CGFS-based data depart significantly from the Financial Management System (FMS)-based data previously published by Statistics Canada.

Available in CANSIM: tables CANSIM table385-0033 to 385-0039 <http://www5.statcan.gc.ca/cansim/a03?lang=eng&pattern=385-0033..385-0039&p2=31)>. 
Definitions, data sources and methods: survey number5218 <http://www.statcan.gc.ca/imdb-bmdi/5218-eng.htm>

The Add/Remove Data Tab has more options with respect to Statement of government operations and balance sheet. If the user wants more detailed information, if available, it would require a custom tabulation.

Friday, December 5, 2014

CSD geography changes (2011) corresponding to years 2002 - 2010

Question

I have a researcher looking for information on CSD geography changes (2011) corresponding to years 2002 - 2010

1) What sources she should work with to trace Census-to-Census CSDuid correspondences?

So far, we have come across the following document as a reference source: "Interim List of Changes to Municipal Boundaries, Status, and Names (geography products: geographic reference products)" <http://www5.statcan.gc.ca/olc-cel/olc.action?ObjId=92F0009X&ObjType=2&lang=en&limit=1>

We note that this document lists changes by CSD names and not by CSDuid’s (in tables 1 and 2. 
There seem to be two types of changes that would be relevant to her CSDuid matching work (re: 2011 CSDuid’s to the four CCHS cycles): 4 Dissolution and 5A CSD has annexed a complete CSD (could also include the annexation of another CSD part).

2) Is this correct, are these above-mentioned two changes the ones that would be relevant for CSD correspondence purposes? Are there others?

3) Could these tables be made available in a spreadsheet-friendly format with matching CSDuid’s, so that she can search by Remarks, e.g. Complete annexation or Now part of (dissolution), and then capture the relevant CSDuid’s? 

4) If not, should she create her own CSDname/CSDuid lookup tables from the PCCF’s? 

5) Would there happen to be an easier way to do this? Are there actual tables that would give the Census-to-Census CSDuid corresponding changes? 

Answer

1) That is correct, we don’t have an electronic Census to Census CSDuid correspondence file but we do track changes through this document. There are concordance tables showing changes to Census Subdivisions between Census years, however they only show those CSDs which have changed and is not a complete listing of CSDs for each Census year.
The links to these tables are found on the Concordances between classifications page of the Definitions, data sources and methods website: <http://www.statcan.gc.ca/concepts/con cordances-classifications-eng.htm>. Table 2 does contain the CSDuid. The link I provided above also includes the CSDuid.

2) Please refer to Table B within the desired Interim List; the bottom of the table outlines all CSD change codes. To best address this person’s inquiry about correspondence, are they asking only about geographic change? One could argue that name changes and changes to population counts also constitute a change.

3) Please refer the client to the link above and see if it meets her needs.

4) See above.

5) See above. 

Updated Products - Labour Force Survey (LFS) Nov 2014

Labour Force Survey (LFS) – November 2014

LFS data for November 2014 are now available on the EFT site.

The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.

The LFS guide: <http://www.statcan.gc.ca/pub/71-543-g/71-543-g2014001-eng.htm>

Eft: /MAD_DLI/Root/other-products/Labour Force Survey - lfs/1976-2013/data/micro2014-11.zip

Thursday, December 4, 2014

CCHS Bootstrap Weights

Question

A researcher is looking for the bootstrap weights for the 2003, 2005, and 2007 CCHS. Are these available? I did find the information in the Nesstar WebView for Cycle 2.1 (2003):

"Weighting - The principle behind estimation in a probability sample is that each person in the sample "represents", besides himself or herself, several other persons not in the sample. For example, in a simple random 2% sample of the population, each person in the sample represents 50 persons in the population. In the terminology used here, it can be said that each person has a weight of 50. The weighting phase is a step that calculates, for each person, his or her associated sampling weight. This weight must be used to derive meaningful estimates from the survey. For example, if the number of individuals who had a major depressive episode is to be estimated, the weights of survey respondents having that characteristic should be summed. In order for estimates produced from survey data to be representative of the covered population and not just the sample itself, a user must incorporate the survey weights into their calculations. In order to determine the quality of an estimate, the variance must be calculated. Because the CCHS uses a multi-stage survey design, there is no simple formula that can be used to calculate variance estimates. Therefore, an approximative method is needed. Coefficient of variation, standard deviation and confidence intervals can then be calculated from the variance. Thebootstrap re-sampling method used in the CCHS involves the selection of simple random samples known as replicates, and the calculation of the variation between the estimates from replicate to replicate. In each stratum, a simple random sample of (n-1) of the n clusters is selected with replacement to form a replicate. Note that since the selection is with replacement, a cluster may be chosen more than once. In each replicate, the survey weight for each record in the (n-1) selected clusters is recalculated. These weights are then post-stratified according to demographic information in the same way as the sampling design weights in order to obtain the final bootstrap weights. The entire process (selecting simple random samples, recalculating and post-stratifying weights for each stratum) is repeated B times, where B is large. The CCHS typically uses B=500, to produce 500 bootstrap weights. To obtain the bootstrap variance estimator, the point estimate for each of the B samples must be calculated. The standard deviation of these estimates is the bootstrap variance estimator. Statistics Canada has developed a program that can perform all of these calculations for the user: the Bootvar program."

Is the Bootvar program what he would need?

Answer

I have confirmed with the author division that Bootstrap weights are only available for the masterfile. I will review the information from the Cycle 2.1 with the codebook.

Wednesday, December 3, 2014

Requested List of 2001 Global Non-Response Rates by Census Tracts

Question

May we please request help locating a list of global non-response rates for 2001 Census Tracts? And in follow-up may we also request technical documentation on the global non-response rates threshold for suppressing Census Tract aggregate data (100% and 20%)? If this is not digitized, catalogue numbers would be great. A graduate student would like to know the reason for suppression of CT 5050054.00 data for a Geography thesis.

Answer


Have you consulted 2001 Census Handbook, Appendix B: <http://www.statcan.gc.ca/access_acces/archive.action?l=eng&archive=1&loc=http://www12.statcan.gc.ca/english/census01/products/reference/dict/appendices/app002.pdf>. Data Quality, Sampling and Weighting, Confidentiality and Random Rounding?

Within the appendix, it is noted: "Area suppression is the deletion of all characteristic data for geographic areas with populations below a specified size. The extent to which data are suppressed depends upon the following factors:
- If the data are tabulated from the 100% database, they are suppressed if the total population in the area is less than 40.
- If the data are tabulated from the 20% sample database, they are suppressed if the total non-institutional population in the area from either the 100% or 20% database is less than 40.

There are some exceptions to these rules.