Thursday, March 26, 2015

Updated products: EICS 2013

Please note the updated products listed below and the path to access them via the EFT site.

Employment Insurance Coverage Survey: PUMF 2013
The Employment Insurance Coverage Survey provides a meaningful picture of who does or does not have access to EI benefits among the jobless and those in a situation of underemployment. The survey also covers access to maternity and parental benefits.

Definitions, data sources and methods: survey number 4428   <http://www.statcan.gc.ca/imdb-bmdi/4428-eng.htm>

EFT: /MAD_DLI/Root/other-products/Employment Insurance Coverage Survey – eics/2013

Updated Products: APS PUMF 2012

Please note the updated products listed below and the path to access them via the EFT site.

Aboriginal Peoples Survey: PUMF 2012

This public use microdata file contains data for a wide variety of variables collected from the 2012 APS.

The APS is a national survey on the social and economic conditions of Aboriginal people (First Nations people living off reserve, Métis and Inuit) aged six years and older. It is designed to identify the needs of Aboriginal people and focus on issues such as education, employment, health, language, income, housing and mobility.

Definitions, data sources and methods: survey number 3250  <http://www.statcan.gc.ca/imdb-bmdi/3250-eng.htm>

EFT: /MAD_DLI/Root/other-products/Aboriginal Peoples Survey-aps/2012

Updated Products: CFCS 2014

Canadian Financial Capability Survey: PUMF 2014

The survey sheds light on Canadians' knowledge, abilities and behaviour concerning financial decision-making. In other words, the survey covers how Canadians understand their financial situation, the financial services available to them and their plans for the future.

The survey collects information on respondents' approaches to day-to-day money management and budgeting, longer term money management and general financial planning.

Definitions, data sources and methods: survey number 5159  <http://www.statcan.gc.ca/imdb-bmdi/5159-eng.htm>

EFT:

/MAD_DLI/Root/other-products/Canadian Financial Capability Survey - cfcs/2014

2012 Aboriginal Peoples Survey (APS)

Question

When will the 2012 Aboriginal People's Survey be released? January 2105 had originally been estimated, but it isn't available yet.

Answer

I just confirmed that the APS masterfile is available in the RDCs and that a PUMF will be available later this year, with no definitive release date set yet.

Canadian Financial Capability Survey

Question
I have a researcher interested in the 2014 Canadian Financial Capability Survey PUMF advertised here:

<http://www.statcan.gc.ca/daily-quotidien/150323/dq150323d-eng.htm?cmp=mstatcan>

It mentions it being available on request. I don’t see anything other than the 2008 file on the EFT site, and the DLI release dates page only mentions a 2013 file to be released this Spring. Is this something that has to be acquired outside of the DLI?

Answer

We received the file, but before placing it on the EFT, modifications are made, including staging and renaming files, and running it through a verification tool. The Data Coordinator is done staging the files and should be placing them on the EFT this afternoon. An announcement will be send to the list.

Wednesday, March 25, 2015

CANSIM Table 381-5000 (GDP at CMA)

Question

A student has found CANSIM Table 381-5000, which according to footnote 1 is a table “based on the research paper "Metropolitan Gross Domestic Product: Experimental Estimates, 2001 to 2009" by Mark Brown and Luke Rispoli,” so now of course would like past and more current years worth of data.

The Daily article (dated Nov. 10, 2014) on this report <http://www.statcan.gc.ca/daily-quotidien/141110/dq141110a-eng.htm> gives some explanation, including:

Traditionally, Statistics Canada has produced estimates of GDP nationally and by province, but not at the metropolitan level. With growing interest in measuring the dynamics of metropolitan economies, new experimental estimates of GDP by CMAs have been developed.

These include estimates of GDP over the 2001 to 2009 period for 33 CMAs and the non-metropolitan regions of the nine provinces with a CMA.

Are these the only years for which GDP at CMA is currently available? Will data for future (or past) years will be forthcoming at some point?

Answer


The subject matter has confirmed that:

The study mentioned in your correspondence was the result of a pilot project undertaken by our colleagues in the Economic Analysis Division. I do not think there is any data in this series that goes back before 2001 or is more current than 2009.

There is no set date for new data from the experimental project:

“We hope in the coming year to develop a set of more recent estimates, but no delivery date has been set for these.”

Tuesday, March 24, 2015

DOIs for Stats Canada and Co. Datasets

Question

While trying to work on solutions for research data identifiers (DOIs in our case), we have stumbled upon some unique issues with the Stats Canada datasets you are all familiar with.

In a nutshell, we have more than 27,000 data files that our ever amazing Paul Lesack is managing for four BC schools for licensed data in our Dataverse <http://dvn.library.ubc.ca/ dvn/>

As we are planning to assign DOIs to any Dataverse files on our end...we are wondering about the federal government datasets. These do not seem to have any unique URLs (e.g. handles or DOIs). If we assign DOIs to these files on our end and say five other schools assign DOIs to the same datasets, we will have a small army of DOIs floating around for the same data.

Working with NRC/CISTI as a Canadian DataCite agency, we seem to be the first ones to deal with this issue. Mind you, we also plan to assign DOIs to all our digital objects (CONTENTdm, DSpace, etc). Still, we are really interested to hear your thoughts.

I have recommended to CISTI to proactively approach Stats Can and Health Can (and others) about assigning DOIs to content they produce. Would you agree to this practice?

And please excuse me for my ignorance as I am not really a data librarian but trying to build a research data service for our large campus.

Responses

I would agree with you that CISTI should proactively approach STC and other gov. depts. about DOIs. It is an issue which was first mentioned a couple of years ago to DLI and which needs to be addressed.

You have hit the nail on the head when you state that different schools/repositories cannot all be assigning unique DOIs to the same STC dataset. At Scholars Portal we have discussed this before with respect to <odesi> and we, like you, had concerns with the idea of there being many DOIs floating around for the same data.

Personally, I have no problem with assigning DOIs to the locally hosted datasets, but then again these datasets will have numerous persistent identifiers depending on the hosting institution. I 
am guessing the DLI Nesstar server would not work at this stage for this? 

However, in regards to how DOIs work, it is my understanding that a DOI registered by Statistics Canada would resolve to a webpage maintained by them. So, any time this DOI were cited somewhere it would point back to Statistics Canada, which makes sense. Except what if you specifically wanted to point people back to the copy of the dataset housed in your Dataverse? Would you ever want to do that? For instance, we recently noticed that at least one third party (in the US) has registered at least one DOI for StatCan data: <http://data.datacite.org/10.6068/DP14A4B06A47153>. Not sure if this is just poor practice, or if it is something that will occur on a regular basis.

Dataverse allows us to host a variety of licensed and research data. Even data on social housing of dairy calves ​<hdl.handle.net/11272/10178>. This is the blurb on NRC's website re DataCite: "NRC is a founding member of DataCite and is its DOI allocation agent for Canada." I believe this is the direction many are going with respect to DOIs and assignment for multiple iterations/versions/copies of data. However, there needs to be some authority record or reference identifier to properly identify data sets and the study.

At last year’s IASSIST conference there was a panel discussion on data discovery that mentioned the use of data set identifiers and I believe this issue was discussed by the panelists <http://www.library.yorku.ca/cms/iassist/program/sb4/#sb4o>. Many data organizations host the same data sets (in terms of data ‘s content), but the iterations/copies to go for access are different (UBC hosted, vs. <odesi>, vs. DLI etc.). There is no metadata clearinghouse for data sets, DataCite being the closest thing.

I think there could be a practice in place where the original data producer assigned the DOI for the data set, and that DOI reference was carried forward into other iterations of the data set for reuse, with some customization of the identifier for different access points. Likewise, I believe there is a way to customize the DOI to include reference to other identifiers such as STC catalogue #s or the IMDB, etc., in cases where there is no DOI. For example the ISBN > DOI integration <http://www.doi.org/factsheets/ISBN-A.html>.

However, we need to be very careful with regard to assigning DOI especially where we may be assigning multiple DOI identifiers to the same object. Such a practice is discouraged by the doi.org.  The doi Handbook <http://www.doi.org/hb.htmlhttp://www.doi.org/hb.html> states:

"Each DOI® name is a unique "number", assigned to identify only one entity. Although the DOI system will assure that the same DOI name is not issued twice, it is a primary responsibility of the Registrant (the company or individual assigning the DOI name) and its Registration Agency to identify uniquely each object within a DOI name prefix.

Uniqueness (specification by a DOI name of one and only one referent) is enforced by the DOI system. It is desirable that two DOI names should not be assigned to the same thing."

Likewise, in regards to republished or duplicate datasets:

"We strongly recommend that DOIs be created only for ‘original’ datasets, not duplicate datasets. There may at times be a need to deposit duplicate copies of a dataset in multiple data centres, for example where a project has been funded by multiple funders and each funder requires deposition in a different data centre. If possible we would suggest identifying the primary version of the dataset and assigning a DOI to this version only. Where there is an unavoidable need to publish a dataset in different locations each with a separate DOI, the metadata for each appearance of the dataset should indicate the association."

<http://cisti-icist.nrc-cnrc.gc.ca/obj/cisti-icist/doc/datacite/datasets.pdf> -- scroll to the bottom..

Monday, March 23, 2015

Canadian Financial Capability Survey PUMF

Question

When will we be receiving the Canadian Financial Capability Survey: Public use microdata file <http://www.statcan.gc.ca/daily-quotidien/150323/dq150323d-eng.htm?cmp=mstatcan>?

Answer

We received the files this morning. Before we release them on the EFT, we perform some quality checks, including validating the syntax cards, running it through a verification tool, renaming the files as per the FTP naming convention. The Data Coordinator is working on the staging of the files now, and it should be released soon, we will send an announcement upon release.

Replicate Weights

Question

A number of studies from Statistics Canada contain multiple ‘replicate weights’ – are these the same as bootstrap weights?

Answer

The subject matter responded:

"Not exactly, bootstrap weights are a type of replicate weight and we [census] used the “nonindependent random groups method” to create the replicate weights.

If your client would like more information on the method he can consult chapter 2 of the book: Introduction to Variance Estimation.

Friday, March 20, 2015

British Columbia Tourism Data

Question

A student here is looking for data on the number of foreign tourists in British Columbia.

He would like to obtain monthly data for the number of foreigners in British Columbia, engaged in each of the following activities, over the past 5 or 10 years.

- Skiing/snowboarding

- Sightseeing

- Visiting national parks

- Shopping

- Hiking

- Visiting beaches

The International Travel Survey doesn’t get into that level of detail, as far as I can tell; the Travel Survey of Residents of Canada has closer to that level of detail, but isn`t for foreign visitors.

Do you know if such data exist and are available? I have contacted the organization: DestinationBC, to ask if they might provide such data for a student who wishes to use them for a data analytics class. I also sent him a link to the data set on OpenDataBC, on the numbers of visitors to visitor information centres.Where else can he look?

Answer

Have a look at <http://bcstats.gov.bc.ca/StatisticsBySubject/BusinessIndustry/Tourism.asp x>, unfortunately the data they provide doesn't get down to the activity level in detail, but it does provide break downs for foreign tourists.

The Room Revenue Statistics <http://bcstats.gov.bc.ca/Files/c071216f-989d-42ea-9a8c-f68e885274ec/TourismRoomRevenue1303.xlsx> provides data for specific locations that the student could cross correlate with information on popular tourist type activities. For example, Tourism Vancouver statistics available at: <http://www.tourismvancouver.com /media/marketing-research/>. Most of the other economic regions/ municipalities in B.C. will have similar types of information.

There is a good guide to finding addition statistics called "WHERE TO GO FOR MORE TOURISM RESEARCH INFORMATION BC, CANADIAN, & INTERNATIONAL ORGANIZATIONS SOURCE LIST" available at:<http://www.tourismvancouver.com /pdf/research/where_to_go_tourism_research.pdf>.

National Parks Attendance statistics found at:<http://www.pc.gc.ca/eng/docs/pc/attend/ notes.aspx> and they could check the BC Parks website for <http://www.env.gov. bc.ca/bcparks/>. As you have already indicated the destination BC web site's Tourism Indicator: <http://www.destinationbc.ca/Research/Industry-Performance/Tourism-Indicators.aspx#.VQxJrPzF-Ag> has good information.

Sexual History of Youth/Adults

Question

I have a researcher who is looking for cross-sectional data over time that deals with sexual history of youths/adults in Canada. He has looked at both the Canadian Health Survey and the Canadian Community Health Survey which does deal with these questions but doesn't go far enough back. I've also looked at the National Population Health Survey.

Another lead is the BC Adolescent Health Survey <http://www.mcs.bc.ca/>, but any other suggestions would be most appreciated.

Answer

In addition to the ones your researcher has found, the Health Promotion Survey, 1990 asks these questions and you may also find additional information for college age people in the survey of Male-Female Dating Relationships in Canadian Universities and Colleges. Some Gallup opinion polls also seem to cover this as well. You'd think some cycles of the GSS cover this too?

I searched "intercourse" in <odesi> at the variable level:<http://odesi.ca/search/search.html?q=intercourse&field=VL&coll=odesiall&coll=DLI&coll=DLI_aggregate&coll=CGP&coll=CAD&date-gt=1871&date-lt=2015#10>.

There may also be data in ICPSR that covers this with a Canadian sample...

Thursday, March 19, 2015

CANSIM Table 381-5000 (GDP at CMA)

Question

A student has found CANSIM Table 381-5000, which according to footnote 1 is a table “based on the research paper "Metropolitan Gross Domestic Product: Experimental Estimates, 2001 to 2009" by Mark Brown and Luke Rispoli,” and would like past and more current years worth of data.

The Daily article (dated Nov. 10, 2014) on this report <http://www.statcan.gc.ca/daily-quotidien/141110/dq141110a-eng.htm> gives some explanation, including:

"Traditionally, Statistics Canada has produced estimates of GDP nationally and by province, but not at the metropolitan level. With growing interest in measuring the dynamics of metropolitan economies, new experimental estimates of GDP by CMAs have been developed.

These include estimates of GDP over the 2001 to 2009 period for 33 CMAs and the non-metropolitan regions of the nine provinces with a CMA.

May I please confirm that these are the only years for which GDP at CMA is currently available and whether or not data for future (or past) years will be forthcoming at some point?

Answer

Subject matter has confirmed that:

The study mentioned in your correspondence was the result of a pilot project undertaken by our colleagues in the Economic Analysis Division. I do not think there is any data in this series that goes back before 2001 or is more current than 2009.

There is no set date for new data from the experimental project, but we hope in the coming year to develop a set of more recent estimates, but no delivery date has been set for these.

Tuesday, March 17, 2015

HES 2013 Release Questions

Question

I noticed a couple of things with the HES 2013 Release:
1. The syntax zip file has 4 data files packed into it, which makes the name “Syntax” not entirely correct and


2. There are 2 copies of HES_i.sas and HES_o.sas, and as far as I can determine the only difference between them is that the comments have been removed from the newer set and the zero padding is different.

MD5 (HES_i.sas) = 17c775919840b54523a95421a0148adc
MD5 (./SAS/HES_i.SAS) = b10a6994da4fb62dee590be82afa8e34

MD5 (HES_o.sas) = c5b4e88fddd4ef942db393957530a906
MD5 (./SAS/HES_o.SAS) = bcdca0d8261600edcdd284532a70e23d


Answer

1.  We have begun staging the data files and using a verification tool to ensure variable and observation counts, see attached. If you would like to suggest how to save the files differently, we are open to your suggestions.


2. Subject matter confirmed:
"I can't shed any particular light on why this is, particularly with respect to the zero padding. The two flavours of each file will produce essentially the same end result. There are no differences in the column offsets for the inputs, so the ones outside the SAS can be deleted."

As recommended, we will remove from the EFT the .sas syntax cards outside the SAS sub-folder.

Updated Products: HES 2013

Households and the Environment survey 2013

The file provides data for Canada, the provinces and census metropolitan areas and includes information on a wide range of topics, including water quality concerns; consumption and conservation of water; energy use and home heating and cooling; pesticide and fertilizer use on lawns and gardens; recycling, composting and waste disposal practices. It also provides information on the socio-demographic, income and labour force characteristics of the population.

EFT: /MAD_DLI/Root/other-products/ Households and the Environment Survey-hes/2013

Pump Prices by Province 1960-1964

Question

Does anyone know of a source for gasoline pump prices (not a price index) by province (or even just Alberta) for 1960-1964? I've found prices for later years via NRCan.

The fact that analysts compiled pump prices in some publications <http://publications.gc.ca/collections/collection_2008/lop-bdp/prb/prb0755-e.pdf> could be an indication that they aren't available in an official publication, but any leads would be helpful.

Answer

There might be some information in the following publications. I have copied some records from our library catalogue but I have not confirmed whether they include useful details. The last title may not be useful as it may not have historical information. Also, our library does participate in interlibrary loan if you are interested in borrowing titles. If you want to check further you can use our library catalogue via the website at: <http://legislative.voyager.uregina.ca:7208/vwebv/searchBasic>.

Titles:

Title: Gasoline marketing in the context of the oil industry.
Variant Title:Report to the Honourable A. Russell Patrick, Minister of Industry & Tourism from the Gasoline Marketing Enquiry Committee. Author:Alberta. Gasoline Marketing Enquiry Committee.
Other Author(s):McKenzie, Kenneth A.
Publisher:[Edmonton] : L. S. Wall, Queen's Printer for Alberta, 1968, c1969.

Title:Report relating to the distribution and sale of gasoline in the city of Winnipeg and elsewhere in the province of Manitoba.
Variant Title:North Star and Shell gasoline consignment plans
Author:Canada. Restrictive Trade Practices Commission.
Other Author(s):Canada. Office of the President of the Privy Council.
Canada. Dept. of Justice.
Publisher:Ottawa : Dept. of Justice, 1966.

Title:Competition in the Canadian petroleum industry / Restrictive Trade Practices Commission.
Author:Canada. Restrictive Trade Practices Commission.
Other Author(s):Stoner, O. G.
Canada. Restrictive Trade Practices Commission.
Publisher:Ottawa : The Commission, c1986.

Monday, March 16, 2015

Fixed Reproducible Tangible Wealth

Question

I received this question from one of our Economics PhD students:

"I need net amount of household final consumption on durable goods for my thesis. If you look at Cansim table 38-0038, you will see durable goods numbers for each year but those are gross amounts.

Chain-type quantity index for net stock of fixed reproducible tangible wealth is another data set which I need. This index helps us to get real net stock of fixed reproducible tangible wealth which is the sum of durable goods owned by consumers, lows and stocks residential capital Flows and stocks of fixed non-residential capital.​"

Is this available via DLI?

Answer


The subject matter confirmed that:

"We don’t match the Capital cost allowance (CCA) with the type of durable goods. This data is not available. 
We also don’t deflate wealth."

Friday, March 13, 2015

Updated Products: CTADS 2013 documentation

Canadian Tobacco, Alcohol and Drugs Survey - CTADS

The public use microdata file for the 2013 Canadian Tobacco, Alcohol and Drugs Survey is now available on the DLI EFT site.

As of 2013, new content, covering alcohol use as well as prescription and non-prescription drug use, was added to the Canadian Tobacco Use Monitoring Survey. Therefore, the survey is now entitled Canadian Tobacco, Alcohol and Drugs Survey (CTADS).

The overall objective of CTADS is to provide regular and reliable data on tobacco, alcohol and drug use and related issues, with the primary focus on 15- to 24-year-olds.

DLI EFT:


/MAD_DLI/Root/other-products/Canadian Tobacco, Alcohol and Drugs Survey - ctads


The French user guide is now is now available on the DLI EFT

DLI EFT:

/MAD_DLI/Root/other-products/Canadian Tobacco, Alcohol and Drugs Survey - ctads/2013/doc/2013ectad-gid-fra.pdf

Updated Products - Labour Force Survey (LFS) Feb 2015

Labour Force Survey (LFS) – February 2015

LFS data for February 2015 are now available on the EFT site.

The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.

The LFS guide: <http://www.statcan.gc.ca/pub/71-543-g/71-543-g2014001-eng.htm>

Eft: /MAD_DLI/Root/other-products/Labour Force Survey - lfs/1976-2013/data/micro2015-02.zip

Incarceration Statistics

Question

Are there statistics in Canada indicating incarcerations by race? I believe that is how it is done in the U.S.

I think we have some figures on aboriginal numbers but nothing more. However, I noticed an article in the Winnipeg Sun referring to a 69% increase in the black population in federal prisons.

Answer

I received a very similar request recently and here is the response from subject matter:

Canadian Centre for Justice Statistics has further investigated the data availability, the inquired ‘ crime data by race’ is not available even through custom data request because Uniform Crime Reporting Survey does not collect the inquired data by race.

There is some data by ‘aboriginal/non-aboriginal identity’ that may be of interest to you.

Adult Correctional Services (ACS) is the source of data <http://www23.statcan.gc.ca :81/imdb/p2SV.pl?Function=getSurvey&lang=en&db=imdb&adm=8&dis=2&SDDS=3306>

Related CANSIM tables can be found at: <http://www5.statcan.gc.ca/COR-COR/COR-COR/objList?lang=eng&srcObjType=SDDS&srcObjId=3306&tgtObjType=ARRAY>

· 251-0026 Adult correctional services, community admissions to provincial and territorial programs by Aboriginal identity: <http://www5.statcan.gc.ca/COR-COR/Result?lang=eng&srcObjType=SDDS&tgtObjType=ARRAY&srcObjId=3306&tgtObjId=2510026>

· 251-0022 Adult correctional services, custodial admissions to provincial and territorial programs by aboriginal identity: <http://www5.statcan.gc.ca/COR-COR/Result?lang=eng&srcObjType=SDDS&tgtObjType=ARRAY&srcObjId=3306&tgtObjId=2510022>

Thursday, March 12, 2015

Concordance File for 1991 - 1996 EAs

Question

I was wondering if there is a concordance/correspond file linking EAs from 1991 to 1996. All I can seem find on the EFT are files for CTs and one linking 1996 EAs to FEDs.

Answer

The 1991 to 1996 EA correspondence was provided for the 1996 GeoSuite product. Here is a note that can be found on page 15 of the 1996 Catalogue:

“GeoSuite is completely redesigned for 1996 as a Windows-based product. For 1996, GeoSuite includes information that was previously available (1991 Census) as separate products, namely, the EA Reference Lists (92-312, 92-313, 92-314, 92-315, 92-316, and 92-317), EA Correspondence File (92F0076) and Geographic Attribute File (92F0075). Therefore, these products are not available separately for 1996.”

Because of technological changes GeoSuite96 is almost impossible to run on modern machines.We can, however, attach the file for your client.

Travel Activities and Motivation Survey 2006

Question

We would like to ask about a data issue with the TAMS 2006/EAPV 2006. The command
files in French and English declare the following:

VALUE LABELS </E07_Q01> 7

Should we go ahead and re-run the command file omitting the 7? Please note, the data file itself contains no values of 7 for this variable as per the screen shot from <http://www62.stat can.ca/webview/>



Answer

We consulted the TAMS Codebook and there is no value label 7. In order to remain consistent with the codebook, I would remove the value. It seems to be an error in the syntax provided.

Mother Tongue and Occupation

Question

I have a researcher looking for a recent breakdown of University professors (NOC-S code 4011) by mother tongue. There does not seem to be a table for this from the NHS 2011. The closest I can find only lists NOC-S codes at the single digit level

CAUT used to republish this data in its almanac based on Census data but their last data point comes from 2006. Is there a location for this data I have missed?

Answer

I confirmed with Census and Education Statistics Division that this information is not available as a standard product. However, the data can be obtained through a custom tabulation at a fee, contact us if your researcher would like to pursue this direction.

Wednesday, March 11, 2015

Concordance file for 1991 - 1996 EAs

Question

Is there a concordance/correspond file linking EAs from 1991 to 1996. All I can seem find on the EFT are files for CTs and one linking 1996 EAs to FEDs.

Answer

There are EA conversion files for 1971/76, 1981/86, and 1986/91, and a conversion file for 1996 EA to 2001 DA – but not for 1991/96! A few years ago a file was sent around which *might* fit the bill. It's 1996-1991, contact us if you'd like it sent to you.

Metadata for 2011 Census/NHS

Question

There is no metadata for the master files of either the 2011 census or the NHS on the STC Nesstar site; are either of them forthcoming soon (will there even be one for the 2011 census? I know there won't be a PUMF.) I have a researcher who is contemplating as custom tab from each and needs variable lists to consult, and I suspect the PUMF codebook for the NHS won't be detailed enough.

Answer

The DLI has been able to add the DDI metadata for masterfiles into the DLI Nesstar, thanks to the RDC Metadata project, which ended in 2012. There was additional funding to code several more master files, which ended in March 2014.

We have recently hired two students to code PUMF and masterfile microdata into DDI, however, I will have to confirm if NHS is on the RDC's priority list.

I will inquire if a zero-count codebook is available that could be shared.

DA level tables on Aboriginal Identity and Gender for the 2006 Census

Question

I was surprised not to find Dissemination Area level tables on Aboriginal identity groups and Gender (women) for the 2006 Census. Would I be missing something? If not, may I inquire how long a custom tabulation would take and whether there is interest in the DLI for a group purchase? We are interested in Winnipeg at the very least. One of our researchers is looking at worsening homelessness for Winnipeg and has a DA-level table from the 2011 NHS.

Answer

Try consulting the cumulative profile of the 2006 Census at the DA level:

Variables at the DA level include Aboriginal identity population

Item Code: 565

Default Label: Total Aboriginal identity population

--------------------------------------------------------------------------------

Notes:

FOOTNOTE:

Included in the Aboriginal identity population are those persons who reported identifying with at least one Aboriginal group, that is, North American Indian, Métis or Inuit, and/or those who reported being a Treaty Indian or a Registered Indian, as defined by the Indian Act of Canada, and/or those who reported they were members of an Indian band or First Nation.

There is also the female and male population in the profile.

/MAD_DLI/Root/census_pop_recens/2006/profiles/complete-cumulative-profile/b2020/94-581-X2006002v2-ivt.zip

Tuesday, March 10, 2015

APS 2012 PUMF release date

Question

I am wondering when a PUMF is due to be released for 2012 APS. I see reference that it was tentatively due to be released this past January, but find no data file on the FTP server or and only Excel files available through Nesstar.

Answer

The file is set to be released on Tuesday March 24, 2015.

Friday, March 6, 2015

CDATA within a qstnLit

Question

I downloaded the documentation for a survey but some question literal tag <qstnLit> in the documentation includes CDATA tags, this doesn’t reflect the DTD or the Best Practice document. (i.e. Let’s say I’m looking for the variable name (name="FHP_Q110") within this document (gss-12M0020-E-2006-ft.xml line number: 51 134) but the <qstnLit> text value is within a CDATA). Also, I double checked the xml document (gss-12M0020-E-2006-ft.xml) it’s not all the <qstnLit> that have a CDATA. It’s written in the DDI best practices that : CDATA is text that will NOT be parsed by a parser. Tags inside the text will NOT be treated as markup and entities will not be expanded. (p.92). But I don’t understand why some <qstnLit> text value include a CDATA and some don’t?

<http://odesi2.scholarsportal.info/webview/index.jsp?object=http://142.150.190.128:80/obj/fVariable/gss-12M0020-E-2006-ft_V598> - This was the link for FHP_Q115 you need to select the previous variable to access FHP_Q110

Dataset: General Social Survey, Cycle 20, [Canada] 2006: Family Transitions

<var ID="V597" name="FHP_Q110" files="F1" dcml="0" intrvl="discrete">
<location StartPos="1173" EndPos="1173" width="1" RecSegNo="1"/>
<labl>

Thinking about your experience before and after the birth/adoption of your (youngest) child, did you contact or receive assistance from a health care professional or community-based service or support group concerning:...parenting skills (i.e. pre-natal c

</labl>
<qstn>
<qstnLit>
<![CDATA[ Thinking about your experience before and after the birth/adoption of your (youngest) child, did you contact or receive assistance from a health care professional or community-based service or support group concerning:...parenting skills

(i.e. pre-natal courses, mother's support group, public health nurse, etc.)?]]>

</qstnLit>


Answer

CDATA tags are just a low-level detail of XML data management software. If characters like ‘<‘ or ‘&’ show up in the XML, they would invalidate the document. These characters have to be encoded to prevent this. The CDATA construct is just a way to encode text that might contain these characters. For software, it is always a good idea to put such guards around any data that might be entered by hand. In any event, these data are the same whether or not there is a CDATA.

For this particular document in <odesi>, it looks like most of the CDATA in the <qstnLit> are because of new line characters that were in the original. Without doubt, these data have passed through multiple automated systems.

Voluntary Questions LFS May 2011

Question

A researcher is interested in data from 3 voluntary questions added to the May 2011 LFS at the request of Elections Canada, but those variables do not appear to be included in odesi, Equinox, or DLI-Nesstar copies of the dataset. I haven’t actually downloaded it from EFT yet, but should I be looking somewhere else for these variables?

Here’s the source of information about the inclusion of these questions: <http://www.statcan.gc.ca/pub/75-001-x/2012001/article/11629-eng.htm#source>

Answer

The 3 voluntary questions added to the May 2011 LFS are available on the DLI EFT. They are not present on the PUMF, but rather we received tables.

On the EFT site:

/MAD_DLI/Root/other-products/Labour Force Survey – lfs/1976-2015/tables_on_voting_data