Tuesday, January 22, 2008

Labour Force Survey 2007 Question

Question

Looking at the 2007 LFS files - what sps file corresponds to these? All the files I see go as far as 2005. Is this file still valid for the 2007 data files?

Answer

The author division has confirmed that the record layout and the micro-data file haven't changed since 2005 and that you can use the 2005 LFS SPSS files with the 2007 LFS data files

Monday, January 21, 2008

Cultural Diversity in Housing

Question

I have a researcher who is examining cultural diversity in housing. Specifically he is looking dwelling type by place of birth for the CMA level. I am having quite a time finding anything, can someone put me on the right track?

Answer

Consider using the Census public use microdata to pursue your patron's request. The individual level file has the following dwelling variables as well as CMA and place of birth.

6 1 10 N HHCLASSP Household classification
7 2 11-12 N HTYPEP Household type
8 1 13 N UNITSP Household size
9 2 14-15 N ROOMP Number of rooms
10 1 16 N CONDWELP Condition of dwelling
11 6 17-22 N VALUEP Value of dwelling
12 1 23 N TENURP Tenure
13 1 24 N RCONDP Tenure -- condominium
14 4 25-28 N OMPP Owner's major payments (monthly)
15 4 29-32 N GROSRTP Monthly gross rent

Correctional Facilities Population

Question

I understand from the Census Dictionary that residents of correctional facilities are enumerated using the institution's administrative records. Is it possible to find out exactly the nature of the information recorded? And would anyone know the method by which the admin record is translated into a census record?

Is the correctional facilities population included in aggregate records for the Census at any geographic level?

Answer

It is correct that correctional facilities are enumerated using the institution's administrative records. Census enumerators complete the 3A Census questionnaire for usual residents in correctional facilities using their administrative records. The 3A form collects basic demographic data: marital status, sex, date of birth(age), and first language.

Usual Residents of correctional facilities are considered as part of the institutional population. Any Census product (regardless of geographic level) where the universe includes institutional population will include residents of correctional facilities unless otherwise stated. For example, the 2006 Census population counts released on March 13, 2007 and the Age & Sex data released on July 17, 2007 include the institutional population and as a result include the residents of correctional facilities.

Wednesday, January 16, 2008

Annual data for Automobile/Auto Parts Exports and Imports

Question

A student here is hoping to get data back to 1961 for Canada - United States trade in automobiles and auto parts (North American Industrial
Classification System codes 3361 through 3363). He would like annual data for these.

We can go back to 1980 with the World Trade Database - it uses the Standard International Trade Classification codes: the student is going to try to
identify the codes comparable to the NAICS.

Any ideas on earlier data? If someone had happened to digitize the back issues of Canadian international merchandise trade, (and they all have what the earliest electronic copy does) I think the student's life would be made much easier.

Answer

You may want to have a look at the trade data available through the UNs Comtrade database (http://comtrade.un.org/db/default.aspx). Comtrade has commodity based trade data for a number of reporting countries, including Canada, and some of their data goes back to 1962. The trade data won't be industry based however, it will be commodity based. Some helpful Comtrade user guides are available on the right side of Comtrade's home page (http://comtrade.un.org/db/default.aspx) and on the Basic Query Screen (http://comtrade.un.org/db/dqBasicQuery.aspx).

After checking with the International Trade Division, they stated that: "On a cost-recovery basis, we could produce automobile data by SEG (Summary Export Goods) (1966 to present) but it's not necessarily limited to manufacturing and it could become extremely costly. For your information, here are the SEG codes that cover the automobile industry:
51110 Passenger automobiles and chassis
51120 Trucks, truck tractors and chassis
51131 Other motor vehicles
51132 Motor vehicle engines and parts
51139 Motor vehicle parts, except engines".

They also told me that we started using NAICS in 1992 and that we had SIC (Standard Industrial Classification) prior to that. They mentioned that Industry Canada's Trade Data Online (http://www.ic.gc.ca/sc_mrkti/tdst/engdoc/tr_homep.html) would give you NAICS based data back to 1992 but that it would be difficult for the student to find NAICS data going back further than that.

Rounding Census 2006 Numbers

Question

If I recall properly, the census numbers used to be perturbed during rounding - is this the case with the 2006 Census?

Answer

Below is the description of how random rounding is used and how it applies to data for the 2006 Census. The information was provided by our Census Consultants and comes from pages 6 and 7 of "Data Quality and Confidentiality Standards and Guidelines" for the 2006 Census
(http://www12.statcan.ca/english/census06/reference/notes/DQguidelines/PDF/2006-DQ-Public-Guide-E.pdf).

From pages 6-7

2.2. Random rounding
All counts in census tabulations are subjected to a process called random rounding. Random rounding transforms all raw counts to random rounded counts. This reduces the possibility of identifying individuals within the tabulations. For 2A (100%) data, all counts are rounded to a base
of 5. This means that all 2A counts will end in either 0 or 5. The random rounding algorithm employed controls the results and rounds the unit value
of the count according to a pre-determined frequency.

2B (20%) data require a slightly different random rounding algorithm. All counts greater than 10 are rounded to base 5, as is done for 2A data. Counts less than 10 are rounded to base 10. This means that any 2B counts less than 10 will always be changed to 0 or 10. The table below shows the effect of rounding on 2B counts with a value less than 10.

The random rounding algorithm uses a random seed value to initiate the rounding pattern for tables. In these routines, the method used to seed the
pattern can result in the same count in the same table being rounded up in one execution and rounded down in the next.

FED Land Area & Population

Question

A researcher at McMaster is looking for population density data of Canadian Federal Electoral Districts from 1972 to present. So the researcher needs land areas and populations for each FED. Does anyone know where this data might be available?

Answer

The Geography Division recommends that your client use the 1991 - 2006 GeoSuite products to get the population data for FEDs. Beyond 1991, the Geography Division suggests that you use the Geographic Attribute File (GAF) files, which go back to 1971, to get the population data. The GAF, their user guides and record layouts are available in the Geography folder on the DLI FTP site. (Please note that there is no user guide for the 1996 GAF, only a record layout.)

Geosuite 2006 and 2001 will also provide you with land area for FEDs. GeoSuite1996 and GeoSuite1991 do not have land area for FEDs however, nor do the GAF files. You may need to refer to print publications or other sources to find FED land areas for 1996 and previous years.

Monday, January 14, 2008

Publication of Beyond 20/20 Tables in Textbook

Question

I know part of the answer to this question, but I was hoping for a bit more clarification. A professor here is writing a textbook and wants to generate tables in Beyond 20/20 (I think using homicide/victim/crime data) and wants to know if she can use them in her book. Now, I'm pretty sure that if she is getting the B2020 tables from DLI, the answer is NO and that she would need to seek permission directly. However, what if she is getting them from E-Stat? I suspect the answer is still no because E-Stat is for educational use? AND, what if she generates them on the Stat-Can website and then pays for the time series that way? Any help/clarification you can provide is much appreciated.

Answer

Publishing Statistics Canada products in textbooks falls outside the DLI license. The Regional Office should help your professor get the approval she needs.

The DLI Licensing database on the DLI website says, "Faculty are referred to the Marketing Division of Statistics Canada for discussions on textbook licensing."