Question:
I have a question regarding the restrictions outlined in the PCCF end-use license agreement.
Here is the question, in the researcher’s own words:
“I am inquiring in relation to end-use license agreement, and how much we would be able to expand the analysis geographically. We have done some organizing of the data according to PC, and then explored via geonames.org in order to get an approximate location just so we can get a sense of what size of town/small city is defined within urban and rural. However, I do not want to violate terms of use bullet #2. What we would like to be able to do is to separate into large urban/small urban and rural, which we think we can do with the information we have. Our research team is also wondering if there is a possibility to split north and south province. Can you provide any clarification to us on the agreement and what we can and cannot do?”
My understanding is that the researchers are worried that manipulating the data as outlined might count as “disassembling” or “decompiling” the data, as restricted by point 2 of the agreement. Do they have cause for concern?
Answer:
I’ve received the following response from subject matter:
“It seems that your client is using the PCCF only to correspond the postal codes to different geographies. That is the purpose of the product.
Using geonames.org to find locations, or other methods may not be necessary; that information is readily available at StatCan. Using the PCCF, most of the postal codes are linked to a dissemination block (DB), or perhaps a dissemination area (DA). There will be a representative point for those, with xy coordinates for more accurate location finding. Census Subdivision (CSD) is also provided, which is the postal code legal municipality.
Linking our Geographic Attribute File (GAF, https://www12.statcan.gc.ca/census-recensement/2011/geo/ref/att-eng.cfm) to the PCCF provides links to higher level census geographies like population centres (PopCtr, https://www12.statcan.gc.ca/census-recensement/2016/ref/dict/geo049a-eng.cfm) and Census Divisions (CD). Using PopCtr, the client can determine rural/urban. Using CDs the client can determine north/south (http://www23.statcan.gc.ca/imdb/pUtil.pl?Function=getNote&Id=792561&NT=04).
We can provide an Excel file of the CDs in North/South, rather than the client having to get it from the website.”
Showing posts with label DLI Administrative. Show all posts
Showing posts with label DLI Administrative. Show all posts
Wednesday, August 28, 2019
Monday, August 19, 2019
PCCF+ Header File
Question:
I would like so information on how to massage a pccf+ output (2006 census geo, version 5k). The preview in SAS is clear, but I generate a .csv which contains more variables (without any header) than the one that is being displayed in SAS (first 500 records). Could someone help me with this?
Answer:
I was given the following response:
“The CSV is output automatically when you run PCCF+. The output variables are indeed provided with variable names (at the top of the column), and the variables are all listed and explained in the documentation.
If the user wishes, they can import the CSV back into SAS and eliminate some of the columns, or rename their variables. This can also be done easily in Excel.
Alternatively, if they prefer the version that pops up automatically in SAS (which in this case seems to lack problems), they can hard-save that output as a SAS file using a DATA step, or output it as another CSV.
Without having more details, it’s hard to provide more feedback than that.
Please let the user know that there is a newer version of the PCCF+ available.”
I would like so information on how to massage a pccf+ output (2006 census geo, version 5k). The preview in SAS is clear, but I generate a .csv which contains more variables (without any header) than the one that is being displayed in SAS (first 500 records). Could someone help me with this?
Answer:
I was given the following response:
“The CSV is output automatically when you run PCCF+. The output variables are indeed provided with variable names (at the top of the column), and the variables are all listed and explained in the documentation.
If the user wishes, they can import the CSV back into SAS and eliminate some of the columns, or rename their variables. This can also be done easily in Excel.
Alternatively, if they prefer the version that pops up automatically in SAS (which in this case seems to lack problems), they can hard-save that output as a SAS file using a DATA step, or output it as another CSV.
Without having more details, it’s hard to provide more feedback than that.
Please let the user know that there is a newer version of the PCCF+ available.”
Wednesday, July 3, 2019
PUMF Yearly Subscription Fee
Question:
Under the Statistics Canada Open License, there is no cost to access to the PUMFs. In this link about PUMFs (updated today), it states there is a $5000 subscription fee. I'm confused, why there is still a subscription fee?
Answer:
The yearly subscription fee mentioned is for access to the PUMF collection – not for individual PUMFs.
Under the Statistics Canada Open License, there is no cost to access to the PUMFs. In this link about PUMFs (updated today), it states there is a $5000 subscription fee. I'm confused, why there is still a subscription fee?
Answer:
The yearly subscription fee mentioned is for access to the PUMF collection – not for individual PUMFs.
Labels:
DLI Access,
DLI Administrative
Monday, June 24, 2019
PCCF Reference Guide
Question:
Where on the EFT is the 2019 February version of the Reference Guide?
Answer:
It is currently being stored here: /MAD_PCCF_FCCP_DAM/Root/EnglishSASInstructionsPCCF+
Where on the EFT is the 2019 February version of the Reference Guide?
Answer:
It is currently being stored here: /MAD_PCCF_FCCP_DAM/Root/EnglishSASInstructionsPCCF+
Wednesday, June 19, 2019
PCCF+ Licensing
Question:
We are working with a Master’s student that would like to use the PCCF+ to run postal codes to match to DAs. She will be collecting the postal codes from a survey that is being run as part of a larger project that involves Queen’s University, the City and a consulting firm in town. She is a co-investigator and the work she will be doing will contribute to her Master’s thesis and results will be shared with the City and the consulting firm.
While the survey data will be accessible to non-Queen’s members of the project team; the PCCF+ would only be available to the student researcher. If the student uses the PCCF+ to run the collected postal codes and get associated DAs, can just the DAs be shared as a result with non-Queen’s members of the project team?
The composition of the project team (with the municipality and consultants) partnering with the University is throwing us off a little bit.
Answer:
We’ve received the following response from the PCCF team:
“Yes, just the DAs (geography) can be shared as a result with non-Queen’s members of the project team as long no postal codes are displayed in the student’s results or reports.
All end users receiving these products must abide by the requirements of the End-use Licence Agreement for these products. We would like to point out some specific constraints on the use of these data products:
We are working with a Master’s student that would like to use the PCCF+ to run postal codes to match to DAs. She will be collecting the postal codes from a survey that is being run as part of a larger project that involves Queen’s University, the City and a consulting firm in town. She is a co-investigator and the work she will be doing will contribute to her Master’s thesis and results will be shared with the City and the consulting firm.
While the survey data will be accessible to non-Queen’s members of the project team; the PCCF+ would only be available to the student researcher. If the student uses the PCCF+ to run the collected postal codes and get associated DAs, can just the DAs be shared as a result with non-Queen’s members of the project team?
The composition of the project team (with the municipality and consultants) partnering with the University is throwing us off a little bit.
Answer:
We’ve received the following response from the PCCF team:
“Yes, just the DAs (geography) can be shared as a result with non-Queen’s members of the project team as long no postal codes are displayed in the student’s results or reports.
All end users receiving these products must abide by the requirements of the End-use Licence Agreement for these products. We would like to point out some specific constraints on the use of these data products:
- to be used for the purpose of matching postal codeOM data to geography (Clause 4);
- not to be used for mail preparation purposes (Clause 5);
- acknowledgment of data licensed from Canada Post (Clause 7);
- reproduction and publication of up to 1% of content (Clause 8); and
- a disclaimer for search engine applications (Clause 10).”
Wednesday, June 5, 2019
Modernization Bulletin - May 2009
The May 2019 issue of the Statistics Canada Modernization Bulletin is now available on the EFT. The bulletin aims to keep you up to date on the latest Statistics Canada initiatives and transformations.
EFT: /MAD_DLI_IDD_DAM/Root/ModernizationBulletinModernisation
EFT: /MAD_DLI_IDD_DAM/Root/ModernizationBulletinModernisation
Labels:
DLI Administrative,
Updates
Monday, June 3, 2019
Ways to Contact DLI
I wanted to take a moment to remind you of the different avenues available to contact us here at the DLI.
The listserv is meant to serve as a collaborative venue to a) ask questions in which other members of the community may be able to help with, b) ask questions to the DLI team at Statistics Canada which may be of interest to the entire community, and c) to share other bits of interesting information including upcoming training dates, webinars and presentations, notices, suggestions, etc.
In order to help cut down the amount of messages sent through the listserv, we are asking that if you have a question intended specifically for us here at Statistics Canada (that you don’t feel is beneficial to the entire community) or if you have a custom tabulation request, you contact us directly at statcan.maddli-damidd.statcan@canada.ca.
We also request that you limit the number of emails sent directly to our personal email addresses. While we love hearing from you, it’s always a possibility that we may not be in the office for an extended period of time and your questions may go unnoticed!
The listserv is meant to serve as a collaborative venue to a) ask questions in which other members of the community may be able to help with, b) ask questions to the DLI team at Statistics Canada which may be of interest to the entire community, and c) to share other bits of interesting information including upcoming training dates, webinars and presentations, notices, suggestions, etc.
In order to help cut down the amount of messages sent through the listserv, we are asking that if you have a question intended specifically for us here at Statistics Canada (that you don’t feel is beneficial to the entire community) or if you have a custom tabulation request, you contact us directly at statcan.maddli-damidd.statcan@canada.ca.
We also request that you limit the number of emails sent directly to our personal email addresses. While we love hearing from you, it’s always a possibility that we may not be in the office for an extended period of time and your questions may go unnoticed!
Labels:
DLI Access,
DLI Administrative
Friday, May 31, 2019
2016 Labour Force Survey - missing NOCs
Question:
A researcher is looking for NOC occupation groups for the monthly 2016 Labour Force Survey PUMFs. Is this the way it should be represented: NOCS_01_25 STC Nesstar and NOCS_01_47 STC Nesstar when there actually are no such variables in 2016?
Is it correct that the values are 100% missing for in 2016? It would be helpful to have NOC values for 2016.
What was the intention in 2016 when there are NOC values for the PUMF for these variables, e.g., on Odesi see January 2015: NOCS_01_25 link and NOC_01_47 link and for January 2017: NOC_10 LINK and NOC_40 LINK? Would this be considered a problem with the data files?
As an aside, there are SOC80_21 and SOC80_49 variables (with values) in 2015 and 2016. Revisions to the 2015 LFS
Follow-Up Question:
I’d like to make a correction please to my question, with thanks to Scholars Portal for reviewing the occupation variables from 2015 – 2017 LFS monthly PUMFs.
Is it correct that there are supposed to be placeholders for a number of occupation variables but no data as highlighted in yellow below?
A researcher is looking for NOC occupation groups for the monthly 2016 Labour Force Survey PUMFs. Is this the way it should be represented: NOCS_01_25 STC Nesstar and NOCS_01_47 STC Nesstar when there actually are no such variables in 2016?
Is it correct that the values are 100% missing for in 2016? It would be helpful to have NOC values for 2016.
What was the intention in 2016 when there are NOC values for the PUMF for these variables, e.g., on Odesi see January 2015: NOCS_01_25 link and NOC_01_47 link and for January 2017: NOC_10 LINK and NOC_40 LINK? Would this be considered a problem with the data files?
As an aside, there are SOC80_21 and SOC80_49 variables (with values) in 2015 and 2016. Revisions to the 2015 LFS
Follow-Up Question:
I’d like to make a correction please to my question, with thanks to Scholars Portal for reviewing the occupation variables from 2015 – 2017 LFS monthly PUMFs.
Is it correct that there are supposed to be placeholders for a number of occupation variables but no data as highlighted in yellow below?
- All the LFS monthly PUMFs, 1987-2015, have data for NOCS-01-25 and NOCS_01_47, but as for the variables SOC80_21 and SOC80_49, there is no data.
- The monthly 2016 LFS PUMFs have variables named NOCS_01_25, NOCS_01_47, SOC80_21 and SOC80_49 but with no data.
- The monthly 2017 LFS PUMFs have two occupation variables only, the new NOC_10 and NOC_40.
Answer:
The short answer to this is simply that the labels have changed from the older years to the newer ones. For example, SOC80_21 was replaced with an NOC listing instead. From what I’ve been told, the descriptions of the variables should remain the same, it’s just a matter of needing to match them up from year to year (not necessarily the answer you were looking for I’m sure!)
So yes, it is correct that there is no data.
Thursday, May 30, 2019
Where are the PCCF Files?
Question:
I was looking for the most recent version of the PCCF which is supposed to be under MAD_PCCF_FCCP_DAM/ROOT/2019 but this directory does not show up (there are folders for 1986 all the way to 2018, but no 2019). Also, if I look under /MAD_PCCF_FCCP_DAM/Root/2018/pccf-fccp for the previous version, this folder is empty!
Answer:
We moved it to reflect the most recent Census release! See attached. Sorry for the confusion!
*Original email included attachment*
I was looking for the most recent version of the PCCF which is supposed to be under MAD_PCCF_FCCP_DAM/ROOT/2019 but this directory does not show up (there are folders for 1986 all the way to 2018, but no 2019). Also, if I look under /MAD_PCCF_FCCP_DAM/Root/2018/pccf-fccp for the previous version, this folder is empty!
Answer:
We moved it to reflect the most recent Census release! See attached. Sorry for the confusion!
*Original email included attachment*
Wednesday, May 29, 2019
PCCF in csv Format
Question:
Until we complete hiring a new data librarian, I’m the contact for anyone in our institution needing the PCCF and PCCF+ files and my knowledge is very limited. Usually I just provide the files and all is good, but now a grad student who has received the files sent me the following: “I am not familiar with SAS. Could you please send me a csv or Excel file which contains the postal codes and corresponding regions”.
Answer:
I’ve received the following response from the PCCF team:
“We do not provide the PCCF or PCFRF files in csv or Excel format. The files are quite large, and would take up even more room if they were provided is some application format. They are provided as a flat, fixed width, ASCII text file (.txt), which is readable/importable into a wide variety of applications. This is not a format specific to SAS. The client should be able to use it in whatever data application they are familiar with.
The record layout is provided in the Reference Guide (page 11 in the 2019 February version). That explains the format of the text file, and how the client needs to define the file in their application.
It’s not the most convenient way to provide the files. However, it is the format that is usable in the most applications, and has the smallest file size.”
Until we complete hiring a new data librarian, I’m the contact for anyone in our institution needing the PCCF and PCCF+ files and my knowledge is very limited. Usually I just provide the files and all is good, but now a grad student who has received the files sent me the following: “I am not familiar with SAS. Could you please send me a csv or Excel file which contains the postal codes and corresponding regions”.
Answer:
I’ve received the following response from the PCCF team:
“We do not provide the PCCF or PCFRF files in csv or Excel format. The files are quite large, and would take up even more room if they were provided is some application format. They are provided as a flat, fixed width, ASCII text file (.txt), which is readable/importable into a wide variety of applications. This is not a format specific to SAS. The client should be able to use it in whatever data application they are familiar with.
The record layout is provided in the Reference Guide (page 11 in the 2019 February version). That explains the format of the text file, and how the client needs to define the file in their application.
It’s not the most convenient way to provide the files. However, it is the format that is usable in the most applications, and has the smallest file size.”
Wednesday, May 22, 2019
PCCF Problem
Question:
A student here is using the latest edition of the PCCF, and is running into a curious problem -- she only gets results for two provinces - Newfoundland and Nova Scotia. I don't have any experience actually using the PCCF, so I'm stumped as to where the problem might be.
Any idea why this might be happening?
Answer:
The PCCF team will need a little more information in order to troubleshoot:
"If we can get further information, as in any of the following…
— complete PCCF vintage
— software tool used to do the query / analysis (MS Access, SAS, etc.)
— what the query / analysis actually is
— results anticipated
— actual results"
A student here is using the latest edition of the PCCF, and is running into a curious problem -- she only gets results for two provinces - Newfoundland and Nova Scotia. I don't have any experience actually using the PCCF, so I'm stumped as to where the problem might be.
Any idea why this might be happening?
Answer:
The PCCF team will need a little more information in order to troubleshoot:
"If we can get further information, as in any of the following…
— complete PCCF vintage
— software tool used to do the query / analysis (MS Access, SAS, etc.)
— what the query / analysis actually is
— results anticipated
— actual results"
Tuesday, May 21, 2019
Estimated Release of 2017 APS PUMF
Question:
I have a researcher interested in the 2017 Aboriginal Peoples Survey data. When might the PUMF be released? It’s not listed on the tentative release dates page yet. I understand if nothing specific is known at this time, but it would be helpful to know if it won’t be in 2019, for example.
Answer:
The 2017 APS PUMF will not be disseminated until 2020 – the exact date hasn’t been determined yet however.
I have a researcher interested in the 2017 Aboriginal Peoples Survey data. When might the PUMF be released? It’s not listed on the tentative release dates page yet. I understand if nothing specific is known at this time, but it would be helpful to know if it won’t be in 2019, for example.
Answer:
The 2017 APS PUMF will not be disseminated until 2020 – the exact date hasn’t been determined yet however.
Tuesday, May 14, 2019
PCCF7B / FCCP7B
The latest version of the PCCF file (PCCF7B) is now available on the EFT. It can be found at the following location:
/MAD_PCCF_FCCP_DAM/ROOT/2019/PCCF7B.zip
/MAD_PCCF_FCCP_DAM/ROOT/2019/PCCF7B.zip
Monday, May 13, 2019
New Files on Statistics Canada Nesstar - CCHS 2015
We are pleased to inform you that the following (metadata) are now available on the Statistics Canada Nesstar WebView site.
- Canadian Community Health Survey (CCHS) 2015 Nutrition
- Canadian Community Health Survey (CCHS) 2015-2016 Annual
Labels:
DLI Access,
DLI Administrative
Friday, May 10, 2019
New Release - LFS April 2019
We are pleased to inform you that the following product is now available.
Labour Force Survey (LFS) - April 2019
This public use microdata file contains non-aggregated data for a wide variety of variables collected from the Labour Force Survey (LFS). The LFS collects monthly information on the labour market activities of Canada's working age population. This product is for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products. The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.
EFT: /MAD_PUMF_FMGD_DAM/Root/3701_LFS_EPA/1976-2018/data/micro2019-04.zip
Nesstar Webview: Labour Force Survey (LFS), April 2019
Labour Force Survey (LFS) - April 2019
This public use microdata file contains non-aggregated data for a wide variety of variables collected from the Labour Force Survey (LFS). The LFS collects monthly information on the labour market activities of Canada's working age population. This product is for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products. The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.
EFT: /MAD_PUMF_FMGD_DAM/Root/3701_LFS_EPA/1976-2018/data/micro2019-04.zip
Nesstar Webview: Labour Force Survey (LFS), April 2019
Monday, May 6, 2019
Modernization Bulletin
The April 2019 issue of the Statistics Canada Modernization Bulletin is now available on the EFT. The bulletin aims to keep you up to date on the latest Statistics Canada initiatives and transformations.
EFT: /MAD_DLI_IDD_DAM/Root/ModernizationBulletinModernisation
Please note that we are currently working on having these accessible with an ISSN number in the near future!
EFT: /MAD_DLI_IDD_DAM/Root/ModernizationBulletinModernisation
Please note that we are currently working on having these accessible with an ISSN number in the near future!
Labels:
DLI Administrative,
Updates
Tuesday, April 16, 2019
Updated DLI Licencing Page
We are pleased to announced that we have updated our Licence page on the DLI website. We hope this will help answer any of your questions in regards to the new Open Data Licence. As a reminder, the only change that has occurred is that the PUMFs now fall under this Open Licence. All other DLI products and licences will remain the same.
Please visit: https://www.statcan.gc.ca/eng/dli/caselaw/license
Please visit: https://www.statcan.gc.ca/eng/dli/caselaw/license
Labels:
DLI Access,
DLI Administrative
Monday, April 15, 2019
Question about Publicizing Data Visualizations
Question:
I'm practicing my STATA coding skills and I was wondering,
- would I be violating any regulations if I were to graph any interesting associations and post the graphs on social media?
For example, if I create graphs using the Canadian Community Health Survey PUMF and post the graphs on my social media, would I be breaching any of the policies in the Data use restrictions?
Please let me know as I think it would be interesting to post a few data visualization series across my social media profiles.
Answer:
As long as the researcher is citing Statistics Canada, it shouldn’t be a problem!
I'm practicing my STATA coding skills and I was wondering,
- would I be violating any regulations if I were to graph any interesting associations and post the graphs on social media?
For example, if I create graphs using the Canadian Community Health Survey PUMF and post the graphs on my social media, would I be breaching any of the policies in the Data use restrictions?
Please let me know as I think it would be interesting to post a few data visualization series across my social media profiles.
Answer:
As long as the researcher is citing Statistics Canada, it shouldn’t be a problem!
Labels:
DLI Administrative
Wednesday, April 10, 2019
New files on Statistics Canada Nesstar
We are pleased to inform you that the following are now available on the Statistics Canada Nesstar WebView site.
Labour Force Survey (LFS) March 2019 - PUMF
Survey of Emergency Preparedness and Resilience in Canada (SEPR) 2014
Labour Force Survey (LFS) March 2019 - PUMF
Survey of Emergency Preparedness and Resilience in Canada (SEPR) 2014
Labels:
DLI Access,
DLI Administrative
Friday, April 5, 2019
New Release: LFS March 2019
We are pleased to inform you that the following product is now available.
Labour Force Survey (LFS) – March 2019
This public use microdata file contains non-aggregated data for a wide variety of variables collected from the Labour Force Survey (LFS). The LFS collects monthly information on the labour market activities of Canada's working age population. This product is for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products. The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.
EFT: /MAD_PUMF_FMGD_DAM/Root/3701_LFS_EPA/1976-2019/data/micro2019/micro2019-03.zip
Nesstar access will be available on Monday, April 8th.
Labour Force Survey (LFS) – March 2019
This public use microdata file contains non-aggregated data for a wide variety of variables collected from the Labour Force Survey (LFS). The LFS collects monthly information on the labour market activities of Canada's working age population. This product is for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products. The Labour Force Survey estimates are based on a sample, and are therefore subject to sampling variability. Estimates for smaller geographic areas, industries, occupations or cross tabulations will have more variability. For an explanation of sampling variability of estimates, and how to use standard errors to assess this variability, consult the Data Quality section in the Guide to the Labour Force Survey.
EFT: /MAD_PUMF_FMGD_DAM/Root/3701_LFS_EPA/1976-2019/data/micro2019/micro2019-03.zip
Nesstar access will be available on Monday, April 8th.
Subscribe to:
Posts (Atom)