modern methods of data collection in poland - unece … · methods of data collection corstat...
TRANSCRIPT
Janusz DygaszewiczDirector of Programming and Coordination
of Statistical Surveys Department
Central Statistical Office of Poland
UNECE Workshop on Statistical Data Collection
Washington DC, USA, 29 April – 1 May 2015
Methods of data collection
CorStat System – supporting process of collecting data
Transformation of administrative data to statistical data
Processing data
A geographic information system (GIS) as a tool to support,
monitor and control the work of interviewers in the field
2
CAWI
• Computer Assisted Web Interview
• Reporting Portal for legal entities, organizational entities without
legal status and persons conducting economic activity businesses
CAII
• Computer Assisted Internet Interview
• for individual persons selected for participation in surveys
CATI• Computer Assisted Telephone Interview
CAPI• Computer Assisted Personal Interview
REG• Administrative sources
3
CAWI - Reporting Portal
data validation and correction by the reporting person and
the statistician during data collection
monitoring of the report’s completion
notification of the approaching report submission deadlines
and reminders in electronic format
communication between the reporting person and the
statistician
savings - elimination of paper questionnaires
faster data processing - no entering data from paper
questionnaires
4
0
100 000
200 000
300 000
400 000
500 000
600 000
700 000
October
2008
January
2009
December
2009
June 2011 August
2012
October
2014
April 2015
33 745
202 641
359 573
559 883
643 190 661 719700 000
5
persons answer survey questions using the
application available online on the Internet;
implemented for agricultural surveys;
and social surveys, like household budget survey:◦ CAII also support not only respondents but also
interviewers, e.g.:
the interviewers have an option of remote preview of
the form completed independently by respondents,
the interviewers actively assist the respondents,
quickly explaining possible mistakes.
6
currently implemented in all agricultural surveys and
in most social surveys;
scheduled as the first or the second (following CAII)
channel of collecting data;
working posts of telephone interviewers - located in
separated Call Center studies;
telephone interviewers - provided with professional
equipment.
7
agricultural surveys ◦ the third channel of data collection in agricultural surveys,
in the case of failure to obtain a complete set of data via
CAII and CATI channels
social surveys ◦ direct interviews in households (first or second channel)
where such a way of proceeding results from adopted
methodology or
whose members has not expressed consent for a telephone
survey
8
• CAII - Computer Assisted
Internet Interview,
• CAPI - Computer Assisted
Personal Interview,
• CATI - Computer Assisted
Telephone Interviewing.
CAxI
9
CAXI
9
HTC Touch Pro2
Screen
◦ touch-screen
◦ size 3,6’’
◦ resolution 480 x 800 pixels
◦ sliding, tilting - convenient usage
sliding, 5-rows QWERTY keyboard
GSM/GPRS/EDGE/UMTS/HSPA
GPS module
camera - 3,2 MP
Windows Mobile® 6.5
11
CORstat system:
◦ controlling surveys flow between channels and
interviewers – to avoid double counting or omission;
◦ automated analysis process of interviewers’ workload;
◦ reasonable allocation of tasks for interviewers;
◦ monitoring and control of interviewers work in the field;
◦ monitoring the course of the survey including remedial,
emergency actions and creation of reports;
◦ first implemented in the last Population Census.
12
13
14
◦ as a direct source of census data
◦ for personalisation of questionnaires
◦ to create:
compilations of buildings, dwellings and
persons,
an address-residence register
a sampling frame
15
Data from administrative systems was used in the census:
16
Preliminary preparation of an administrative register1) Importing
2) Mapping
3) Simple deduplication
4) Denormalisation
Data transition• The processing of identification and address variables• The processing of substantive variables
Validation and adjustment
Integration
Complex deduplication
The selection of statistical variable value from many registers
Statistical Data Set
17
Portal
XML
TXT
Registry 1
Metadata server
OperationalMicrodata
Base
Registry 2
Registry nAnalitycalMicrodata
Base
ETL Tools
CAXI
XML
Files
Statistical
Files
Golden
Record
Metadata MetadataMetadata
SDMX
Questionaries
Integration with Census Frame and CAxI data,
Validation,
Correction,
Operational Imputation,
Transfer proper values to Golden Record,
18
Registers 1..n
CAxI
Golden Record
OMB Layers
AMB
In 2010 Census Round a combination of data coming from sample surveys and administrative registerscontaining spatial data (including x,y coordinates) was used for the first time
19
2010
Census
Round
registers
containing
spatial data
administrativ
e sources
Population
and Housing
Census 2011
•update in
municipalities
• WEB map service
• on-line editing
• visual pre-census
survey by
enumerators
20
Statistical address points
•address points for residential
buildings
Statistical distribution
boundaries
•statistical regions
•enumeration areas
21
Map module - GIS◦ Ortophotomap
◦ Cadastral Data
◦ Assigned Tasks
◦ Started Tasks
◦ Completed Tasks
2222
Questionnaire completeness analysis
Enumerator monitoring activity in the field– using
GPS/GIS technology
◦ Census Progress
◦ Localization and route
Emergency situation management
◦ Providing help for enumerators
Providing necessary information to enumerators
23
24
25
26
28
29
GIS technology is used not only at the stage of data
collection but also for:
conducting spatial analysis;
further dissemination of survey results via
Geostatistics Portal
http://geo.stat.gov.pl//
30
31
European Forum for
Geography and Statistics
represents a professional network of experts
cooperating within the framework of the ESS
to create a common geostatistical data
infrastructure and work out the best practices
in collecting, producing and disseminating
georeferenced statistics
Census in 2002
180 thousands of census enumerators
120 mln of questionnaires
1 000 tons of papers
At the end shredding
census questionnaires
Census 2011
18 thousands of census enumerators
0 questionnaires
0 tons of papers
ca. 50 mln € less
better data the more reliable results statistical surveys in the
future
33
Janusz Dygaszewicz
Central Statistical Office of Poland