Bridging the gap: Online job postings, survey data and the assessment of job vacancies in Canada
LMICIMT
371 views
35 slides
Jun 11, 2024
Slide 1 of 35
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
About This Presentation
OJP data from firms like Vicinity Jobs have emerged as a complement to traditional sources of labour demand data, such as the Job Vacancy and Wages Survey (JVWS). Ibrahim Abuallail, PhD Candidate, University of Ottawa, presented research relating to bias in OJPs and a proposed approach to effectivel...
OJP data from firms like Vicinity Jobs have emerged as a complement to traditional sources of labour demand data, such as the Job Vacancy and Wages Survey (JVWS). Ibrahim Abuallail, PhD Candidate, University of Ottawa, presented research relating to bias in OJPs and a proposed approach to effectively adjust OJP data to complement existing official data (such as from the JVWS) and improve the measurement of labour demand.
Size: 5.45 MB
Language: en
Added: Jun 11, 2024
Slides: 35 pages
Slide Content
Bridging the Gap: Online Job Postings, Survey Data and the Assessment of Job Vacancies Author & Presenter: Ibrahim Abuallail Special thanks to Brittany Feor & Sukriti Tehran Co-author: Anne-Lore Fraikin
Introduction & Motivation
Motivation Importance of Vacancy Data in Economics Pissarides (1986) Macro-labour economists studying business cycles ( Yashiv (2007)). Conventional Approach: Survey Data (JVWS)
Limitations of JVWS Data Time lag in reporting. Impractical to inform real-time policy decisions The sample size is less than 10% of all employers in Canada. Data suppression for confidentiality. Exclusion of federal, territorial, and provincial employees. This also includes a considerable number of management occupations that are not captured in the data
Data & Methodology
An Alternative Approach: Online Job Postings Turell et. al (2019) use a weighting function to improve online job posting data representativeness using UK data. Evans et. al (2023) use a winsorization algorithm to improve online job postings forecast errors (with respect to survey) using Australian data. Several studies have already begun leveraging online job postings data as a reliable source (see for example Marinescu (2017), Deming and Kahn (2018), and Goldfarb et. al (2020)). Related Literature
Vicinity Jobs Data Web scraping tool that collects online job postings from multiple website sources Contains over 103 million job postings for the period between 2018 and October 2023 Around half of the data sources are from actual employer websites Other third-party non-employer job posting websites may also be included; Ex: indeed and monster.com Most third-party data sources included are cost based (employers have to pay to post), or include high administrative costs
Vicinity Jobs Data Figure 1: Top 10 website sources for VJ data. Employer websites and government job boards comprise the majority of the data sources
Vicinity Jobs Data → Graph for survey Figure 2: Distribution of Broad Occupational Categories in Online Job Postings Data. Breakdown of NOC categories in → NOC 2016 Table 7
Limitations of Online Job Postings Data Potential for duplication Authenticity Job posting not for the purpose of hiring Postings by fraudsters, with job scams growing in recent years Employers could attempt to recruit multiple people using a single job posting (LMIC 2020). Online job postings also suffer from a lack of historical data, with the Vicinity Jobs dataset only starting from 2018 Type of Data (stock vs flow)
Flow vs Stock Variable Convert VJ flow variable to stock variable Research from the Josh Bersin Company 2023 Where d represents a day, and q is specified as the quarter of the JVWS survey and is equal to the 44 days preceding the end of the quarter.
Online Job Postings vs Survey Data Figure 4: Total Stock of Vacancies Comparison of VJ and JVWS
Granger Causality - Wald Test The null hypothesis ( H ): VJ t does not Granger cause JVWS t : The alternative hypothesis ( H 1 ): VJ t Granger causes JVWS t :
Granger Causality Test Results Granger Causality Test Model 1: JVWS ∼ Lags(JVWS, 1:2) + Lags(VJ, 1:2) Model 2: JVWS ∼ Lags(JVWS, 1:2) Res. Df Df F Pr (>F) 1 6605 2 6607 -2 29.514 1.743e-13 Table 1: Granger test results
Machine Learning Forecasts Figure 5: Comparison of model predictions
Goals Improve the bias in online job postings towards certain sectors Improve forecast-ability of online job postings data for job vacancies from the survey
Algorithms & Results
Weighting Function
Vacancy Stocks – Weighting Function Figure 6: Total Stock of Vacancies
Weighting Function - Errors Figure 7: Average Errors by Quarter
Winsorization Algorithm The first step would be to split the dataset by job posting sources, and identify sources j = 1, …, J with a certain cut off, k , for postings per source. Then we compute the percentage change between q − 1 and q for the online job postings per source:
Winsorization Algorithm 1 2 3 1 2
Winsorization Algorithm Figure 8: Average Source Errors by Quarter
Robust Regression with Huber Weights
Robust Regression with Huber Weights Figure 9: Average Errors by Quarter – Winsorized and Robust
Combined Approach The goal is to solve the bias / representation issue, while also improving forecast ability. Combine weights function with winsorization . Combine weights function with robust regression.
Combined Approach Figure 10: Comparison of combined algorithm errors with weighting function errors, and original errors.
Combined Approach Figure 11: Performance of Machine Learning Models on combined algorithm data.
Conclusion
Conclusion Vacancy data has important applications in Economics The conventional source of data has documented flaws, mainly a reporting lag The alternative being used, online job postings, has its own limitations and drawbacks
Conclusion Develop a new enhanced methodology to improve the forecast-ability of online job postings. The robust regression method has not been used in this context before. Develop a combined approach (robust method that is also weighted by survey). Using machine learning models to forecast survey data. Applying this analysis on Canadian labour market data. Our main contribution
Questions?
Appendix
NOC Version 2016 Figure 5: NOC 2016 Version 1.3 Data source: NOC, Government of Canada Back to VJ Data
JVWS Broad NOC Distribution Figure 13: Distribution of Broad Occupational Categories in Survey Data. Breakdown of NOC categories in → NOC 2016 Table → Back to VJ Data