Understand the demand for analyst opportunities in the U.S.
DESCRIPTION
The slides summarize an analysis of the demand pattern for analyst opportunities (such as data analyst and data science roles) in the U.S. In a nutshell, they answer four questions, covering the demand trend, demand source, degree requirements, and skill requirements, based on online job posting data.
TRANSCRIPT
Analytical Jobs Demand Analysis
Jason Zhang
Carnegie Mellon University
Final Presentation
Objective
Understand the demand for analytics jobs
a. What is the trend in demand for analysts?
b. Which locations/industries does the demand come from?
c. What degree is required to launch a career in analytics?
d. What skills are needed in analytics jobs?
Approach
Data Source
● First Hand Data: Web Scrape, Text Mining, Database (CouchDB, a NoSQL DB)
● Second Hand Data: Indeed Trend Search
Extracted Features
● Job Title
● Company Name
● Company Type
● Company Industry
● Job Location
● Key Skills (SQL, Python, etc.)
● Degree
Definition
What do we mean by “analyst”?
Analyst
Data Analyst (Engineer)
who is responsible for designing algorithms, software, or infrastructure that enable analysis
Match Keywords: data engineer, data scientist, machine learning, technical analyst, system analyst, security analyst, data modeler, data architect, etc.
Business Analyst (Business)
who is responsible for conducting decision-related analysis and communicating with clients
Match Keywords: business intelligence, business analyst, program manager, product manager, marketing, and other analysts not categorized as data analyst (engineer)
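The keyword matching described above could look roughly like the following. This is a minimal sketch, not the method used in the analysis: it assumes matching is done on the job title, case-insensitively, by substring. The keyword list on the slide ends with "etc.", so only the keywords actually listed are included here, and the function name `classify_title` is hypothetical.

```python
# Keywords copied from the slide; the slide's list is truncated ("etc."),
# so this tuple is incomplete by construction.
ENGINEER_KEYWORDS = (
    "data engineer", "data scientist", "machine learning",
    "technical analyst", "system analyst", "security analyst",
    "data modeler", "data architect",
)
BUSINESS_KEYWORDS = (
    "business intelligence", "business analyst",
    "program manager", "product manager", "marketing",
)


def classify_title(title):
    """Return the analyst category for a job title, or None if no match.

    Assumption: engineer keywords are checked first, because the slide
    defines the business category as including every other analyst
    "not categorized as data analyst (engineer)".
    """
    t = title.lower()
    if any(k in t for k in ENGINEER_KEYWORDS):
        return "Data Analyst (Engineer)"
    if any(k in t for k in BUSINESS_KEYWORDS) or "analyst" in t:
        return "Business Analyst (Business)"
    return None
```

For example, `classify_title("Senior Data Scientist")` falls into the engineer category, while a title such as "Financial Analyst" lands in the business category through the catch-all rule.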
Business Analyst (Business) postings are four times as numerous as Data Analyst (Engineer) postings
Data Analyst (Engineer): 4,000+
Business Analyst (Business): 16,000+
* Number of Active Job Posts on LinkedIn Given the Pre-selected Keywords
What is the trend in demand for analysts?
The “Analyst” position shifts titles over time
Titles charted: Data Scientist, Data Analyst, Business Analyst, Business Intelligence
Big Data Related Jobs Increased Steadily
Keywords charted: NoSQL, Hadoop, Big Data
What is the source (location/industry) of the demand?
Most western opportunities are located in CA, while eastern opportunities are evenly distributed
Business Analyst is more evenly distributed across four regions
Demand for Business Analysts and Data Analysts differs slightly across industries
Top 1 for Both Roles
* The industry each job belongs to comes from the LinkedIn company profile page
What is the degree requirement for the analyst position?
Data Analyst (Engineer) has a stronger preference for a Ph.D. than Business Analyst (Business)
65% of jobs state a degree requirement
% of Jobs Mentioning BS/MS/PhD
18% PhD Only
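One way to compute a "% of jobs mentioning BS/MS/PhD" figure is to search each post for degree terms. The slides do not say how degree mentions were detected, so the patterns below are assumptions and the function names are hypothetical. Abbreviations are matched case-sensitively to avoid hits on words like "ms", but phrases such as "MS Office" would still be counted as a master's mention.

```python
import re

# Assumed patterns; abbreviations are case-sensitive, spelled-out words are not.
DEGREE_PATTERNS = {
    "BS": re.compile(r"\bB\.?S\b|(?i:\bbachelor)"),
    "MS": re.compile(r"\bM\.?S\b|(?i:\bmaster(?:'s|s)?\b)"),
    "PhD": re.compile(r"(?i:\bph\.?\s?d\b|\bdoctorate\b)"),
}


def degrees_mentioned(text):
    """Return the set of degree labels whose pattern appears in the text."""
    return {name for name, pat in DEGREE_PATTERNS.items() if pat.search(text)}


def share_with_degree(posts):
    """Fraction of posts that mention at least one degree."""
    if not posts:
        return 0.0
    return sum(1 for p in posts if degrees_mentioned(p)) / len(posts)
```

Posts where `degrees_mentioned` returns exactly `{"PhD"}` would correspond to a "PhD only" bucket under this sketch.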
What skills do companies want from their analysts?
Data Analyst has more specific skill requirements than Business Analyst
# of Skills Required in the Job Post
Data Analyst (Engineer Side): 3.0
Business Analyst (Business Side): 1.3
* Among the extracted skill set, on average, a data analyst position requires 3.0 skills while a business analyst position requires 1.3 skills
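The average number of skills per post can be sketched as follows. The skill list is limited to the skills named elsewhere in these slides; the full extracted skill set is not given, and the matching rule (whole-word, case-insensitive except for "R") is an assumption. The function names are hypothetical.

```python
import re
from statistics import mean

# Skills named in the slides; the full extracted skill set is not listed there.
SKILLS = ("SQL", "Excel", "Python", "R", "SAS", "Java", "Hadoop",
          "Machine Learning", "Presentation")


def extract_skills(text):
    """Return the set of known skills that appear as whole words in the text."""
    found = set()
    for skill in SKILLS:
        # "R" is matched case-sensitively so a lowercase "r" is not a hit.
        flags = 0 if skill == "R" else re.IGNORECASE
        pattern = r"(?<!\w)" + re.escape(skill) + r"(?!\w)"
        if re.search(pattern, text, flags):
            found.add(skill)
    return found


def average_skill_count(posts):
    """Mean number of distinct known skills per post (0.0 for no posts)."""
    if not posts:
        return 0.0
    return mean(len(extract_skills(p)) for p in posts)
```

The whole-word rule keeps "JavaScript" from counting as "Java", though a token like "R&D" would still be read as "R".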
SQL is the most important skill for Analyst (Engineer), as Excel is for Analyst (Business).
SQL: High for Engineer
Excel: High for Business
Java, Hadoop, Python, R, SAS, Machine Learning: Moderate for Engineer
Presentation: Moderate for Business
What is the trend in demand for those high/moderate skills?
Skills charted: Python, R, Machine Learning, Hadoop, SAS, Java, Excel, Presentation, SQL
What about the trend for each skill?
Business, High: Excel
Business, Moderate: Presentation
Engineer, High: SQL
Engineer, Moderate: Python, R, ML, Hadoop, SAS, Java
Takeaway
For Staffing and Recruiting Companies:
1. Monitor skills instead of job titles
2. Focus on “Business Analysis” if the company wants to expand its business on the East Coast
3. Help candidates prepare for interviews by helping them strengthen the top required skills (SQL, Presentation)
Q & A
Thank you!
Appendix
Top 1 State - California vs. Top 1 City - New York
Top 5 States Top 5 Cities
Analysis Trend
The skill requirements for analysts are very diverse: no single skill satisfies more than half of the jobs
Top 20 Skills for Analytical Positions
No skill satisfies the requirements of more than 35% of jobs
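A coverage figure like this one can be computed as the share of posts that mention each skill. The sketch below assumes each post has already been reduced to a collection of skill names; the function name `skill_coverage` is hypothetical.

```python
from collections import Counter


def skill_coverage(posts_skills):
    """Map each skill to the fraction of posts mentioning it, most common first.

    posts_skills: a list with one collection of skill names per job post.
    """
    n = len(posts_skills)
    if n == 0:
        return {}
    # set() ensures a skill repeated within one post is counted once.
    counts = Counter(s for skills in posts_skills for s in set(skills))
    return {skill: count / n for skill, count in counts.most_common()}
```

Taking the first 20 entries of the result would give a "top 20 skills" ranking, and the largest value is the highest share of jobs any single skill covers.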
Limitations/Next Steps
Limitations
1. Single source: job data comes from only one source (LinkedIn)
2. Only June/July data could be crawled, so no time series analysis could be done with our own data
3. No way to detect job duplication (single job, multiple posts)
Next Steps
4. Scrape data from multiple sources (Indeed, Monster)
5. Run the script weekly to track job data over time
6. Calculate the similarity of two job posts to reduce duplication
7. Store the jobs in a MySQL database
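The similarity step proposed above could be sketched with Jaccard similarity over word tokens. The slides do not name a similarity measure, so this choice, the tokenizer, and the 0.8 threshold are all assumptions, and the function names are hypothetical.

```python
import re


def tokens(text):
    """Lowercased word tokens; '+' and '#' are kept so 'C++' and 'C#' survive."""
    return set(re.findall(r"[a-z0-9+#]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two texts: shared tokens over all tokens."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 1.0  # two empty texts are treated as identical
    return len(ta & tb) / len(ta | tb)


def is_duplicate(a, b, threshold=0.8):
    """Flag two posts as likely duplicates (threshold is an assumed value)."""
    return jaccard(a, b) >= threshold
```

Comparing every pair of posts is quadratic in the number of posts, so a larger crawl would likely need grouping first, for example by company name and job location.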